# Nutrition surveillance using a small open cohort: experience from Burkina Faso

- Mathias Altmann
^{1}Email author, - Christophe Fermanian
^{2}, - Boshen Jiao
^{3}, - Chiara Altare
^{1}, - Martin Loada
^{4}and - Mark Myatt
^{5}

**13**:12

**DOI: **10.1186/s12982-016-0052-0

© The Author(s) 2016

**Received: **23 March 2016

**Accepted: **19 October 2016

**Published: **15 November 2016

## Abstract

### Background

Nutritional surveillance remains generally weak and early warning systems are needed in areas with high burden of acute under-nutrition. In order to enhance insight into nutritional surveillance, a community-based sentinel sites approach, known as the Listening Posts (LP) Project, was piloted in Burkina Faso by Action Contre la Faim (ACF). This paper presents ACF’s experience with the LP approach and investigates potential selection and observational biases.

### Methods

Six primary sampling units (PSUs) were selected in each livelihood zone using the centric systematic area sampling methodology. In each PSU, 22 children aged between 6 and 24 months were selected by proximity sampling. The prevalence of GAM for each month from January 2011 to December 2013 was estimated using a Bayesian normal–normal conjugate analysis followed by PROBIT estimation. To validate the LP approach in detecting changes over time, the time trends of MUAC from LP and from five cross-sectional surveys were modelled using polynomial regression and compared by using a Wald test. The differences between prevalence estimates from the two data sources were used to assess selection and observational biases.

### Results

The 95 % credible interval around GAM prevalence estimates using LP approach ranged between +6.5 %/−6.0 % on a prevalence of 36.1 % and +3.5 %/−2.9 % on a prevalence of 10.8 %. LP and cross-sectional surveys time trend models were well correlated (p = 0.6337). Although LP showed a slight but significant trend for GAM to decrease over time at a rate of −0.26 %/visit, the prevalence estimates from the two data sources showed good agreement over a 3-year period.

### Conclusions

The LP methodology has proved to be valid in following trends of GAM prevalence for a period of 3 years without selection bias. However, a slight observational bias was observed, requiring a periodical reselection of the sentinel sites. This kind of surveillance project is suited to use in areas with high burden of acute under-nutrition where early warning systems are strongly needed. Advocacy is necessary to develop sustainable nutrition surveillance system and to support the use of surveillance data in guiding nutritional programs.

### Keywords

Nutrition Surveillance Burkina Faso Selection bias Observational bias Humanitarian## Background

*Nutrition surveillance* means “to watch over nutrition in order to make decisions that lead to improvements in nutrition in populations” [1]. Nutrition surveillance refers to a continuous process and focuses on monitoring trends over time, rather than providing one-time estimates of (e.g.) absolute levels of the prevalence of malnutrition, in order to identify and respond to crises in a timely manner [2].

Although they have been recognized as an important component in fighting malnutrition, nutritional surveillance systems remain weak in most developing countries [3]. Reasons for this include (1) no common agreement on the best methods to implement nutrition surveillance, (2) a lack of confidence in surveillance data, and (3) little comparable data on the costs of different potentially effective systems that would justify investments in such a system [2, 4]. It is, therefore, essential for practitioners to share experiences regarding nutritional surveillance in order to provide insights into what works and what does not work in the field.

Nutritional surveillance data tend to come from two main sources: administrative (e.g. health facility/feeding centre caseloads and schools health services reports) and repeated probability sample household surveys [2, 5]. Limitations of administrative data are well known. There may be a selection bias due to incomplete distribution of facilities and populations that are covered by programs contributing data [6]. Even when the facilities are well running, only people with better access may attend clinics or nutrition program sites, thus underestimating the true prevalence/incidence of the condition of interest. Furthermore, unless active case-finding is used, beneficiaries may tend to come to the facilities only when the disease is severe. This means that indicators may lag behind incidence, making surveillance data inappropriate for an early warning system. The second data source, repeated probability sample household surveys, is the most commonly used approach to nutrition surveillance [5, 7, 8]. Surveys provide a representative picture of the situation at a given time and allow comparisons over time and between geographical areas. However, unless they are repeated frequently enough, surveys may miss seasonal effect and cannot provide timely information on changes over time [9].

Less attention has been given to the community-based sentinel sites approach to nutrition surveillance. These surveillance systems are characterised by the selection of a small sample of communities from which a set of information is collected regularly. There are two main criticisms to the sentinel approach. First, the purposive sampling of selected sites according to predefined criteria (e.g. the most “vulnerable” settlements) results into non-representative estimates (likely overestimates) [4]. Second, an observational effect that acts to reduce prevalence over time as the selected sites tend to be progressively positively affected by the inputs of the survey teams (e.g. giving education, advice and counselling, referral of cases for treatment, and treating illness) [10]. It is not clear, however, that a statistically representative sample, as might be used in a population survey, is an essential attribute of a surveillance system. It may, for example, be more useful to select and watch over communities that are vulnerable to shocks so as to detect potential crises early in their development. Experiences of sentinel sites nutrition surveillance have been reported from Sudan [11] and the Central African Republic [12].

This paper presents the experience of the international non-governmental organization Action Contre la Faim (ACF) with nutrition surveillance using a community-based sentinel sites approach, known as the Listening Posts (LP) project. We established a surveillance system in order to estimate nutritional and food security needs and to identify when and where these needs were highest. The system was set up to describe patterns over time, and also to provide accurate estimates of the point prevalence of acute undernutrition and to provide predictions of the caseloads. In this paper, we report and examine our experience in Burkina Faso in order to assess the reliability and validity of the LP method compared to repeated cross-sectional surveys in terms of selection and observation biases.

## Methods

### Selection of livelihood zones

Criteria for selecting the setting were as follow: existence of a programme implemented by ACF; availability of sufficient capacity to conduct surveillance; nationally and locally weak nutrition information systems in the government sector; no other sentinel surveillance system in place in the government sector; involvement of the government in the selection of the livelihood zone (LHZ); and “vulnerability” of LHZ on the basis of an Household Economy Approach (HEA) food security assessment [13]. The LHZ was defined as a geographical area where people share broadly the same patterns of access to food and income, and have the same access to local markets.

### Sample size calculation

The sample size was calculated taking both accuracy and costs into account, keeping in mind that low cost is an important factor for the sustainability of surveillance systems. The following aspects were included in the calculation of the sample size: (1) age range was reduced to 6–24 months. Besides the fact that this reduced the size of the universe (small population), this age group is most vulnerable to acute malnutrition; (2) semi-longitudinal design: the use of an “open cohort” (see “top-up replacement and referral” below) decreased the required sample size through reduction of between round sampling variation compared to taking a new sample at each round; (3) estimation of GAM was done using a Bayesian normal–normal conjugate analysis with an objective prior followed by a PROBIT analysis [16–19]. The Bayesian conjugate analysis was used because the prior contains information that contributes “pseudo-observations” to the conjugate analysis. This means that the Bayesian conjugate analysis will have a larger effective sample size than a frequentist analysis of similar sample size provided that there is no gross conflict between the prior and the likelihood (i.e. observed) data. A larger effective sample size translates to improved precision of estimates [19, 20]. The Bayesian normal–normal conjugate analysis yields posterior estimates of the mean and standard deviation (SD) which are the inputs required by the inverse cumulative distribution function used in the PROBIT estimator. The PROBIT estimator retains information about scale and variability that is lost by the classical approach when the data are coded to a case/not case binary variable. This retained information allows the PROBIT approach to return estimates with improved precision compared to the classical (i.e. case counting) approach [17, 18], making the method well suited to work with small samples. With these conditions, and using computer-based simulations using data derived from cross-sectional surveys, we calculated that a sample size of *n* = 96 from each LHZ could be expected to yield a 95 % credible interval (CI) of ±10 % or better at any level of prevalence. A sample size of *n* = 132 children was selected in order to ensure that useful precision is achieved.

### Sampling and eligibility criteria

A two-stage cluster sample of children was taken from each selected LHZ. Six primary sampling units (PSUs), also called “Listening Posts”, were selected using the centric systematic area sampling (CSAS) methodology, by which the sample selected was reasonably evenly distributed across the survey area. This type of sample provides implicit stratification by spreading the sample properly among sub-groups of the population such as rural, urban, peri-urban populations, administrative areas, ethnic sub-populations, religious sub-populations, and socio-economic groups [21–25]. This tends to improve precision of survey estimates from survey data. In the second sampling stage, we selected 22 children from each Listening Post (PSUs) using the Expanded Program on Immunization (EPI) household sampling scheme: the first household was selected by choosing a random direction from the centre of the community, counting the houses along that route, and picking one at random, and the sampling was continued by choosing the household nearest to the preceding one that included an eligible child [26]. All children aged between 6 and 24 months in selected households were included in the sample. Since a child falling into such a narrow age range would not be found in every household, the sample was spread widely across the PSU community [26]. This procedure provided the same advantage as implicit stratification by ensuring that all parts of the PSU were sampled.

### Top-up replacement and referral

When a child reached his or her second birthday, they were replaced by another child aged between 6 and 9 months not already in the cohort (the “top-up sample”) from the nearest household with an eligible child. Before being replaced, nutritional measurements were done and the survey questionnaire administered. A dead or lost-to-follow up (e.g. moved away) child was replaced by another child not already in the cohort and of similar age from the nearest household with an eligible child. These procedures ensured that the age structure of the cohort remained constant between surveillance rounds so that prevalence estimates would not be influenced by aging of the surveillance cohort. All children with a mid-upper arm circumference (MUAC) below 125 mm, as well as sick children, were referred to the nearest health centre.

### Data collection

Two interviewers for the three LHZ were trained to perform the sampling protocol, the required anthropometric measurements, and apply the survey questionnaire. Regular supervisions were conducted to ensure anthropometric measurements were done correctly. Each interviewer visited one Listening Post (PSU) per day, and performed interviews of mothers and measurements (weight and MUAC) of 22 children and top-up sampling when this was required. Data were collected during the first 2 weeks of each month. In order to avoid an interviewer bias, monthly rotations were organized in the visited LP between the two interviewers. 36 monthly rounds of data collection were performed and are included in the analysis presented here. Anthropometric measurement (weight and MUAC), morbidity (prevalence of diarrhoea in the last 15 days), infant and young child feeding (IYCF) practices, food security, and water, sanitation and hygiene (WASH) indicators were collected using a paper based questionnaire. In this article, we will concentrate on the prevalence of global acute malnutrition (GAM), defined as MUAC <125 mm, which is recognized as a sensible indicator to capture variations of the nutrition status at community level [27]. One supervisor prepared the planning of the interviewer, the questionnaires, checked for missing data and validated the data for analysis. Data was entered into an Excel spread sheet, together with quality assurance mechanisms such as cross-field consistency checks, legal value, and range checks.

### Data analysis

^{2}, respectively). Using these parameters, the equation used to estimate the mean MUAC was:

*n*is the number of children in the sample and \(\bar{x}\) is the mean MUAC observed in the survey.

We used these values (i.e. the posterior mean and variance) to find the 95 % credible interval for the posterior mean using the inverse cumulative normal distribution (NORMINV) function in Microsoft Excel. We translated the posterior mean and the associated 95 % credible interval into a proportion (i.e. prevalence) using the cumulative normal distribution (NORMDIST) function in Microsoft Excel using a threshold value of 125 mm. By using this function, we estimated the probability that a child selected at random from the monitored population would have a MUAC below 125 mm (which equates to prevalence).

Data were analysed by the program manager and validated by the ACF Department for Food and Livelihoods. For the purpose of surveillance, data were analysed on a monthly basis using means and estimated proportions of the different collected variables. When plotting the results as a time series, the effect of sampling variation was reduced by applying a simple low-pass filter (i.e. smoothing using a 3 months moving average).

### Comparison between LP and cross-sectional surveys

The external validity of the estimates from the LP survey was assessed by comparing the time trend and variability of LP data with that from other data sources. In Burkina Faso, available data included nutritional cross-sectional surveys using the Standardized Monitoring and Assessment of Relief and Transitions (SMART) methodology [29], conducted by ACF in Tapoa Province in March 2011, May 2012, December 2012, May 2013 and October 2013 (data not published but available upon request). In order to have the same age group as used for the LP surveillance system, only 6–24 months aged children were kept from the cross-sectional surveys.

*T*), squared time (

*T*

^{2}) and cubed time (

*T*

^{3}) were entered into the model as independent variables. This led to the following polynomial model:

*Y*is the continuous MUAC of the individuals assessed at time

*T*,

*β*

_{0}is the intercept of the model,

*β*

_{1},

*β*

_{2}and

*β*

_{3}are the coefficients of respective terms

*T*,

*T*

^{2}and

*T*

^{3}.

*W*) with the value one when the corresponding data were from LP survey and zero when from cross-sectional survey. This led to the following model:

*Y*is the continuous MUAC of the individuals from both LP and cross-sectional data assessed at time

*T*. \(\beta_{0}^{\prime }\) is the intercept of model, \(\beta_{1}^{\prime } ,\beta_{2}^{\prime }\) and \(\beta_{3}^{\prime }\) are the coefficients of respective terms T, T

^{2}and

*T*

^{3}, and

*W*is the new variable mentioned above. Then we performed a Wald test to determine whether this new variable made a statistically significant improvement to the global regression model and to determine whether the coefficients in the LP and cross-sectional surveys models were significantly different from each other.

*Y*is the absolute differences (D) in prevalence estimates between LP and cross-sectional surveys assessed at time \(T^{\prime \prime } ,\beta_{0}^{\prime \prime }\) is the intercept of regression line and \(\beta_{1}^{\prime \prime }\) is the coefficient associated with time. The time at which differences would become significant was estimated as the point where this line crossed the upper confidence limit for the mean.

## Results

### Cohort profile

Characteristics of Listening Posts and cross-sectional surveys data sources, Tapoa Province, Burkina Faso, 2011–2013

Characteristics/livelihood zone | Nord | Centre | South | All 3 LHZ | Cross-sectional surveys (3 LHZ) |
---|---|---|---|---|---|

Number of rounds, n | 36 | 36 | 36 | 36 | 5 |

Sample size (number of children) median (IQR) | 119 (116–132) | 139 (135–144) | 162 (145–164) | 418 (410–423) | 287 (275–367) |

Mean age (months) median (IQR) | 15.6 (15.2–15.9) | 16.0 (15.4–16.2) | 16.0 (15.9–16.2) | 15.9 (15.4–16.2) | 14.7 (14.4–15.1) |

Mean MUAC (mm) median (IQR) | 133.4 (132.0–134.0) | 133.3 (131.8–134.4) | 134.6 (132.6–135.4) | 133.8 (132.3–134.7) | 133.2 (132.9–133.6) |

SD (mm), median (IQR) | 9.98 (9.42–10.60) | 9.11 (8.66–9.67) | 9.68 (9.01–10.23) | 9.60 (9.19–10.04) | 10.73 (10.43–10.77) |

DEFF, median (IQR) | 0.97 (0.76–1.31) | 0.96 (0.80–1.25) | 1.22 (1.07–1.46) | 1.18 (1.09–1.31) | 1.06 (1.04–1.14) |

Design effects were close to 1 over the study period and in all LHZs. The mean monthly replacement rate of the children in the cohort was 8.5 % (SD = 0.28 %) which is above the expected value due to aging of the cohort (1/18 = 5.6 %). The difference was explained by losses to follow up, defined as the absence of child at 3 consecutive visits before reaching their second birthday, which had a mean value of 2.8 % (SD = 0.17 %).

### Global acute malnutrition

*n*= 227 and

*n*= 362 respectively. Using

*n*= 132 proved therefore to be suitable for the LP sample design and analysis plan. The GAM prevalence estimates, based on mean MUAC, did not differ between the three LHZ’s (Fig. 3).

In the three LHZ of Tapoa province, GAM prevalence followed a seasonal variation with an increase during the dry season (January to May) and a decrease during the rainy season (June to September). Over the 3 years, GAM decreased during 2011, increased till the middle of 2013, and then decreased again during the last months of 2013 (Fig. 3).

### External validity

Characteristics of collected data in pooled LP and pooled cross-sectional surveys are displayed in Table 1. While mean age was slightly higher in LP than in cross-sectional surveys, mean MUAC was slightly lower in cross-sectional surveys. The SD was lower for LP than cross-sectional surveys, with a median of 9.60 and 10.73 mm, respectively. Design effect had a median of 1.18 and 1.06, for LP and cross-sectional surveys, respectively.

Regression coefficients, confidence intervals and p value taking continuous MUAC as dependent variable, Tapoa province, Burkina Faso, 2011–2013

Variables | Coefficient | (95 % CI) | p value |
---|---|---|---|

LP model | |||

Time | 1.1597 | (0.9897, 1.3296) | <0.001 |

Squared time | −0.0653 | (−0.0757, −0.0549) | <0.001 |

Cubed time | 0.0011 | (0.0009, 0.0012) | <0.001 |

Intercept | 128.02 | (127.27, 128.76) | <0.001 |

Cross-sectional surveys model | |||

Time | 1.2149 | (0.3633, 2.0661) | 0.005 |

Squared time | −0.0773 | (−0.1328, −0.0219) | 0.006 |

Cubed time | 0.0013 | (0.0003, 0.0023) | 0.008 |

Intercept | 129.87 | (127.14, 132.60) | <0.001 |

Prevalence (by mean MUAC) and its 95 % Credible Interval of Global Acute Malnutrition among 6–24 months aged children, by Listening Posts and cross-sectional surveys, Tapoa province, Burkina Faso, 2011–2013

Time points | Listening post surveys | Cross-sectional surveys | ||||
---|---|---|---|---|---|---|

Mean MUAC (SD) in mm | Prevalence (%) | (95 % CI) | Mean MUAC (SD) in mm | Prevalence (%) | (95 % CI) | |

March 2011 | 130.6 (10.7) | 30.12 | (26.87, 33.52) | 133.0 (11.7) | 23.61 | (20.21, 27.32) |

May 2012 | 133.7 (10.2) | 21.44 | (18.69, 24.41) | 134.8 (10.1) | 19.08 | (16.04, 22.46) |

December 2012 | 132.9 (9.1) | 23.01 | (20.28, 26.12) | 133.1 (10.7) | 22.66 | (19.11, 26.55) |

May 2013 | 131.8 (9.6) | 27.13 | (24.03, 30.42) | 132.2 (12.0) | 25.40 | (22.52, 28.47) |

October 2013 | 135.8 (9.8) | 18.41 | (15.95, 21.08) | 133.6 (10.4) | 21.68 | (18.80, 24.80) |

Extrapolation of the line crossed the upper border of the confidence interval of the mean at time t = 45.96 months, indicating that the difference in GAM prevalence would probably become significant 3 years and 10 months after the beginning of the surveillance.

## Discussion

This paper presents the methodology for nutrition surveillance that was tested by ACF in a rural area of Burkina Faso, using a small open cohort in community-based sentinel sites. It provides detailed information on how to implement it and assess the reliability of its data in term of selection and observational biases. This methodology has been shown to be valid in following trends for a period of 3 years without selection bias.

Criticisms on sentinel system address the representativeness of the sample, including selection and observational biases. In our study, we used CSAS sampling rather than a purposive sampling technique. CSAS sampling has been shown to approximate random sampling [21] and is thereby representative of the survey area. Although mean age of the cohort remained constant over time, we cannot exclude that a selection bias might have been introduced when lost to follow up children were replaced at each round. Dataset comparisons between LP and repeated cross-sectional surveys did not detect any statistical difference in estimating mean MUAC over a 3-year period in our setting. Post-hoc power analysis showed that for these comparisons, the working power was between 79 and 97 %, thus the corresponding tests were not underpowered. Moreover, the use of the *Bayesian*-*PROBIT* approach to estimate GAM has increased the precision of prevalence estimate compared to the use of a frequentist analysis and classical estimator. Using a reduced age-group (i.e. <2 years old) proved to be also interesting in term of reducing the design effect close to 1, probably by spreading the sample in each community and hence, increasing the heterogeneity in each cluster.

A higher decrease in GAM prevalence (0.26 % per visit) was observed in LP compared to cross-sectional surveys. It is probable that in the field, information provided to families by our staff and referral of malnourished children have improved the nutritional status of the children within these sentinel villages. Likewise, the observational bias reported in a recent study piloted in Northern Nigeria by Médecins sans Frontières (MSF) was a progressive deviation of the nutritional status of the sentinel site from that of the wider community it is presumed to represent [10]. In the MSF study, the prevalence of GAM decreased by 1.6 % (95 % CI 0.4–2.7 %; p = 0.012) relative to the prevalence observed during the previous visit. The smaller observational bias in our study might be partially explained by the normal–normal conjugate analysis that uses the same prior for both data sources (see limitations). Other explanations might include program coverage as well as impact of messages and referral in these two contexts. The quality of the surveillance system improved over time which might explain why LP and cross-sectional survey estimates are more different at the beginning (i.e. March 2011) than in 2013. Globally we conclude that in the course of the LP surveys, there was a slight observational bias (defined as the difference in GAM prevalence between the two data sources) that would probably become statistically significant after a 3 year and 10 months period. It would seem reasonable to take a new sample of sentinel sites every 3 years.

The thresholds for GAM by MUAC are internationally agreed and apply to any child in the 6–59 months of age group. However, as surveillance systems do not focus on single estimates but rather look at trends, there were difficulties in interpreting the curves during the first year, where no comparison was available. For the following years, the historical limit approach was applied. However, there is no clear guidance on how to use this approach in the nutrition sector, as it is more commonly used for infectious diseases [31].

While the system is still ongoing and has even been extended to Gnagna province, questions remain regarding its sustainability. The same system was set up in Montserrado, Liberia, and was stopped after 21 months. Reasons for stopping were: (1) low prevalence of GAM; (2) weak connection with ACF programs; (3) no possible integration of this methodology into the national system; (4) the challenging issue of urban sampling and the fact that few data were available to assess the validity of our results in greater Monrovia. In Burkina Faso, the Ministry of Agriculture (MoA) was first interested to use this methodology to develop an early warning system (EWS) for food and nutrition. However, its capacities remain limited and there is actually no fund for a EWR. Furthermore, they are still doubts that MoA is responsible to collect nutritional data. Alternatively, the system could be extended to other areas through other partners. Suggestions to improve the uptake include more regular workshops, set up of a dedicated website and a better design of the newsletter.

The LP system provides monthly information that allows prompt interventions if needed. Alternative methodologies, such as cross-sectional surveys, are usually not frequent enough for early warning systems. Furthermore, cost of the LP system is limited with around 50,000 dollars per year for the 3 LHZ. In comparison, costs of cross-sectional surveys are around 20,000 dollars per survey. Finally, its set up is easier than repeated cross-sectional surveys. We think the LP methodology is adapted to areas with high burden of acute undernutrition and poor access to feeding centres that require continuous monitoring and an early warning system.

There were some limitations in our study. It would have been better if the number and timing of the cross-sectional surveys were the same as surveillance rounds. It would have allowed us to better compare trends from LP and repeated-cross sectional data over shorter time periods. Furthermore, in order to compare with cross-sectional surveys’ area, we had to pool 3 LHZs, increasing the sample size. This has limited our ability to validate MUAC trends for each LHZ. However, it has improved the precision of the LP estimates, increasing our power. The use of the Bayesian normal–normal conjugate analysis may have biased the posterior estimate towards the prior mode, which was the same for the surveillance data and the cross-sectional survey data. We use the same prior for all applications. The prior was, however, tested with computer based simulations using data derived from cross-sectional surveys; we found that it did not introduce a bias even when used with relatively small likelihood survey sample sizes such as n = 96 and n = 132. Recent literature [17, 18] has identified small systematic biases in PROBIT estimates in given populations. This is problematic when estimating prevalence in one-shot cross-sectional surveys. Systematic bias is less important for surveillance systems. Surveillance systems accept bias and attempt to keep it systematic (i.e. consistent) over time in order to observe patterns of change. To detect change, precision (reliability, repeatability) is a more important issue than accuracy (bias) in surveillance applications. For a given sample size, the PROBIT method has greater precision than the more “conventional” methods [17, 18]. The Bayesian approach contributes some additional information (i.e. the prior) which also improves precision. The cohort approach also reduces between-round sampling variation. The use of the Bayesian PROBIT estimator and an open cohort in the LP method reflects the importance of precision in surveillance applications. Other methods, such as conventional prevalence method, can be used to analyze the data as a 6 × 22 cluster sample. They won’t, however, provide the same precision as the Bayesian PROBIT estimator.

Finally, our study was not designed and did not aim to assess the Bayesian-PROBIT methodology in estimating GAM prevalence in comparison to SMART survey methodology. Further studies may be needed to validate this methodology in estimating GAM prevalence in different contexts.

## Conclusions

This paper presented a methodology for nutrition surveillance using a small open cohort from community-based sentinel sites in a rural setting of Burkina Faso. This methodology has proved to be valid in following trends of GAM prevalence over a 3 years period without selection bias; conversely, a slight but significant observational bias was detected, requiring periodical reselection of sentinel sites. We recommend this approach for areas with high burden of acute under-nutrition and poor access to feeding centres where early warning systems are strongly needed. Advocacy is necessary to develop sustainable nutrition surveillance system and to support the use of surveillance data in guiding nutritional programs.

## Declarations

### Authors’ contributions

MA coordinated the research and drafted the manuscript. CF and BJ undertook the statistical analyses. ML coordinated the field data collection. MM and CA critically revised the manuscript. All authors read and approved the final manuscript.

### Acknowledgements

The history of this project and its methodology came first from a discussion between Rebecca Brown (ACF Nutrition senior advisor at that time), Claudine Prudhon (Epidemiologist at the World Health Organization) and Frances Mason (Save the Children France, SCF). The methodology was then developed by SCF (Anna Taylor, Alex Rees, Brian Kriz, Abigail Perry, Mickael Manske) and ACF (Rebecca Brown, Hélène Deret) with the support of Brixton Health (Mark Myatt). We are very thankful to all these people. We also thank the field teams of ACF in Burkina Faso, in particular the Listening Post team for gathering data and the ACF coordination team, Thomas Loreaux (Head of Mission) and Claire Gaillardou (Head of Department Disaster Risk Management) for supporting this project. We are also thankful to Chloe Milloz for her operational expertise and for reviewing. We wish to thank the village authorities and all the families from investigated villages for their participation in this project.

### Competing interests

The authors declare that they have no competing interests.

### Availability of data and materials

The data supporting our findings are available by asking directly to the corresponding author.

### Ethics approval and consent to participate

Ethical approval was obtained from the Institutional Review Board of the Ministry of Health in Burkina Faso (no 2010-028). A Memorandum of Understanding was also signed with the Ministry of Agriculture and its regional office concerning the Listening Post project and ACF support in strengthening the early warning system in Tapoa province. Data from nutritional cross-sectional surveys were collected with the agreement of the Ministry of Health of Burkina Faso. We followed the Declaration of Helsinki, aiming to provide assurance that the rights, integrity, and confidentiality of participants were protected. We obtained oral consent from participants or their parents or guardians. Participation was voluntary. We ensured privacy and confidentiality in the data collected from the participants both during and after the conduct of the study. We entered and analyzed all information anonymously and anonymised findings were shared with all stakeholders in Burkina Faso.

### Funding

This study was funded by Action Contre la Faim (ACF). ACF collected the data, designed the study protocol, contributed to the interpretation of the data and preparation of the manuscript.

**Open Access**This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

## Authors’ Affiliations

## References

- Mason JB, Mitchell JT. Nutritional surveillance. Bull World Health Organ. 1983;61:745–55.PubMedPubMed CentralGoogle Scholar
- McKinney P. Module 10: nutrition information and surveillance systems, version 2. Emerg Nutr Netw. 2014. http://files.ennonline.net/attachments/2018/HTP-v2-module-10-technical-notes.pdf
- Jerome NW, Ricci JA. Food and nutrition surveillance: an international overview. Am J Clin Nutr. 1997;65:1198S–202S.PubMedGoogle Scholar
- Bilukha O, Prudhon C, Moloney G, Hailey P, Doledec D. Measuring anthropometric indicators through nutrition surveillance in humanitarian settings: options, issues, and ways forward. Food Nutr Bull. 2012;33:169–76.View ArticlePubMedGoogle Scholar
- Friedman G. Review of national nutrition surveillance systems. Food and Nutrition Technical Assistance III Project (2014).
- Nnyepi M, Gobotswang KSM, Codjia P. Comparison of estimates of malnutrition in children aged 0–5 years between clinic-based nutrition surveillance and national surveys. J Public Health Policy. 2011;32:281–92. doi:10.1057/jphp.2011.32.View ArticlePubMedGoogle Scholar
- Action Contre la Faim (ACF), The United Nations Children’s Fund (UNICEF). Nutrition surveillance system in Karamoja region, Final report number 002. Uganda Ministry of Health Knowledge Management Portal 2010. http://library.health.go.ug/publications/service-delivery-child-health/food-and-nutrition/nutrition-surveillance-system-karamoja.
- Action contre la faim (ACF). Mathare sentinel surveillance report 2009. http://www.actionagainsthunger.org/sites/default/files/publications/ACF-INT-Kenya-Nairobi-Mathare-2009-07-EN.pdf.
- Chotard S, Mason JB, Oliphant NP, Mebrahtu S, Hailey P. Fluctuations in wasting in vulnerable child populations in the Greater Horn of Africa. Food Nutr Bull. 2010;31:S219–33.View ArticlePubMedGoogle Scholar
- Grellety E, Luquero FJ, Mambula C, Adamu HH, Elder G, Porten K. Observational bias during nutrition surveillance: results of a mixed longitudinal and cross-sectional data collection system in Northern Nigeria. PLoS ONE. 2013;8:e62767. doi:10.1371/journal.pone.0062767.View ArticlePubMedPubMed CentralGoogle Scholar
- The United Nations Children’s Fund, Sudan Federal Ministry of Health, the State Ministries of Health in North, South and West Darfur, UNICEF Sudan Nutrition Section. Darfur nutrition update, Summary Issue 30: covering October, November and December 2011; 2011.
- Caleo GM, Sy AP, Balandine S, Polonsky J, Palma PP, Grais RF, et al. Sentinel site community surveillance of mortality and nutritional status in southwestern Central African Republic, 2010. Popul Health Metr. 2012;10:18. doi:10.1186/1478-7954-10-18.View ArticlePubMedPubMed CentralGoogle Scholar
- Save the Children. About the household economy approach: tools for food security and nutrition analysis n.d. http://www.heawebsite.org/about-household-economy-approach. Accessed 18 Feb 2016.
- Ministère de l’économie et des finances - Institut national de la statistique et de la démographie. Annuaire statistique du Burkina Faso 2011. http://www.insd.bf/n/contenu/pub_periodiques/annuaires_stat/Annuaires_stat_nationaux_BF/Annuaire_stat_2011.pdf. Accessed 3 Feb 2016.
- Ministry of Health. National Nutritional Survey in Burkina Faso 2011. http://www.unicef.org/bfa/french/enquete-nutritionnelle_nationale_du_burkina_faso_2011.pdf. Accessed 3 Feb 2016.
- World Health Organisation (WHO). Physical status: the use and interpretation of anthropometry, vol 854. Report of a WHO expert committee. World Health Organisation. 1995. p. 1–452.
- Dale NM, Myatt M, Prudhon C, Briend A. Assessment of the PROBIT approach for estimating the prevalence of global, moderate and severe acute malnutrition from population surveys. Public Health Nutr. 2013;16:858–63. doi:10.1017/S1368980012003345.View ArticlePubMedGoogle Scholar
- Blanton CJ, Bilukha OO. The PROBIT approach in estimating the prevalence of wasting: revisiting bias and precision. Emerg Themes Epidemiol. 2013;10:8. doi:10.1186/1742-7622-10-8.View ArticlePubMedPubMed CentralGoogle Scholar
- Raiffa H, Schlaifer R. Applied statistical decision theory. Cambridge: Division of Research, Graduate School of Business Administration, Harvard University; 1961.Google Scholar
- Western B, Jackman S. Bayesian inference for comparative research. Am Polit Sci Rev. 1994;88:412–23. doi:10.2307/2944713.View ArticleGoogle Scholar
- Milne A. The centric systematic area-sample treated as a random sample. Biometrics. 1959;15:270–97. doi:10.2307/2527674.View ArticleGoogle Scholar
- Myatt M, Feleke T, Sadler K, Collins S. A field trial of a survey method for estimating the coverage of selective feeding programmes. Bull World Health Organ. 2005;83:20–6.PubMedPubMed CentralGoogle Scholar
- Myatt M, Mai NP, Quynh NQ, Nga NH, Tai HH, Long NH, et al. Using lot quality-assurance sampling and area sampling to identify priority areas for trachoma control: Viet Nam. Bull World Health Organ. 2005;83:756–63.PubMedPubMed CentralGoogle Scholar
- Sadler K, Myatt M, Feleke T, Collins S. A comparison of the programme coverage of two therapeutic feeding interventions implemented in neighbouring districts of Malawi. Public Health Nutr. 2007;10:907–13. doi:10.1017/S1368980007711035.View ArticlePubMedGoogle Scholar
- Thompson, SK. Sampling. 3rd edn. Hoboken, New Jersey: Wiley; 2012.
- Bennett S, Radalowicz A, Vella V, Tomkins A. A computer simulation of household sampling schemes for health surveys in developing countries. Int J Epidemiol. 1994;23:1282–91.View ArticlePubMedGoogle Scholar
- Briend A, Hasan KZ, Aziz KM, Hoque BA, Henry FJ. Measuring change in nutritional status: a comparison of different anthropometric indices and the sample sizes required. Eur J Clin Nutr. 1989;43:769–78.PubMedGoogle Scholar
- World Health Organization (WHO). The WHO Multicentre Growth Reference Study (MGRS) n.d. http://www.who.int/childgrowth/mgrs/en/. Accessed 9 Jan 2014.
- Action Contre la Faim. Measuring mortality, nutritional status and food security in crisis situations: the smart protocol 2006. http://smartmethodology.org/survey-planning-tools/smart-methodology/.
- Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1:307–10.View ArticlePubMedGoogle Scholar
- Hutwagner L, Browne T, Seeman GM, Fleischauer AT. Comparing aberration detection methods with simulated data. Emerg Infect Dis. 2005;11:314–6. doi:10.3201/eid1102.040587.View ArticlePubMedPubMed CentralGoogle Scholar