Skip to main content

Diagram-based Analysis of Causal Systems (DACS): elucidating inter-relationships between determinants of acute lower respiratory infections among children in sub-Saharan Africa



Effective interventions require evidence on how individual causal pathways jointly determine disease. Based on the concept of systems epidemiology, this paper develops Diagram-based Analysis of Causal Systems (DACS) as an approach to analyze complex systems, and applies it by examining the contributions of proximal and distal determinants of childhood acute lower respiratory infections (ALRI) in sub-Saharan Africa.


Diagram-based Analysis of Causal Systems combines the use of causal diagrams with multiple routinely available data sources, using a variety of statistical techniques. In a step-by-step process, the causal diagram evolves from conceptual based on a priori knowledge and assumptions, through operational informed by data availability which then undergoes empirical testing, to integrated which synthesizes information from multiple datasets. In our application, we apply different regression techniques to Demographic and Health Survey (DHS) datasets for Benin, Ethiopia, Kenya and Namibia and a pooled World Health Survey (WHS) dataset for sixteen African countries. Explicit strategies are employed to make decisions transparent about the inclusion/omission of arrows, the sign and strength of the relationships and homogeneity/heterogeneity across settings.

Findings about the current state of evidence on the complex web of socio-economic, environmental, behavioral and healthcare factors influencing childhood ALRI, based on DHS and WHS data, are summarized in an integrated causal diagram. Notably, solid fuel use is structured by socio-economic factors and increases the risk of childhood ALRI mortality.


Diagram-based Analysis of Causal Systems is a means of organizing the current state of knowledge about a specific area of research, and a framework for integrating statistical analyses across a whole system. This partly a priori approach is explicit about causal assumptions guiding the analysis and about researcher judgment, and wrong assumptions can be reversed following empirical testing. This approach is well-suited to dealing with complex systems, in particular where data are scarce.

“Epidemiology will only progress if it combines a detailed understanding of the ways through which the historical, economic and political constitution of how the world is influences the health of populations—and thus, the individuals within these populations—with the appropriate development of methodology and concepts to deal with this complexity” – George Davey Smith[1].

“… a shift in our methodological approach may allow us to better grapple with the complexity of causation within a multilevel understanding of disease etiology.” – Sandro Galea[2].


Childhood acute lower respiratory infections in sub-Saharan Africa

Acute lower respiratory infections (ALRI) represent a permanent global emergency [3]. They were responsible for 6.8% of 2.8 million neonatal deaths, 20.1% of 2.0 million infant deaths and 12.4% of 2.0 million deaths among children aged 1 to 4 years respectively in 2010 [4], with sub-Saharan Africa being the most affected world region [5]. Childhood ALRI is one of several enormous public health problems in developing countries that fail to attract the necessary research to elucidate more of their epidemiology, and how this relates to the environmental and social context.

One of the reasons for this lack of attention is the difficulty of obtaining high-quality health information. In most African countries, vital registration systems are either non-existent or do not reach the whole population, and health surveillance is limited [6, 7]. Instead, national and international decision-making heavily relies on nationally representative household surveys that generate cross-sectional datasets that are largely comparable across countries.

Childhood ALRI is the outcome of a web of interacting socio-economic, environmental, behavioral and healthcare factors. Interventions can thus be directed at curbing mortality (e.g. access to healthcare, timely treatment with antibiotics) [8, 9], at reducing the risk of infection (e.g. improved nutrition, promotion of breastfeeding, reduced exposure to indoor air pollution) [1013], or at instigating longer-term socio-economic changes to create healthier societies [14].

Context is crucial. Fewer than 20% of children with ALRI receive appropriate treatment [15]. Even with successful cure, children return to a home with high risk of re-infection. Interestingly, childhood ALRI deaths in the United States fell by two thirds during the first three decades of the 20th century, prior to the introduction of antibiotics or vaccines [3]. This implies that interventions to improve living conditions play a critical role in reducing ALRI morbidity and mortality [16].

Systems epidemiology

Developing effective interventions requires evidence not only on how individual causal mechanisms influence ALRI risk but also on how these jointly determine disease. This concept could be called systems epidemiology [17]: analysis of the whole system of humans in their total environment within which disease occurs. While systems thinking is increasingly being discussed as important [18, 19], a whole-system approach to analysis is still rare in epidemiology.

Causal diagrams provide an important basis for such an approach. Traditional epidemiology is often confined to the analysis of single links between a proximal determinant and disease, and causal diagrams were introduced in this context, mainly to aid causal inference [20, 21]. In addition, systems approaches using causal diagrams have been introduced in specific contexts, notably infectious disease modeling using flow diagrams and differential equations [22, 23]; causal mediation analysis [24]; and the emerging field of dynamic systems modeling [25, 26].

In principle, a systems perspective can also be applied to large and complex causal networks involving environmental or social factors [27] and/or bio-molecular or genetic pathways [17, 28, 29]. A causal diagram then constitutes a means of organizing the current state of knowledge about that specific area of research, and a framework for integrating statistical analyses [17, 28, 29]. It consists of all the constituent hypotheses within the system, each represented by a link, plus the additional pathways that are necessary for causal inference (e.g. the potential confounders).

In systems epidemiology, diagram construction involves a partly a priori approach: the causal diagram does not emerge from the data but is developed by combining prior knowledge with empirical testing. The former is obtained from pre-existing evidence of various types (e.g. mechanistic or statistical). Thus, constructing such a diagram requires that its structure and content be specified upfront from existing knowledge about the system, supplemented by plausible assumptions where evidence is lacking. It is vital that this includes a complete or near-complete set of potential confounding and selection effects for all the hypotheses embodied in the diagram. Several studies have combined a priori conceptualization of health determinants with modeling, using, for example, quantile regression [30], graphical chain models [3133] or hierarchical models [34]. Other possible statistical tools for estimating causal diagrams include structural equation modeling [24, 35, 36], propensity score matching [37, 38] and instrumental variables [28, 39, 40].

To our knowledge, all of the studies published to date were based on a single dataset. In systems epidemiology, the causal diagram represents the overall state of knowledge about the system. This means that empirical testing can, and often has to, involve multiple datasets. One advantage is repeated examination, where the same link is explored using more than one dataset, allowing the robustness of the evidence to be assessed. A second is that, for complex systems which cannot be covered by an individual dataset, different parts of the system can be examined using different datasets. This may entail decomposing the diagram into corresponding sub-diagrams. Ideally, this decomposition can be done using conditional independence, so that each relationship or group of relationships becomes testable as an individual hypothesis, without introducing further confounding or other distortions [41], but this is not always possible. Where appropriate, selection effects (colliders) can also readily be introduced into a causal diagram [42].


In this paper, we seek to contribute to the concept of systems epidemiology [17] by demonstrating how Diagram-based Analysis of Causal Systems (DACS) can be used to investigate complex questions of disease causation, including both proximal health risks and “causes of causes” [43]. This approach to analysis combines the use of causal diagrams with multiple routinely available data sources, employing a variety of statistical techniques. In the following, we demonstrate the step-by-step progression from a merely plausible a priori causal diagram towards one supported by empirical analysis, aimed at deciphering the contributions of proximal and distal risk factors to childhood ALRI.


This section provides an overview of the steps of causal diagram development (Table 1 shows the specific terminology to describe this process), gives a detailed account of the data used and statistical approach employed, and describes transparent criteria for testing causal diagrams.

Table 1 Terminology for key steps in Diagram-based Analysis of Causal Systems

Steps of causal diagram development

The approach follows the notational conventions of Earp and Ennett [44], Robins [21] and Best and Green [45] (see Figure 1). It relies on the initial assumptions that causal relationships apply independently of the data source used to assess them; and that they are stable between populations and over time. These assumptions can be tested empirically and rejected if appropriate. The timeframe must be clearly specified, as must the unit of analysis (i.e. individual vs. aggregate) and level of aggregation or spatial scale, as they influence the identification and definition of variables included. We focus on relatively short-term processes affecting health outcomes among children under five years of age.

Figure 1
figure 1

Notational conventions.

Step 1: a conceptual causal diagram was constructed a priori based on state-of-the-art knowledge, independent of data availability (Table 1). This was organized into hierarchical layers, which represent the causal and temporal ordering of the different groups of variables (we prefer the term “layer” to “level”, as the latter could be confused with its quite different usage in the context of multilevel modeling). A four-layer diagram resulted, with layers representing context, household socio-economic status, proximal health risks and health outcomes; as a convention, health outcomes are depicted as being harmful [27]. Specificity was introduced by attempting to identify all distinct variables in each layer and all potential causal pathways that connect variables within and across layers, including potential confounders.

This diagram resembles a graphical chain model [46, 47] in the sense that its variables are partitioned into hierarchically arranged subsets (i.e. conceptual groupings), with arrows representing causal associations linking variables in adjacent subsets. Within these conceptual groupings associations between variables are assumed to be non-causal. We represent this assumption by the absence of arrows (e.g. no arrow connecting the different variables within material situation), whereas in graphical chain models, non-causal associations (correlations) between variables within a subset are empirically tested and represented by undirected links.

The omission of an arrow is a stronger decision than its inclusion, because specifying a link that does not exist in the real world will be discovered by finding a parameter value of zero, whereas once omitted the link cannot be tested. In other words, a priori inclusion is more conservative than omission – i.e. it errs on the side of caution – as any false assumptions can still be rectified after empirical testing.

Step 2A: data availability informed which parts of this diagram could be examined using a specific dataset. Conceptual variables were replaced with measured variables, resulting in an operational single-dataset causal diagram (see Table 1). As omission of relevant variables and relationships, where these cannot be measured, is likely to result in residual confounding, conceptual variables and the paths connecting them were retained in the diagram. This serves to ensure that all relevant common causes and colliders that could affect a particular path between exposure and outcome are included. Where this is the case, each arrow in the operational single-dataset causal diagram represents a separate hypothesis. These were tested – in our case using a Demographic and Health Survey (DHS) dataset for Benin – to assess overall support for the hypothesis (see Data and statistical analysis and Criteria for testing causal diagrams).

Step 2B: testing across multiple similar datasets generated an operational multiple-dataset causal diagram (see Table 1) in our case using DHS datasets for Kenya, Ethiopia and Namibia. This step has a confirmatory function, the aim being to assess the degree of consistency/robustness of the step 2A diagram across populations. In making these comparisons, attention must be paid to selection factors or colliders which may differ between populations; these could be examined explicitly [42] and, where appropriate, included in country-specific operational causal diagrams. Hypothesis testing was repeated and conclusions were drawn about the sign of the relationships in each hypothesis (i.e. a decreasing or increasing function), and about its strength (i.e. strong or limited). Where a given path between measured variables did not show any effect across settings, the path was removed from the diagram (see Data and statistical analysis and Criteria for testing causal diagrams).

Step 3: as a single type of dataset rarely includes all the information required for testing a whole conceptual diagram, other datasets with a complementary function can be brought in. We used a pooled World Health Survey (WHS) dataset to examine the pathway linking socio-economic status, solid fuel use and ALRI mortality. The results obtained from WHS testing and DHS testing were combined in an integrated causal diagram (see Table 1). This represents our current best understanding of the relationships specified in the original conceptual diagram, but it is not in any sense a final version; such diagrams continue to evolve as more evidence becomes available to confirm, extend and refine the operational version. This involves repeating steps 2 and/or 3, using the most appropriate statistical approaches in view of data availability and the strength and consistency of evidence already represented in the diagram. Suitable methods include regression techniques, structural equation modeling and instrumental variable approaches, depending on the nature of the relationships to be examined and the data available; Bayesian approaches and/or various types of meta-analysis [48] can also be used to formally combine new evidence with the existing evidence represented in the diagram. Also, periodic reassessment of the adequacy of the original conceptual diagram may be necessary (step 1).

Data and statistical analysis

We used DHS [49] and WHS data [50], nationally representative household surveys with response rates well above 90%, that provide high-quality and comparable information on health and risk factors for a large number of developing countries. DHS datasets for Benin, Kenya, Ethiopia and Namibia were selected because they included information on the main cooking fuel used; they were analyzed separately to allow for a distinction between those causal links that are likely to be consistent between populations and those that may operate in a population-specific fashion (Table 2). WHS data for sixteen countries of sub-Saharan Africa were pooled (Table 3; [51]).

Table 2 DHS sample sizes by country
Table 3 WHS sample sizes by country

While the DHS provides detailed information on the recent occurrence of childhood illnesses as well as child mortality rates, the WHS assesses child mortality and symptoms prior to death. Variables representing ALRI determinants also vary between the data sources (Table 4).

Table 4 Variables related to ALRI and its determinants in DHS and WHS

Given the characteristics of the hypotheses, we used logistic regression, ordered logistic regression and survival analysis to test individual hypotheses of different outcomes [52, 53]; survival analysis was also used to examine the impact of cooking-related parameters on ALRI mortality [51]. In this way, empirical evidence (or lack thereof) was assessed for every arrow connecting two measured variables in the operational causal diagrams. In view of non-independence due to children born to the same mother, all analyses were adjusted for clustering to produce robust standard errors, resulting in wider confidence intervals. We did not adjust for stratification and cluster samples and omitted sample weights, a strategy considered conservative for multivariable analysis of household survey data [54]. Models were run in Stata Special Edition 9 for the whole dataset, and separately for urban and rural settings; exploratory analyses, confirmed by cluster analysis, had revealed striking urban–rural differences in all countries [55].

For each hypothesis, the full model including all measured explanatory variables and all possible nested models were run. Initially, provided the univariate model explained the data significantly better than the null model with p < 0.05, all variables were retained. Subsequently, nested models were compared against the full model using the Akaike Information Criterion (AIC) to explore whether different explanatory variables contributed independently to the outcome of interest. Only differences of 3 or more points are considered meaningful in terms of indicating better model fit [56]; for differences of less than 3 points, the more complete model was reported in line with a conservative approach. Where the diagram showed unmeasured explanatory variables, their likely influence in the form of residual confounding was carefully considered in the interpretation of findings.

Criteria for testing causal diagrams

As we relied on different data sources to inform different parts of the diagram, we decomposed it into a series of separate hypothesis tests and, similar to Weng and colleagues [57], employed conventional covariate selection techniques for testing. The criteria described here provide a basis for replicating the analysis, conducting sensitivity analyses and incorporating new datasets and data sources as they become available. They were developed in an attempt to minimize the role of researcher judgment, and where this is unavoidable make its use explicit and transparent; they represent but one way of doing so.

As inclusion is more conservative than omission, it is important to ensure that no hypothesis is rejected prematurely because of a country-specific situation or data quality issues. Therefore, in step 2A, if any of the univariate models explained the data significantly better than the null model in at least one of the three settings, this was considered to provide evidence towards the hypothesis. This initial testing stage only assessed overall support for a given hypothesis involving several variables, not the role of individual variables.

Figure 2 depicts the decision-making process to assess the sign of a relationship in individual settings. For any hypothesis tested in step 2B, each explanatory variable was considered in turn and separately for urban and rural settings; only variables included in the model selected based on the AIC were judged to have a direct and at least partly independent effect on the outcome variable. Absolute sample sizes, relative sample sizes (i.e. small sub-groups with respect to variables of interest) and measurement error can all affect whether a true relationship is measured as statistically significant or not. We therefore contend that consistency (i.e. central estimates of odds ratios are always above or always below 1) across different levels of a given variable (e.g. low, intermediate, high electric goods index) is potentially a more reliable guide for assessing the sign of a relationship than statistical significance. As detailed in Figure 2, this consistency criterion was applied to distinguish between strong versus limited support for an increasing or decreasing effect on the outcome variable, or to conclude that the relationship is non-ordered.

Figure 2
figure 2

Assessment of sign of relationship in individual settings. Consistency in (statistically significant) odds ratios is determined by assessing whether the central estimates of (statistically significant) odds ratios across different levels of a given variable (e.g. low, intermediate, high electric goods index) are always above or always below 1.

Once the assessment on a country-by-country and setting-by-setting basis was completed, a judgment needed to be made about homogeneity/heterogeneity. Three situations put consistency at stake:

➢ A relationship is classified as increasing across all levels of a variable in some settings but absent in the remaining settings. Such findings contradict consistency but may be explained by small differences in causal mechanisms or by random error.

➢ A relationship is classified as increasing across all levels of a variable in some settings but as decreasing in the remaining settings. Such findings strongly contradict consistency as they point towards distinct setting-specific causal mechanisms unlikely to be explained by chance.

➢ A relationship is classified as non-ordered (i.e. showing an effect, with the sign of the relationships varying at different levels of a variable) in some but as increasing in other settings. Consistency can easily accommodate findings that go further in a sub-set of settings relative to the rest by “downgrading” the classification of the relationship to non-ordered.

In deriving an operational multiple-dataset causal diagram in step 2B, we distinguish between an increasing/decreasing function with strong or limited support, a non-ordered relationship, a setting-specific increasing or decreasing function and no effect, taking into account the possibility of “false negatives” due to measurement error, other data quality issues or random error. As illustrated in Figure 3, we made the somewhat arbitrary decision to allow for one exception regarding the presence or absence of an effect (ntotal-1) and the sign of a relationship (neffect-1). We only considered there to be sufficient evidence of a null effect, when an effect was tested across multiple settings and no relationship was observed in any of these; only under these circumstances was an arrow removed from the diagram.

Figure 3
figure 3

Assessment of homogeneity/heterogeneity across settings. ntotal refers to all settings where the hypothesis can be tested. neffect refers to all settings where an effect can be detected.


Rather than giving detailed results for each hypothesis, this section summarizes the overall findings; Table 5 provides a detailed description of the step-by-step process of causal diagram development and testing for one hypothesis; a similar account for all other hypotheses can be found in Rehfuess [55].

Table 5 A step-by-step example: impact of wealth and parental education on solid fuel use

The conceptual causal diagram in Figure 4 (step 1) summarizes how distinct socio-economic determinants (embedded in a given national or local context) and proximal health determinants (grouped as vulnerability, exposure and access to effective healthcare) jointly determine a child’s risk of ALRI morbidity and ALRI mortality. Based on a priori knowledge derived from a review of the literature and assumptions, it is a graphical representation of our conceptualization of reality which, given an ideal dataset, could be examined in its entirety.

Figure 4
figure 4

A conceptual causal diagram.

Using the Beninese dataset, “conceptual variables” in Figure 4 were replaced with actual variables. The DHS does not provide cause-of-death information, making an assessment of ALRI mortality impossible. A significant challenge in examining ALRI morbidity among children is the distinction between frequent but harmless infections of the upper respiratory tract and rare but potentially life-threatening infections of the lower respiratory tract. Preliminary analysis of DHS records of cough and fast breathing during the two weeks prior to the survey indicated that they could not serve as a useful proxy for ALRI. Consequently, impacts on the health layer (i.e. child’s ALRI morbidity and mortality) could not be investigated in the operational single-dataset causal diagram (step 2A). As child’s access to effective healthcare operates by influencing progression from morbidity to mortality, the effect of healthcare-related variables could also not be tested.

The DHS assesses a large number of socio-economic and proximal determinants of child ALRI (Table 4), allowing us to populate many of the originally specified conceptual variables (Figure 5). For some, however, no suitable proxy was available. For example, information on maternal smoking was available for all children in the DHS datasets and selected children in the WHS datasets. In the sub-Saharan countries examined here, however, maternal smoking rates are very low and information was not available about the smoking habits of other household members. Maternal smoking was therefore considered an inadequate measure of environmental tobacco smoke and was not examined further. Independent hypotheses based on measured variables were formulated (Table 6). Causal diagram testing produced evidence in support of all hypotheses in Benin (Figure 5; [55]).

Figure 5
figure 5

An operational single-dataset causal diagram.

Table 6 Empirically testable hypotheses

Figure 6 summarizes results of step 2B. We found that education, occupation and wealth exert their influence on proximal health risks through at least partly independent pathways, and that some dimensions of socio-economic status, in particular material circumstances and related purchasing power, play a greater role in determining risk factor profiles than others. Stunting, solid fuel use and vaccination emerge as particularly strongly structured by socio-economic variables.

Figure 6
figure 6

An operational multiple-dataset causal diagram.

The relationships among socio-economic factors (hypotheses 1–3) and among proximal health determinants (hypothesis 4) show a high degree of consistency and appear to apply independent of country or geographical setting. More heterogeneity is observed for the impact of socio-economic factors on proximal health determinants (hypotheses 5–10). This is due in part to variation in the relative importance of a given measure of socio-economic status, and in part to different mechanisms operating in different settings. The latter highlights the importance of contextual factors, such as ethnicity and urban–rural location. In the present analysis, we used contextual factors (i.e. urban–rural location) as stratification variables. As all variables identified for household socio-economic status identified in the conceptual diagram were populated, unmeasured confounding is negligible, provided the diagram was correctly specified.

The WHS assesses several socio-economic determinants, solid fuel use and, importantly, cause-specific child mortality (Table 4), and can therefore be used in step 3. Here we examined how maternal education, wealth and solid fuel use jointly influence ALRI mortality (hypothesis 11); the explanatory variable paternal education was not available in the WHS but, given its high correlation with maternal education, residual confounding is likely to be minor. We used an a priori ALRI definition of cough accompanied by rapid breathing or chest indrawing that closely resembles the ALRI algorithm in standard verbal autopsy tools [75]. First, robustness of the links between wealth, maternal education and solid fuel use was assessed in individual countries and in the pooled African dataset (Tables 5, 7 and 8), indicating a high degree of compatibility between DHS and WHS. Subsequently, determinants of ALRI mortality were examined in the pooled African dataset.

Table 7 Results for hypothesis 9 using DHS data: Odds ratios for logistic regression of solid fuel use on wealth, maternal education and paternal education**
Table 8 Results for hypothesis 9 using WHS data: Odds ratios for logistic regression of solid fuel use on wealth and maternal education**

In univariate models using the a priori ALRI definition, a child’s risk of dying from ALRI prior to its fifth birthday was 2.54 times higher (95% confidence interval: 1.42, 4.55) for children living in households that cook with wood, dung or charcoal relative to children living in households that cook with gas or electricity. High wealth compared to low wealth and secondary or higher education of the mother compared to no education showed ALRI mortality hazard ratios of 0.68 (0.46, 1.00) and 0.65 (0.40, 1.06) respectively. Including wealth and maternal education variables in the solid fuel use model yielded an adjusted hazard ratio of 2.35 (1.22, 4.52). Alternative definitions of ALRI, which were less or more sensitive, resulted in adjusted hazard ratios of 2.07 (1.22, 3.53) and 3.21 (1.18, 8.76) respectively [51]. Based on these findings an arrow representing an increasing relationship with strong support was introduced to link solid fuel use and ALRI mortality. This integrated causal diagram summarizes our current understanding of the web of ALRI determinants (Figure 7).

Figure 7
figure 7

An integrated causal diagram.


Strengths and weaknesses of Diagram-based Analysis of Causal Systems

The basic idea of DACS is the use of causal diagrams as a formal tool for representing state-of-the-art knowledge of a specific research area. This whole-system approach makes the analyst’s view of reality and assumptions explicit; generates testable hypotheses; provides a framework for model building and statistical analysis, potentially using a broad range of statistical methods; accommodates multiple sources of evidence; evolves from conceptual to empirically supported; identifies research gaps; highlights entry-points for public health interventions; and facilitates communication between stakeholders.

Constructing a causal diagram involves a dialog between an a priori statement of causal structure and an inductive a posteriori method of testing. Each informs the other: the structure facilitates design of studies to test specific hypotheses, and accumulating evidence stimulates revision of the causal structure. Each diagram explicitly represents not only pathways of substantive interest, but also those that could bring about non-random statistical associations in the absence of direct causation: i.e. reverse causation, confounding (common ancestor(s)) or Berksonian bias (common descendant(s)). By representing all relevant potential causal pathways, this resembles established epidemiological methodology of directed acyclic graphs for addressing biases and selection effects relating to a single causal pathway [20, 21, 44, 76, 77].

While individual causal pathways in our approach resemble directed acyclic graphs, the overall DACS approach is more closely related to graphical chain models, where complex association structures are explicitly recognized, sets of variables are placed along a dependence chain and then modeled using a series of regression analyses [46, 47]. DACS goes beyond conventional graphical chain models, in particular by: (i) highlighting the importance of evidence in developing the initial conceptual diagram, which leads to a more finely-structured diagram comprising conceptual groupings within layers (=blocks) rather than only blocks and which explicitly promotes a hypothesis-driven analysis strategy; (ii) emphasizing model building through a structured step-by-step method, and embracing a forward-looking approach, where the causal diagram is expected to continue to evolve over time; and (iii) allowing for the possibility of drawing on multiple datasets, both for the original analysis and for the subsequent addition of information.

The last point in particular is a key strength of DACS and carries two advantages. First, repeated assessment allows the robustness of a single causal pathway across data sources and settings to be examined. Secondly, following decomposition of the system, relevant sub-structures can be assessed using different data sources, although this does entail specifying potential confounders that may be introduced where conditional independence cannot be used.

The requirement to specify all possible causal pathways, including those that cannot be observed, is formidable both with a causal diagram-based and the traditional approach. Researcher judgment plays a role in both, but is more explicit in the combined a priori/empirical method because it is exercised prior to the analysis, and diagramming makes it visible and transparent. With a purely inductive approach, findings that are not based on previously specified hypotheses may be easier to ignore or dismiss when inconvenient [77], and such findings are suspect when novel – the “fishing expedition” problem. With a partly a priori approach, judgments and assumptions are uninfluenced by the data and have to be made transparent; any changes following the statistical analysis must be explicitly stated and justified. Indeed, incorrect researcher judgment is also potentially reversible: the conceptual diagram is periodically revised as further evidence becomes available. Initial conceptual diagrams are, however, not necessarily unique, and it is good practice for researchers to use their diagrams to explore underlying differences in their assumptions [27]. Importantly, it is impossible to specify a correct analytical strategy without causal knowledge or assumptions [57, 76, 78].

Strengths and weaknesses of the application in this study

The application of this method to the problem of ALRI in developing countries here shows several specific weaknesses. In most complex systems there are likely to be some unspecified or unmeasured common causes, and this almost certainly holds true for the present analysis, which was limited by data availability and quality [55]. Neither the DHS nor the WHS include all relevant variables and, as a result, we were unable to control adequately for confounding in several of the observed relationships. Moreover, several variables are subject to significant measurement error (e.g. a household’s main cooking fuel as proxy for a child’s exposure to indoor air pollution in DHS and WHS; a combination of reported symptoms prior to child death as proxy for clinically-confirmed cause of death in WHS).

In addition, the cross-sectional nature of the datasets precluded analysis of the temporal sequence of events. We carefully assessed the likely impact of this limitation, and consider that reverse causation and feedback loops were unlikely to be important given the short timeframe of interest specified.

Another limitation is that our models assume simple (e.g. linear) functional form and an absence of effect modification or statistical interaction. With the future availability of more and richer datasets on ALRI determinants, it will become possible to test, and if necessary modify, this assumption. Finally, the countries investigated here represent a convenience sample – they were not chosen for their representativeness of the African continent. Nevertheless, based on their geography, climate and distinct colonial histories, Benin, Ethiopia, Kenya and Namibia can be considered representative of a variety of living conditions in sub-Saharan Africa, and so can the pooled WHS dataset of sixteen countries. In summary, the results presented here have only interim status, reflecting our understanding of the system based on currently available evidence.

Key findings and their implications

Our research has only been able to shed light on one of the many pathways that link distal and proximal determinants of childhood ALRI, the inter-linkages between socio-economic status, solid fuel use and ALRI mortality.

The risk of dying from ALRI during childhood varies between population groups but socio-economic differences may be less marked than those commonly observed with respect to all-cause child and infant mortality [7987]; indeed most of the observed socio-economic gradients in ALRI mortality do not reach significance. This is consistent with previous reports of minor socio-economic gradients in childhood ALRI mortality [88, 89] and morbidity [58, 9093].

Wealth, maternal education and, to a lesser extent, paternal education are highly protective against solid fuel use. This suggests that these play at least partly independent roles in structuring cooking fuel use in sub-Saharan Africa. Previous reports [58, 59] that cooking with liquefied petroleum gas is more prevalent among sub-populations with higher income and higher educational attainment accord with these findings.

Our research confirms solid fuel use as a major risk factor for ALRI among African children, suggesting that exposure more than doubles the risk of ALRI mortality. These results are comparable to findings in two case–control studies of ALRI mortality conducted in Tanzania [89] and the Gambia [88], and a case–control study of all-cause child mortality in India [94]. Reducing solid fuel use could therefore be an important means of reducing ALRI mortality.

We found that socioeconomic variables affected solid fuel use, and that solid fuel use affected ALRI mortality. For solid fuel use to qualify as a mediator of the impact of socio-economic status on ALRI mortality, the relationship between maternal education/wealth and ALRI mortality should be attenuated once solid fuel use is included in the model [95]. Surprisingly, this was not the case. Common concerns in mediation analysis are unmeasured confounding and measurement error, but in this instance a third issue proved to be the explanation: heterogeneity in the population. Detailed investigations revealed the presence of a threshold effect in the wealth-fuel use relationship: i.e. differences in fuel use are limited among the lower wealth quintiles but stark between the bottom four quintiles and the richest quintile. Within the richest quintile solid fuel use acts as a mediator of the wealth-ALRI mortality relationship: when the variable is included in the model the hazard ratio is attenuated from 0.66 (0.41; 1.07) to 0.87 (0.52; 1.44). Causal diagrams therefore play an important role in making complex issues of mediation and non-transmission apparent.

The integrated causal diagram in Figure 7 illustrates two potential entry-points for interventions. Increasing household wealth and, to a lesser extent, improving education and knowledge represent entry-points for social intervention, which reduce solid fuel use indirectly over long periods of time. A switch to cleaner fuels represents an entry-point for environmental intervention, which directly and immediately addresses hazardous household energy practices. An alternative would be promotion of cleaner-burning, more fuel-efficient stoves to reduce indoor air pollution, as demonstrated by a randomized controlled trial in Guatemala [96] and a systematic review of the effectiveness of different improved stove models in reducing household air pollution (Bruce et al., manuscript in preparation). As many projects and programs to improve access to modern household energy have either failed to reach the lowest-income group (e.g. Indonesian national LPG program [97]) or failed to bring about sustainable benefits (e.g. Indian National Improved Cookstove program [98]) we believe that technical interventions to reduce solid fuel use must be embedded in an integrated approach that considers and, to some extent, addresses socio-economic determinants if they are to benefit those who need them most. This conclusion accords with the findings of a recent systematic review of enablers and barriers to uptake of improved stoves [61].

Importantly, Figure 7 exposes neglected areas of research and can thus be useful in setting the research agenda. For example, it is noteworthy how few of the proximal risk factors are actually measured in DHS/WHS and that, based on routinely available data, no analysis of the multiple determinants of ALRI morbidity is feasible. Apart from further research along the lines reported here using additional datasets, efficacy and effectiveness trials of specific technical interventions, and natural experiments relating to policy changes, could illuminate the interrelationship of socio-economic conditions, solid fuel use and ALRI. The design of research could also benefit, e.g. by ensuring that omitted variables (residual confounding) are kept to a minimum, and, where possible, that datasets refer to a section of the diagram that can be disaggregated using conditional independence.


We have described the use of causal diagrams as a means of organizing the current state of knowledge about a specific area of research, and as a framework for integrating statistical analyses across a whole system. The structure and content of the causal diagram is originally conceptualized using information from previous studies, and supplemented with assumptions and theories about how individual components of the system combine. This conceptualization of reality is then tested against observed data, using multiple datasets and data sources. Importantly, this method does not aim to decipher one-to-one relationships whilst holding all other variables constant; it is designed to understand whether and how components within the system work together and how these jointly promote or prevent disease.

We believe that Diagram-based Analysis of Causal Systems is well-suited to dealing with large and complicated systems, as long as they do not have feedback loops. While the combination of a priori hypotheses and their testing against observed data is common in many other scientific disciplines, such as environmental science, genetics and astronomy, it is still rare in epidemiology and the health sciences. We hope that this paper, together with recent commentaries calling for a shift in methodological focus in epidemiology [2, 99, 100], will help to raise awareness of the potential for systems approaches in epidemiology to account for the complexity of disease causation in populations.

We have applied our approach to explore the determinants of childhood ALRI in sub-Saharan Africa, an area of research that is characterized by limited data availability and poor data quality. These challenges are representative of many other neglected health problems in the developing world. The unique combination of rigor (in terms of comprehensive causal thinking) and flexibility (in terms of accommodating multiple sources of evidence) thus make Diagram-based Analysis of Causal Systems significant for public health research far beyond the research area considered here. The strength of this paper lies in the development of a protocol for developing and testing causal diagrams, and we invite those concerned with multifactorial health problems in the developing or developed world to test and improve our approach.



Akaike information criterion


Acute lower respiratory infections


Demographic and Health Surveys


World Health Survey.


  1. Davey SG: Reflections on the limitations to epidemiology. J Clin Epidemiol. 2001, 54: 325-331. 10.1016/S0895-4356(00)00334-6

    Article  Google Scholar 

  2. Galea S, Riddle M, Kaplan G: Causal thinking and complex systems approaches in epidemiology. Int J Epidemiol. 2010, 39: 97-106. 10.1093/ije/dyp296

    Article  PubMed Central  PubMed  Google Scholar 

  3. Mulholland K: Childhood pneumonia mortality - a permanent global emergency. Lancet. 2007, 370: 285-289. 10.1016/S0140-6736(07)61130-1

    Article  PubMed  Google Scholar 

  4. Lozano R, Naghavi M, Foreman K, Lim S, Shibuya K, Aboyans V, et al: Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet. 2012, 380: 2095-2128. 10.1016/S0140-6736(12)61728-0

    Article  PubMed  Google Scholar 

  5. Bryce J, Boschi-Pinto C, Shibuya K, Black R, : WHO estimates of the causes of death in children. Lancet. 2005, 365 (9465): 1147-1152. 10.1016/S0140-6736(05)71877-8

    Article  PubMed  Google Scholar 

  6. Mahapatra P, Shibuya K, Lopez A, Coullare F, Notzon F, Rao C, et al: Civil registration systems and vital statistics: successes and missed opportunities. Lancet. 2007, 370: 1653-1663. 10.1016/S0140-6736(07)61308-7

    Article  PubMed  Google Scholar 

  7. Setel P, Macfarlane S, Szreter S, Mikkelsen L, Jha P, Stout S, et al: A scandal of invisibility: making everyone count by counting everyone. Lancet. 2007, 370: 1569-1577. 10.1016/S0140-6736(07)61307-5

    Article  PubMed  Google Scholar 

  8. Graham S, English M, Hazir T, Enarson P, Duke T: Challenges to improving case management of childhood pneumonia at health facilities in resource-limited settings. Bull World Health Organ. 2008, 86 (5): 349-355. 10.2471/BLT.07.048512

    Article  PubMed Central  PubMed  Google Scholar 

  9. Marsh D, Gilroy K, Van de Weerdt R, Wansi E, Qazi S: Community case management of pneumonia: at a tipping point?. Bull World Health Organ. 2008, 86 (5): 381-389. 10.2471/BLT.07.048462

    Article  PubMed Central  PubMed  Google Scholar 

  10. Dherani M, Pope D, Mascarenhas M, Smith K, Weber M, Bruce N: Indoor air pollution from unprocessed solid fuel use and pneumonia risk in children aged under five years: a systematic review and meta-analysis. Bull World Health Organ. 2008, 86 (5): 390-398. 10.2471/BLT.07.044529

    Article  PubMed Central  PubMed  Google Scholar 

  11. Greenwood B: A global action plan for the prevention and control of pneumonia. Bull World Health Organ. 2008, 86 (5): 322-322A. 10.2471/BLT.08.053348

    Article  PubMed Central  PubMed  Google Scholar 

  12. Roth D, Caulfield L, Ezzati M, Black R: Acute lower respiratory infections in childhood: opportunities for reducing the global burden through nutritional interventions. Bull World Health Organ. 2008, 86 (5): 356-364. 10.2471/BLT.07.049114

    Article  PubMed Central  PubMed  Google Scholar 

  13. Roth D, Richard S, Black R: Zinc supplementation for the prevention of acute lower respiratory infection in children in developing countries: meta-analysis and meta-regression of randomized trials. Int J Epidemiol. 2010, 39: 759-808.

    Article  Google Scholar 

  14. Marmot M: Creating healthier societies. Bull World Health Organ. 2004, 82 (5): 320.

    PubMed Central  CAS  PubMed  Google Scholar 

  15. UNICEF, World Health Organization: Pneumonia: The Forgotten Killer of Children. New York: UNICEF; 2006.

    Google Scholar 

  16. Rutstein S: Factors associated with trends in infant and child mortality in developing countries during the 1990s. Bull World Health Organ. 2000, 78 (10): 1256-1270.

    PubMed Central  CAS  PubMed  Google Scholar 

  17. Joffe M, Gambhir M, Chadeau-Hyam M, Vineis P: Causal diagrams in systems epidemiology. Emerg Themes Epidemiol. 2012, 9 (1): 1. 10.1186/1742-7622-9-1

    Article  PubMed Central  PubMed  Google Scholar 

  18. Jayasinghe S: Conceptualising population health: from mechanistic thinking to complexity science. Emerg Themes Epidemiol. 2011, 8: 2. 10.1186/1742-7622-8-2

    Article  PubMed Central  PubMed  Google Scholar 

  19. Krieger N: Epidemiology and the web of causation: has anyone seen the spider?. Soc Sci Med. 1994, 39 (7): 887-903. 10.1016/0277-9536(94)90202-X

    Article  CAS  PubMed  Google Scholar 

  20. Greenland S, Pearl J, Robins J: Causal diagrams for epidemiologic research. Epidemiol. 1999, 10 (1): 37-48. 10.1097/00001648-199901000-00008.

    Article  CAS  Google Scholar 

  21. Robins J: Data, design, and background knowledge in etiologic inference. Epidemiol. 2001, 11 (3): 313-320.

    Article  Google Scholar 

  22. Anderson R, May R: Infectious Diseases of Humans: Dynamics and Control. Oxford: Oxford University Press; 1992.

    Google Scholar 

  23. Garnett G, Anderson R: Balancing sexual partnerships in an age and activity stratified model of HIV transmission in heterosexual populations. IMA J Math Appl Med Biol. 1994, 1 (3): 161-192.

    Article  Google Scholar 

  24. Ditlevsen S, Christensen U, Lynch J, Damsgaard M, Keiding N: The mediation proportion: a structural equation approach for estimating the proportion of exposure effect on outcome explained by an intermediate variable. Epidemiol. 2005, 16 (1): 114-120. 10.1097/01.ede.0000147107.76079.07.

    Article  Google Scholar 

  25. Auchincloss A, Diez RA: A new tool for epidemiology: the usefulness of dynamic agent modes in understanding place effects on health. Am J Epidemiol. 2008, 168 (1): 1-8. 10.1093/aje/kwn118

    Article  PubMed  Google Scholar 

  26. Gorman D, Mezic J, Mezic I, Gruenewald P: Agent-based modeling of drinking behavior: a preliminary model and potential applications to theory and practice. Am J Public Health. 2006, 96 (11): 2055-2060. 10.2105/AJPH.2005.063289

    Article  PubMed Central  PubMed  Google Scholar 

  27. Joffe M, Mindell J: Complex causal process diagrams for analyzing the health impacts of policy interventions. Am J Public Health. 2006, 96 (3): 473-479. 10.2105/AJPH.2005.063693

    Article  PubMed Central  PubMed  Google Scholar 

  28. Allin K, Nordestgaard B, Zacho J, Tybjaerg-Hansen A, Bojesen S: C-reactive protein and the risk of cancer: a mendelian randomization study. J Natl Cancer Inst. 2010, 102: 202-206. 10.1093/jnci/djp459

    Article  CAS  PubMed  Google Scholar 

  29. Marott S, Nordestgaard B, Zacho J, Friberg J, Jensen G, Tybjaerg-Hansen A, et al: Does elevated C-reactive protein increase atrial fibrillation risk? A Mendelian randomization of 47, 000 individuals from the general population. J Am Coll Cardiol. 2010, 56: 789-795. 10.1016/j.jacc.2010.02.066

    Article  PubMed  Google Scholar 

  30. Fenske N, Burns J, Hothorn T, Rehfuess EA: Understanding child stunting in India: a comprehensive analysis of socio-economic, nutritional and environmental determinants using additive quantile regression. PLOS ONE. in press.

  31. Caputo A, Foraita R, Klasen S, Pigeot I: Undernutrition in Benin - an analysis based on graphical models. Soc Sci Med. 2003, 56: 1677-1691. 10.1016/S0277-9536(02)00162-4

    Article  PubMed  Google Scholar 

  32. Foraita R, Klasen S, Pigeot I: Using graphical chain models to analyze differences in structural correlates of undernutrition in Benin and Bangladesh. Econ Hum Biol. 2008, 6: 398-419. 10.1016/j.ehb.2008.07.002

    Article  PubMed  Google Scholar 

  33. Mohamed W, Diamond I, Smith P: The determinants of infant mortality in Malaysia: a graphical chain modelling approach. J R Stat Soc Ser A. 1998, 161 (3): 349-366. 10.1111/1467-985X.00113.

    Article  CAS  Google Scholar 

  34. Ferrer S, Strina A, Jesus S, Ribeiro H, Cairncross S, Rodrigues L, et al: A hierarchical model for studying risk factors for childhood diarrhoea: a case–control study in a middle-income country. Int J Epidemiol. 2008, 37: 805-815. 10.1093/ije/dyn093

    Article  PubMed  Google Scholar 

  35. Amorim L, Fiaccone R, Santos C, Santos T, de Moraes L, Oliveira N, et al: Structural equation modelling in epidemiology. Cad Saude Publica. 2010, 26 (12): 2251-2262. 10.1590/S0102-311X2010001200004

    Article  PubMed  Google Scholar 

  36. Dahly D, Adair L, Bollen K: A structural equation model of the developmental origins of blood pressure. Int J Epidemiol. 2009, 38 (2): 538-548. 10.1093/ije/dyn242

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  37. Arnold B, Khush R, Ramaswamy P, London A, Rajkumar P, Ramaprabha P, et al: Causal inference methods to study nonrandomized, preexisting development interventions. Proc Natl Acad Sci U S A. 2010, 107 (52): 22605-22610. 10.1073/pnas.1008944107

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  38. Cousens S, Hargreaves J, Bonell C, Armstrong B, Thomas J, Kirkwood B, et al: Alternatives to randomization in the evaluation of public health interventions: statistical analysis and causal inference. J Epidemiol Comm Health. 2011, 65 (7): 567-581.

    Article  Google Scholar 

  39. Chen L, Davey Smith G, Harbrod R, Lewis S: Alcohol intake and blood pressure: a systematic review implementing a Mendelian randomization approach. PLOS Med. 2008, 5 (3): e52. 10.1371/journal.pmed.0050052

    Article  PubMed Central  PubMed  Google Scholar 

  40. Greenland S: An introduction to instrumental variables for epidemiologists. Int J Epidemiol. 2000, 29: 722-729. 10.1093/ije/29.4.722

    Article  CAS  PubMed  Google Scholar 

  41. Lauritzen S: Graphical Models. Oxford: Clarendon; 1996.

    Google Scholar 

  42. Genelletti S, Best N, Toledano M, Elliott P, Richardson S: Uncovering selection bias in case–control studies using Bayesian post-stratification. Stat Med. epub ahead of print, 2013.

  43. Rose G: The Strategy of Preventive Medicine. New York: Oxford University Press; 1992.

    Google Scholar 

  44. Earp J, Ennett S: Conceptual models for health education and practice. Health Educ Res. 1991, 6 (2): 163-171. 10.1093/her/6.2.163

    Article  CAS  PubMed  Google Scholar 

  45. Best N, Green P: Structure and uncertainty: graphical models for understanding complex data. Significance. 2005, 2 (4): 177-181. 10.1111/j.1740-9713.2005.00133.x.

    Article  Google Scholar 

  46. Lauritzen S, Wermuth N: Graphical models for associations between variables, some of which are qualitative and some quantitative. Ann Stat. 1989, 17 (1): 31-57. 10.1214/aos/1176347003.

    Article  Google Scholar 

  47. Wermuth N, Lauritzen S: On substantive research hypotheses, conditional independence graphs and graphical chain models. J R Stat Soc Ser B. 1990, 52 (1): 21-50.

    Google Scholar 

  48. Petticrew M, Rehfuess E, Noyes J, Higgins J, Sowden A, Shemilt I: Synthesising evidence on complex interventions: the contribution of meta-analytic, qualitative and mixed-method approaches. J Clin Epidemiol. 2013. epub ahead of print.

  49. OrcMacro International Inc: Demographic and Health Surveys. (accessed 15 August 2011).

  50. World Health Organization: (accessed 12 September 2011).

  51. Rehfuess E, Tzala L, Best N, Briggs D, Joffe M: Solid fuel use and cooking practices as a major risk factor for ALRI mortality among African children. J Epidemiol Comm Health. 2009, 63 (11): 887-892. 10.1136/jech.2008.082685.

    Article  CAS  Google Scholar 

  52. Clayton D, Hills M: Statistical Models in Epidemiology. Oxford: Oxford University Press; 1993.

    Google Scholar 

  53. Hills M, de Stavola B: Regression Commands. A Short Introduction to Stata for Biostatistics. Edited by: Hills M, de Stavola BL. London: Timberlake Consultants Ltd; 2002.

    Google Scholar 

  54. O’Donnell O, van Doorslaer E, Wagstaff A, Lindelow M: Analyzing Health Equity using Household Survey Data: A Guide to Techniques and Their Implementation. Washington D.C: World Bank; 2008.

    Google Scholar 

  55. Rehfuess EA: PhD thesis. Solid Fuel Use and Child Health in Africa: A Causal Analysis of Social Determinants. London: Imperial College London; 2008.

    Google Scholar 

  56. Burnham K, Anderson D: Model Selection and Inference: A Practical Information-Theoretic Approach. New York, Berlin, Heidelberg: Springer; 1998.

    Book  Google Scholar 

  57. Weng H-Y, Hsueh Y-H, Messam L, Hertz-Picciotto I: Methods of covariate selection: directed acyclic graphs and the change-in-estimate procedure. Am J Epidemiol. 2009, 169 (10): 1182-1190. 10.1093/aje/kwp035

    Article  PubMed  Google Scholar 

  58. Johnson A-W, Aderele W: The association of household pollutants and socio-economic risk factors with the short-term outcome of acute lower respiratory infections in hospitalised pre-school Nigerian children. Ann Trop Paediatr. 1992, 12 (4): 421-432.

    CAS  PubMed  Google Scholar 

  59. Rinne S, Rodas E, Rinne M, Simpson J, Glickman L: Use of biomass fuels is associated with infant mortality and child health in trend analysis. Am J Trop Med Hyg. 2007, 76 (3): 585-591.

    CAS  PubMed  Google Scholar 

  60. Lewis J, Pattanayak S: Who adopts improved fuels and cookstoves? A systematic review. Env Health Perspect. 2012, 120 (5): 637-645. 10.1289/ehp.1104194.

    Article  Google Scholar 

  61. Rehfuess EA, Puzzolo E, Stanistreet D, Pope D, Bruce N: Enablers and barriers to large-scale uptake of improved solid fuel stoves: a systematic review. Env Health Perspect. in press.

  62. Aerts D, Drachler M, Giugliani E: Determinants of growth retardation in Southern Brazil. Cad Saude Publica. 2004, 20 (5): 1182-1190. 10.1590/S0102-311X2004000500011

    Article  PubMed  Google Scholar 

  63. Bloss E, Wainaina F, Baileys R: Prevalence and predictors of underweight, stunting and wasting among children aged 5 and under in Western Kenya. J Trop Pediatr. 2004, 50 (5): 260-270. 10.1093/tropej/50.5.260

    Article  PubMed  Google Scholar 

  64. Fawzi W, Herrera M, Nestel P, El Amin A, Mohamed K: A longitudinal study of prolonged breastfeeding in relation to child undernutrition. Int J Epidemiol. 1998, 27: 255-260. 10.1093/ije/27.2.255

    Article  CAS  PubMed  Google Scholar 

  65. Mamiro P, Kolsteren P, Roberfroid D, Tatala S, Opsomer A, Van Camp J: Feeding practices and factors contributing to wasting, stunting, and iron-deficiency anaemia among 3-23-month-old children in Kilosa district, rural Tanzania. J Health Pop Nutr. 2005, 23 (3): 222-230.

    Google Scholar 

  66. Torres-Arreola L, Constantino-Casas P, Flores-Hernández S, Vila-Barragán J, Rendón-Macías E: Socio-economic factors and low birthweight in Mexico. BMC Public Health. 2005, 5: 20. 10.1186/1471-2458-5-20

    Article  PubMed Central  PubMed  Google Scholar 

  67. Bellessa-Frost M, Forste R, Haas D: Maternal education and child nutritional status in Bolivia: finding the links. Soc Sci Med. 2005, 60: 395-407. 10.1016/j.socscimed.2004.05.010

    Article  Google Scholar 

  68. Delpeuch F, Traissac P, Martin-Prével Y, Massamba J, Maire B: Economic crisis and malnutrition: socioeconomic determinants of anthropometric status of preschool children and their mothers in an African urban area. Public Health Nutr. 2000, 3 (1): 39-47.

    Article  CAS  PubMed  Google Scholar 

  69. Fotso J-C: Urban–rural differentials in child malnutrition: trends and socioeconomic correlates in sub-Saharan Africa. Health Place. 2007, 13: 205-223. 10.1016/j.healthplace.2006.01.004

    Article  PubMed  Google Scholar 

  70. Giashuddin M, Kabir M, Hasan M: Economic disparity and child nutrition in Bangladesh. Ind J Pediatr. 2005, 72 (6): 481-487. 10.1007/BF02724424.

    Article  CAS  Google Scholar 

  71. Hong R, Misra V: Effect of wealth inequality on chronic undernutrition in Cambodian children. J Health Popul Nutr. 2006, 24 (1): 89-99.

    PubMed  Google Scholar 

  72. Engebretsen I, Wamani H, Karamagi C, Semiyaga N, Tumwine J, Tylleskär T: Low adherence to exclusive breastfeeding in Eastern Uganda: a community-based cross-sectional study comparing dietary recall since birth with 24-hour recall. BMC Pediatr. 2007, 7 (10).

  73. Topuzoglu A, Ozaydin G, Cali S, Cebeci D, Kalaca S, Harmanci H: Assessment of sociodemographic factors and socioeconomic status affecting the coverage of compulsory and private immunization services in Istanbul. Turkey. Public Health. 2005, 119 (10): 862-869. 10.1016/j.puhe.2005.01.015.

    Article  CAS  PubMed  Google Scholar 

  74. Torun S, Bakirci N: Vaccination coverage and reasons for non-vaccination in a district of Istanbul. BMC Public Health. 2006, 5 (6): 125.

    Article  Google Scholar 

  75. World Health Organization: A Standard Verbal Autopsy Method for Investigating Causes of Death in Infants and Children. Geneva: World Health Organization; 1999.

    Google Scholar 

  76. Pearl J: Causal diagrams for empirical research. Biometrika. 1995, 82 (4): 669-710. 10.1093/biomet/82.4.669.

    Article  Google Scholar 

  77. Pearl J: Causality: Models, Reasoning and Inference. Cambridge: Cambridge University Press; 2000.

    Google Scholar 

  78. VanderWeele T, Hernán M, Robins J: Causal directed acyclic graphs and the direction of unmeasured confounding bias. Epidemiol. 2008, 19: 720-728. 10.1097/EDE.0b013e3181810e29.

    Article  Google Scholar 

  79. Bicego B, Boerma J: Maternal education and child survival: a comparative study of survey data from 17 countries. Soc Sci Med. 1993, 36 (9): 1207-1227. 10.1016/0277-9536(93)90241-U

    Article  CAS  PubMed  Google Scholar 

  80. Brockerhoff M, Hewett P: Inequality of child mortality among ethnic groups in sub-Saharan Africa. Bull World Health Organ. 2000, 78 (1): 30-41.

    PubMed Central  CAS  PubMed  Google Scholar 

  81. Desai M, Alva S: Maternal education and child health: is there a strong causal relationship?. Demography. 1998, 35 (1): 71-81. 10.2307/3004028

    Article  CAS  PubMed  Google Scholar 

  82. Gwatkin D, Rutstein S, Johnson K, Pande R, Wagstaff A: Socioeconomic Differences in Health, Nutrition and Population. Washington D.C: World Bank; 2000.

    Google Scholar 

  83. Gwatkin D, Rutstein S, Johnson K, Suliman E, Wagstaff A, Amouzou A: Socioeconomic Differences in Health, Nutrition and Population. Washington D.C: World Bank; 2007.

    Google Scholar 

  84. Macassa G, Ghilagaber G, Bernhardt E, Diderichsen F, Burström B: Inequalities in child mortality in Mozambique: differentials by parental socio-economic position. Soc Sci Med. 2003, 57: 2255-2264. 10.1016/S0277-9536(02)00545-2

    Article  PubMed  Google Scholar 

  85. Moser K, Leon D, Gwatkin D: How does progress towards the child mortality Millennium Development Goal affect inequalities between the poorest and the least poor? Analysis of Demographic and Health Survey data. BMJ. 2005, 331: 1180-1182. 10.1136/bmj.38659.588125.79

    Article  PubMed Central  PubMed  Google Scholar 

  86. Wagstaff A: Socioeconomic inequalities in child mortality: comparisons across nine developing countries. Bull World Health Organ. 2000, 78 (1): 19-29.

    PubMed Central  CAS  PubMed  Google Scholar 

  87. Wagstaff A: Child health on a dollar a day: some tentative cross-country comparisons. Soc Sci Med. 2003, 57 (9): 1529-1538. 10.1016/S0277-9536(02)00555-5

    Article  PubMed  Google Scholar 

  88. de Francisco A, Morris J, Hall A, Armstrong Schellenberg J, Greenwood B: Risk factors for mortality from acute lower respiratory tract infections in young Gambian children. Int J Epidemiol. 1993, 22 (6): 1174-1182. 10.1093/ije/22.6.1174

    Article  PubMed  Google Scholar 

  89. Mtango F, Neuvians D, Broome C, Hightower A, Pio A: Risk factors for deaths in children under 5 years old in Bagamoyo district, Tanzania. Trop Med Parasitol. 1992, 43 (4): 229-233.

    CAS  PubMed  Google Scholar 

  90. Mahalanabis D, Gupta S, Paul D, Gupta A, Lahiri M, Khaled M: Risk factors for pneumonia in infants and young children and the role of solid fuel for cooking: a case–control study. Epidemiol Infect. 2002, 129: 65-71.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  91. O’Dempsey T, McArdle T, Morris J, Lloyd-Evans N, Baldeh I, Laurence B, et al: A study of risk factors for pneumococcal disease among children in a rural area of West Africa. Int J Epidemiol. 1996, 25 (4): 885-893. 10.1093/ije/25.4.885

    Article  PubMed  Google Scholar 

  92. Shah N, Ramankutty V, Premila P, Sathy N: Risk factors for severe pneumonia in children in south Kerala: a hospital-based case–control study. J Trop Pediatr. 1994, 40: 201-206. 10.1093/tropej/40.4.201

    Article  CAS  PubMed  Google Scholar 

  93. Victora C, Fuchs S, Flores J, Fonseca W, Kirkwood B: Risk factors for pneumonia among children in a Brazilian metropolitan area. Pediatrics. 1994, 93: 977-985.

    CAS  PubMed  Google Scholar 

  94. Bassani D, Jha P, Dhingra N, Kumar R: Child mortality from solid-fuel use in India: a nationally-representative case–control study. BMC Public Health. 2010, 10 (491).

  95. Singh-Manoux A: Commentary: Modelling multiple pathways to explain social inequalities in health and mortality. Int J Epidemiol. 2005, 34: 638-639.

    Article  PubMed  Google Scholar 

  96. Smith K, McCracken J, Weber M, Hubbard A, Jenny A, Thompson L, et al: Effect of reduction in household air pollution on childhood pneumonia in Guatemala (RESPIRE): a randomised controlled trial. Lancet. 2011, 378 (9804): 1717-1726. 10.1016/S0140-6736(11)60921-5

    Article  PubMed  Google Scholar 

  97. Budya H, Arofat M: Providing cleaner energy access in Indonesia through the megaproject of kerosene conversion to LPG. Energy Policy. 2011, 39 (12): 7575-7586. 10.1016/j.enpol.2011.02.061.

    Article  Google Scholar 

  98. Bruce N, Rehfuess E, Mehta S, Hutton G, Smith K, et al: Indoor Air Pollution. Disease Control Priorities in Developing Countries. Edited by: Jamison DT. New York: Oxford University Press; 2006, 793-815.

    Google Scholar 

  99. Diez RA: Integrating social and biologic factors in health research: a systems view. Ann Rev Epidemiol. 2006, 17: 569-574.

    Article  Google Scholar 

  100. Pearce N: Commentary: the rise and rise of corporate epidemiology and the narrowing of epidemiology’s vision. Int J Epidemiol. 2007, 36: 713-717. 10.1093/ije/dym152

    Article  PubMed  Google Scholar 

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Eva A Rehfuess.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

MJ’s expertise with causal diagrams was the starting point for the approach, which gradually evolved as part of ER’s PhD. ER developed and tested the approach with regular methodological and conceptual input from MJ, NB and DB. ER and MJ jointly drafted the manuscript, which was reviewed and critically revised by NB and DB. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Rehfuess, E.A., Best, N., Briggs, D.J. et al. Diagram-based Analysis of Causal Systems (DACS): elucidating inter-relationships between determinants of acute lower respiratory infections among children in sub-Saharan Africa. Emerg Themes Epidemiol 10, 13 (2013).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Africa
  • Children
  • Acute lower respiratory infections
  • Pneumonia
  • Health determinants
  • Causal diagrams
  • Multi-factorial causality
  • Systems epidemiology
  • Social epidemiology
  • Environmental epidemiology