Analytic perspective | Open | Published:

# Simpson's Paradox, Lord's Paradox, and Suppression Effects are the same phenomenon – the reversal paradox

*Emerging Themes in Epidemiology***volume 5**, Article number: 2 (2008)

## Abstract

This article discusses three statistical paradoxes that pervade epidemiological research: *Simpson's paradox*, *Lord's paradox*, and *suppression*. These paradoxes have important implications for the interpretation of evidence from observational studies. This article uses hypothetical scenarios to illustrate how the three paradoxes are different manifestations of one phenomenon – the *reversal paradox* – depending on whether the outcome and explanatory variables are categorical, continuous or a combination of both; this renders the issues and remedies for any one to be similar for all three. Although the three statistical paradoxes occur in different types of variables, they share the same characteristic: the association between two variables can be reversed, diminished, or enhanced when another variable is statistically controlled for. Understanding the concepts and theory behind these paradoxes provides insights into some controversial or contradictory research findings. These paradoxes show that prior knowledge and underlying causal theory play an important role in the statistical modelling of epidemiological data, where incorrect use of statistical models might produce consistent, replicable, yet erroneous results.

## Introduction

This article discusses three statistical paradoxes that pervade epidemiological research: *Simpson's paradox*, *Lord's paradox*, and *suppression*. These paradoxes are not just tantalising puzzles of purely academic interest; potentially, they have serious implications for the interpretation of evidence from observational studies. Scenarios which are associated with and can be explained by these paradoxes are discussed. A concise explanation of these paradoxes and an historical overview is also provided. Simulated data based upon the foetal origins of adult diseases hypothesis [1, 2] are used to illustrate how the three paradoxes are different manifestations of one phenomenon – *the reversal paradox* – depending on whether the outcome and explanatory variables are categorical, continuous or a combination of both; this renders the issues and remedies for any one to be similar for all three. All statistical analyses were performed within SPSS 15.0 (SPSS Inc, Chicago, USA).

## Foetal origins hypothesis

The 'foetal origins of adult disease' hypothesis (FOAD), which has evolved into the 'developmental origins of health and disease' (DOHaD) hypothesis [1, 2], was proposed to explain the associations observed between low birth weight and a range of diseases in later life. These associations have been interpreted as evidence that growth retardation *in utero* has adverse long-term effects on the development of vital organ systems which predispose the individuals to a range of metabolic and related disorders in later life. Nevertheless, although an inverse association between birth weight and disease in later life was found in some studies, this relationship was only established in many studies after the current body size variables such as body mass index (BMI), body weight and/or body height were adjusted for in the regression analysis. As body sizes may be in the causal pathway from birth weight to health outcomes in later life, the justification of this adjustment of current body sizes has been questioned recently [3–8].

Using the inverse relationship between birth weight and systolic blood pressure in later life as an example, Figure 1 shows the directed acyclic graphs [9–11] for the possible relationships between the three observed variables: birth weight, current body weight and systolic blood pressure. In Figure 1a, current body weight is on the causal pathway from birth weight to systolic blood pressure, so current body weight is not a genuine confounder and should not be adjusted for. In Figure 1b, there is no relationship between birth weight and current body weight, and therefore the latter is not a confounder for the relationship between birth weight and blood pressure either. However, this model cannot explain the observed positive correlations between birth weight and current body weight in many epidemiological studies. In Figure 1c, current body weight is a confounder because it is ancestor to both birth weight and blood pressure in the directed acyclic graph [9–11]. Obviously, this scenario is implausible in reality because current body weight cannot affect birth weight. In Figure 1d, the observed positive correlation between birth weight and current body weight is due to an unobserved confounder, UC, which affects both birth weight and current body weight. Also, there is no path from birth weight and current body weight [7], i.e. if UC could be identified and measured, birth weight and current body weight would be independent, conditional on UC [12]. More complex causal diagrams for the three variables are possible by incorporating more unobserved variables in the model. However, the four scenarios in Figure 1 are sufficient for our discussion in this study, so we do not pursue them further.

Figure 1a,c and 1d all explain the observed correlation structure amongst birth weight, current body weight and blood pressure equally well, and it is not possible to judge which one is true based upon the observed data. For example, researchers may argue current body weight is a genuine confounder in Figure 1d and therefore should be adjusted for [7]. This can only be confirmed when the unobserved confounder (be parental, genetic, or environmental factors) is identified *and* the conditional independence between birth weight and current weight is satisfied.

Nevertheless, the adjustment of current body weight in the statistical analysis will change the estimated relationship between birth weight and blood pressure, as the adjusted relationship is a conditional relationship. Differences between the unadjusted and adjusted (i.e. unconditional and conditional) relationships frequently cause confusion in the interpretations of statistical analyses and they also give rise to three statistical paradoxes, which we shall explain in the next section.

## Simpson's Paradox

Simpson's paradox [13], or Yule's paradox [14], is a well known statistical phenomenon. It is observed when the relationship between two categorical variables is reversed after a third variable is introduced to the analysis of their association, or alternatively where the relationship between two variables differs within subgroups compared to that observed for the aggregated data. Although first discussed by Karl Pearson in 1899 [15], it is George Udny Yule, once Pearson's assistant, who provides a detailed assessment of this problem in 1903 [14].

### A numerical example

Table 1 provides a summary of a hypothetical survey of 1000 adult males in England based on data simulated using values derived from the literature [16] and surveys conducted by the UK Department of Health [17]. Data are simulated such that the three variables systolic blood pressure (*BP*), birth weight (*BW*), and current body weight (*CW*) are positively correlated: the correlation between *BP* and birth weight (*r*_{BW-BP}) is weak (0.11); whereas the correlations between birth weight and current weight (*r*_{BW-CW}) and between current weight and *BP* (*r*_{CW-BP}) are reasonably strong (0.52 and 0.50, respectively).

Suppose the research question is to investigate whether or not there is an association between low birth weight and high blood pressure in later life. In this hypothetical study, low birth weight is defined as birth weight lower than the population mean (i.e. < 3.5 Kg), and high blood pressure is defined as systolic *BP* greater than the mean value (i.e. > 135 mmHg). The results are summarized in Table 2. It is noted that the probability of developing high blood pressure is 0.272 for subjects with low birth weight and 0.362 for subjects with high birth weight. This indicates that low birth weight has a protective effect of developing high blood pressure. However, when these subjects are stratified according to their current weight (> 90 Kg vs. < = 90 Kg), the risk of developing high blood pressure is consistently higher amongst subjects with low birth weight compared to those with high birth weight. It seems to be quite counter-intuitive that low birth weight has an adverse effect on blood pressure for both subgroups of current weight, yet a protective effect on the groups as a whole.

### Interpretation

In this scenario, there are substantial differences in the numbers of subjects with low birth weight between the two subgroups of current weight, because lower birth weight babies on average are smaller in adulthood. Therefore, the overall relation between low birth weight and high blood pressure is a sum of weighted relations between the two variables in each subgroup. A graphical representation of this paradox, first proposed by Paik [18], is given in Figure 2. Due to a greater influence of the lower risk of developing high blood pressure in the subjects with low birth weight and lower current weight, the adverse relation is reversed in the whole-group analysis (solid line in Figure 2). Note that, in the following two scenarios, the adjustment for current weight will not change the relationship between birth weight and *BP*[12], if: (a) there is no difference in the percentages of subjects with high current weight between the two subgroups of birth weight (i.e. no correlation between birth weight and current weight); or (b) there is no association between *CWb* and *BP* in the subgroups stratified by *BWb* (i.e. the association between *BP* and current weight is entirely caused by the association between birth weight and *BP*). The problem is whether the relation between low birth weigh and high blood pressure in the whole group provides an answer to the intended research question, or whether the relation in the two subgroups does this. In other words, should *CWb* be considered a confounder and hence adjusted for in the statistical models?

In statistical language, adjustment for current body weight represents a *conditional* relationship; the relationship between birth weight and blood pressure is *conditional* on current body weight. Although there are substantial differences in the numbers of subjects with low birth weight between the two subgroups of current weight, the adjustment for *CWb* indicates that *if* all subjects had the same level of current body weight, subjects with low birth weight would have a greater risk of developing high blood pressure, i.e. the adjustment of *CWb* erases the greater influence of subjects with low birth weight and lower current weight on the association between birth weight and blood pressure, as people born smaller in general grow into a smaller adults.

Simpson's paradox has broad implications for epidemiological research since it indicates that making causal inference from any non-randomised study (e.g. cohort studies, case-control studies) can be difficult, because, whilst it is possible to control for the differences between cases and controls, there will always be the possibility that an unobserved and therefore unadjusted confounder might attenuate the association (or even reverse its direction) between exposure and outcome, due to the difference in the mean values or the distribution of confounders between the case or control group. Nevertheless, whether or not there is any unobserved (and therefore unadjusted) confounder may not always be an issue of debate, because in most epidemiologic studies, the important confounders are generally known. The controversy in making causal inference arises in situations where the adjusted variable may not be a genuine confounder [6, 7, 19, 20]. Within epidemiology, Simpson's paradox is closely linked to the concepts of confounding [9] and incollapsibility [10].

## Lord's Paradox

Lord's paradox was named after two short articles in the psychology literature by Frederick M Lord regarding the use of analysis of covariance (ANCOVA) within non-experimental studies [21, 22]. In contrast to Simpson's paradox, little discussion of Lord's paradox can be found in the statistical and epidemiological literature [23], though social scientists have shown a great interest in this phenomenon [24–28]. Lord's paradox refers to the relationship between a continuous outcome and a categorical exposure being reversed when an additional continuous covariate is introduced to the analysis. One specific example is that the additional covariate is a measure made at baseline within a longitudinal study, where the outcome is the same variable measured some time later (e.g. following an intervention). Therefore, the aim is to measure change in the outcome by adjusting for the baseline measurements, and the categorical covariate might be the exposure/control groups – this is the familiar design for ANCOVA. This controversy was first discussed in 1910 between Karl Pearson and Arthur C Pigou when they debated the role of parental alcoholism and its impact on the performance of children [29].

### A numerical example

Considering the previous numerical example for Simpson's paradox, we examine current body weight (*CW*) and blood pressure (*BP*) as continuous variables, retaining birth weight as a binary (*BWb*). The two-sample *t*-test shows that, on average, the blood pressure of subjects with higher birth weight is 2.49 mmHg (95%CI: 1.12, 3.87) *greater* than those with lower birth weight. However, using ANCOVA (i.e. linear regression with a (categorical) group-allocating variable and with the adjustment of a continuous confounding variable), adjusting for current weight as a covariate, the blood pressure of subjects with higher birth weight becomes 2.94 mmHg (95%CI: 1.12, 3.87) *lower* than those with lower birth weight.

### Interpretation

Differences in the results of the two analyses are due to adjustment in the second analysis for current body weight (*CW*). As current weight is positively associated with both *BP* and *BWb*, it is expected that the relation between *BP* and *BWb* will change when current weight is adjusted for. In randomised controlled trials, mean values of the adjusted baseline covariate are expected to be approximately equal across treatment and control groups since, assuming randomisation has been achieved, baseline variation should be *within* groups rather than *between* groups), i.e. there is no correlation between the group variable and adjusted covariate (i.e. in our numerical example, no correlation between *BWb* and current weight). In such circumstances it is well known that using ANCOVA achieves the same estimated treatment difference across groups as found by the *t*-test, though the former will generally have greater power [30, 31]. Recall our previous discussion of two scenarios in the section on Simpson's paradox, where the adjustment for *CWb* will not change the relationship between *BWb* and *BP*. Randomised controlled trials may thus be seen as a special case of scenario (a) where there is no difference in the mean current weight between the two sub-groups of birth weight.

Figure 3 is a three-dimensional representation of the associations amongst the three variables. Although the solid black line shows that subjects with higher birth weight (coded as 1) have on average a greater blood pressure than those with lower birth weight (coded as 0), the various horizontal red lines with a negative slope indicate that at each level of current weight, subjects with higher birth weight have a lower mean blood pressure than those with lower birth weight.

In statistical language, results from the regression analyses are *conditional* on both birth weight groups having equal mean current weight in later life, and if true there would be a benefit from low birth weight in terms of blood pressure. However, since the two groups have a different mean current weight in later life, results from the regression analysis need to be interpreted with caution. In Simpson's paradox, the discussion surrounds the differences in results between unconditional and conditional risk/probability, and in Lord's paradox, discussion is around the differences in results between unconditional and conditional means.

## Suppression

Of the three paradoxes, suppression effects within multiple regression are probably the least recognised amongst clinical and epidemiological researchers, though the suppression phenomenon has been extensively discussed by statisticians [32–34] and methodologists from the social sciences [35, 36]. The classical definition of suppression is that a potential covariate that is unrelated to the outcome variable (i.e. has a bivariate correlation of zero) increases the overall model fit within regression (as assessed by *R*^{2}, for instance) when this covariate is added to the model. This seems counter-intuitive and needs some explanation.

Suppose *y* is the outcome variable, and *x*_{1}and *x*_{2} are two covariates (i.e. 'explanatory' variables). Denote the bivariate Pearson correlation between *y* and *x*_{1} as *r*_{y 1}; the correlation between *y* and *x*_{2} as *r*_{y 2}; and the correlation between *x*_{1} and *x*_{2} as *r*_{12}. Within multiple regression, where *y* = *b*_{0} + *b*_{1}*x*_{1} + *b*_{2}*x*_{2}, the standardized partial regression coefficients of *b*_{1} and *b*_{2} for *x*_{1} (*β*_{1}) and *x*_{2} (*β*_{2}), respectively, are given by [37]:

Now suppose that *y* is adult blood pressure (*BP*), *x*_{1} birth weight (*BW*), and *x*_{2} adult current weight (*CW*). Many studies have shown the bivariate correlation (*r*_{y1}) between *BP* (*y*) and birth weight (*x*_{1}) to be negative though weak [38, 39], whilst others show this to be positive [40]; for illustrative purposes only, assume that *r*_{y1}is zero. Many studies show the bivariate correlation (*r*_{y2}) between *BP* (*y*) and current weight (*x*_{2}) to be positive [41]. When *BP* is regressed on birth weight and current weight, the model fit assessed by *R*^{2} becomes [37]:

Since *r*_{y1}is equal to zero, equation (2) becomes:

Since $1-{r}_{12}^{2}$ will always be smaller than 1, ${R}^{2}=\frac{{r}_{y2}^{2}}{1-{r}_{12}^{2}}$ will always be greater than ${r}_{y2}^{2}$. By including *x*_{1} in the regression model, more variance of *y* is 'explained', i.e. the predictability of the model increases. However, this seems counterintuitive, since the zero bivariate correlation between *y* and *x*_{1} (*r*_{y1}= 0) indicates that no more variance in *y* can be explained by *x*_{1}. So where does the additional 'explained variance' in *y* come from when *x*_{1} is entered in the regression model? The answer is that the additional explained variance in *y* comes from *x*_{2}.

Although *x*_{1} is not correlated with *y*, it is positively correlated with *x*_{2}, which in turn is positively correlated with *y*. When *x*_{1} is entered in the model, it 'suppresses' the part of *x*_{2} that is uncorrelated with *y*, thereby increasing overall predictability. In other words, the role of *x*_{1} in the model is to suppress (reduce) the noise (the uncorrelated component of *x*_{2}) within the correlation between *y* and *x*_{2}, as though any uncertainty in *x*_{2} 'predicting' *y* is 'explained' by *x*_{1}.

It is not only *R*^{2} that is increased; the coefficient for *x*_{2}, ${\beta}_{2}=\frac{{r}_{y2}}{1-{r}_{12}^{2}}$, becomes greater than *r*_{y 2}. Furthermore, although *r*_{y 1}is equal to zero, *β*_{1} is not zero and becomes negative: ${\beta}_{1}=\frac{-{r}_{12}{r}_{y2}}{1-{r}_{12}^{2}}$. In general, the greater the positive correlation between *x*_{1} and *x*_{2}, the greater the absolute value of *β*_{1} and *β*_{2}. However, having *r*_{y 1}equal zero (or being negative) is not necessary to observe suppression; *r*_{y 1}may be positive and *x*_{1} may still be a suppressor [35].

It was Paul Horst, in 1941, who first explored this curious phenomenon within educational research [42], and in the last few decades, many statisticians have been interested in this topic [33–35]. There are still very few discussions within the clinical and epidemiological literature regarding the impact of suppression (i.e. the impact on the changes in the regression coefficients and *R*^{2}) on the interpretation of non-randomised studies whilst making statistical adjustment for covariates within regression [12, 43].

### A numerical example

Considering the previous numerical examples for Simpson's paradox and Lord's paradox, all three variables are now treated as continuous. Simple regression shows a positive association between *BP* and birth weight: the regression coefficient for birth weight is 1.861 mmHg/Kg (95% CI: 0.770, 2.953). Simple regression also reveals a positive association between *BP* and current weight: the regression coefficient for current weight is 0.382 (95% CI = 0.341, 0.423) mmHg/Kg. Following the practice of many previous studies, *BP* is regressed on birth weight and current weight simultaneously and the partial regression coefficients for birth weight and current weight are -3.708 (95% CI = -4.794, -2.622) and 0.465 (95% CI = 0.418, 0.512) mmHg/Kg respectively, and both are highly statistically significant (Table 3). Thus, after adjusting for current weight, birth weight has a significant inverse association with *BP*, suggesting that hypertension is associated with lower birth weight.

It is noteworthy that not only the association of birth weight with *BP* is reversed (coefficients change from 1.861 to -3.708 mmHg/Kg), but that the impact of current weight also increases from 0.382 to 0.465 mmHg/Kg. The *R*^{2} for multiple regression is 0.283, which is greater than the sum of the squared correlations for birth weight ((0.105)^{2} = 0.011) and current weight ((0.501)^{2} = 0.251), i.e. 0.262. Therefore, the explained variance of *BP* is greater than the sum of the explained variances for the two simple regression models.

Figure 4 is a three-dimensional representation of the associations amongst the three continuous variables. Although the solid black line shows that birth weight has a positive association with blood pressure, the various horizontal red lines with a negative slope indicate that at each level of current weight, birth weight has an inverse relationship with blood pressure.

### Interpretation

In the hypothetical foetal origins example, the strength of association between *BP* and birth weight differs considerably between simple regression and multiple regression. Which model genuinely reflects their true causal relationship depends on whether or not current weight should be adjusted for; whether or not current weight is a confounder for the relationship between *BP* and birth weight, which depends upon biological and clinical knowledge, not *ad hoc* statistical analyses and changes in the estimated effects [11]. The question is whether or not it is also biologically and clinically feasible to isolate the independent effect of birth weight on *BP* by removing the impact of current weight on *BP*[3, 5–7, 44]. In other words, changes in the regression coefficient for birth weight caused by current weight being adjusted for in multiple regression is irrelevant to whether or not current weight is viewed to be a confounder. The definition of confounding depends upon the *a priori* causal model assumed by the investigator [8, 11], which then dictates which statistical model is adopted.

In statistical language, results from adjustment for current weight are *conditional* on all babies growing to the same size in adulthood. In Simpson's paradox, the 'paradox' is due to differences in the results between unconditional and conditional risk/probability, and in Lord's paradox, it is due to differences in the results between unconditional and conditional means. In suppression, the paradox is due to differences in the results between the marginal (i.e. unconditional) *BP-* birth weight relation and the *BP-* birth weight relation conditional on current weight.

## Discussion

The reversal paradox is often used as the generic name for *Simpson's paradox*, *Lord's Paradox*, and *suppression* (see Table 4). Whilst the original definition and naming of the reversal paradox was derived from the notion that the direction of a relationship between two variables might be reversed after a third variable is introduced, this nevertheless may generalise to scenarios where the relationship between two variables is enhanced, not reduced or reversed, after the third variable is introduced (as with many studies on the foetal origins hypothesis).

In non-randomised studies, the reversal paradox can often occur due to 'controlling' for what is typically termed a confounder, even though a clear definition of what is meant by 'confounder' is rarely provided (contingent on understanding its role in the biological/clinical process being modelled). Differences in the strength or even direction of any association between outcome and exposure might give rise to contradictory interpretations regarding potential *causal* relationships. Furthermore, it is very difficult, if not impossible, to compare results across studies where many varied attempts are made to control for different confounders, especially in the absence of any consistent reasoning given for the choice of confounders. In some situations, statistical adjustment might introduce bias rather than eliminate it [45].

It might be suggested that the adjustment of current weight in our foetal origins example can be viewed as estimations of direct and indirect effects, such as those in path analysis or structural equation modelling. Recall Figure 1a, the path from birth weight to BP is to estimate the direct effect of birth weight → BP, and then the path from birth weight → current weight → BP is to estimate the indirect effect. For instance, in the model 3 of Table 3, the regression coefficient for birth weight, -3.708, is the direct effect, and the indirect effect is derived from 0.465 (the regression coefficient for current weight in model 3) multiplied by 11.976 (the simple regression coefficient for birth weight when current weight is regressed on birth weight) = 5.569. The total effect is therefore -3.708 + 5.569 = 1.861, which is the simple regression coefficient for birth weight in the model 1 of Table 3. Our reservation with interpreting the results from model 3 as the partition of the total effect into direct and indirect effect is that many variables, such as current height and current BMI, can be put in between birth weight and BP, and it can be claimed that there is more than one indirect effect. Furthermore, any body size measured after birth, for example, body weight at year one, year two etc, can be adjusted for in the model and presumably used to estimate the indirect effects and direct effect. Whilst the total effect of birth weight on BP is not affected by the numbers of intermediate body size variables in the model, the estimation of 'direct' effect differs when different intermediate variables are adjusted for. Unless there is experimental evidence to support the notion that there are indeed different paths of direct and indirect effects from birth weight to BP, we are cautious of using such terminology to label the results from multiple regression, as with model 3. In other words, to determine whether the unconditional or conditional relationship reflects the true physiological relationship between birth weight and blood pressure, experiments in which birth weight and current weight can be manipulated are required in order to estimate the impact of birth weight on blood pressure.

Although the three statistical paradoxes occur in different types of variables, they share the same characteristic: the association between two variables can be reversed, diminished, or enhanced when another variable is statistically controlled for. Understanding the concepts and theory behind these paradoxes will provide insights into some of the controversial or contradictory results from previous research. Prior knowledge and theory play an important role in the statistical modelling of non-randomised data. Incorrect use of statistical models might produce consistent, replicable, yet erroneous results.

## References

- 1.
Barker DJ: Fetal origins of coronary heart disease. BMJ. 1995, 311:171-4.

- 2.
Barker DJ, Eriksson JG, Forsen T, Osmond C: Fetal origins of adult disease: strength of effects and biological basis. Int J Epidemiol. 2002, 31:1235-9. 10.1093/ije/31.6.1235

- 3.
Paneth N, Ahmed F, Stein AD: Early nutritional origins of hypertension: a hypothesis still lacking support. Journal of Hypertensio. 1996, 14 (5): S121-S129.

- 4.
Lucas A, Fewtrell MS, Cole TJ: Fetal origins of adult disease-the hypothesis revisited. BMJ. 1999, 319: 245-9.

- 5.
Huxley RR, Neil A, Collins R: Unravelling the fetal origins hypothesis: is there really an inverse association between birth weight and subsequent blood pressure?. Lancet. 2002, 360: 659-65. 10.1016/S0140-6736(02)09834-3

- 6.
Tu YK, West R, Ellison GTH, Gilthorpe MS: Why evidence for the fetal origins of adult disease might be a statistical artifact: the "reversal paradox" for the relation between birth weight and blood pressure in later life. Am J Epidemiol. 2005, 161: 27-32. 10.1093/aje/kwi002

- 7.
Weinberg CR: Invited commentary: Barker meets Simpson. Am J Epidemiol. 2005, 161: 33-5. 10.1093/aje/kwi003

- 8.
De Stavola BL, Nitsch D, dos Santos Silva I, McCormack V, Hardy R, Mann V, Cole TJ, Morton S, Leon DA: Statistical issues in life course epidemiology. Am J Epidemiol. 2006, 163: 84-96. 10.1093/aje/kwj003

- 9.
Pearl J: Causality: Models, Reasoning, and Inference. Cambridge: Cambridge University Press; 2000.

- 10.
Greenland S, Robins JM, Pearl J: Confounding and collapsibility in causal inference. Stat Sci. 1999, 14: 29-46. 10.1214/ss/1009211805

- 11.
Jewell NP: Statistics for Epidemiology. London: Chapman & Hall; 2004.

- 12.
Cox DR, Wermuth N: A general condition for avoiding effect reversal after marginalisation. J R Statist Soc B. 2003, 65: 937-941. 10.1111/1467-9868.00424

- 13.
Simpson EH: The interpretation of interaction in contingency tables. J R Stat Soc Ser B. 1951, 13: 238-41.

- 14.
Yule GU: Notes on the theory of association of attributes in statistics. Biometrika. 1903, 2: 121-34. 10.1093/biomet/2.2.121

- 15.
Pearson K, Lee A, Bramley-Moore L: Mathematical contributions to the theory of evolution: VI – Genetic (reproductive) selection: Inheritance of fertility in man, and of fecundity in thoroughbred racehorses. Philos Trans R Soc Lond A. 1899, 192: 257-330. 10.1098/rsta.1899.0006

- 16.
Hennessy E, Alberman E: Intergenerational influences affecting birth outcome. II. Preterm delivery and gestational age in the children of the 1958 British birth cohort. Paediatr Perinat Epidemiol. 1998, 12 (1): 61-75. 10.1046/j.1365-3016.1998.0120s1061.x

- 17.
Website of the Department of Health, United Kingdom. http://www.doh.gov.uk

- 18.
Paik M: A graphical representation of a three-way contingency table: Simpson's paradox and correlation. Am Stat. 1985, 39: 53-54. 10.2307/2683907. 10.2307/2683907

- 19.
Hernandez-Diaz S, Schisterman EF, Hernan M: The "birth weight" paradox uncovered?. Am J Epidemiol. 2006, 164: 1115-1120. 10.1093/aje/kwj275

- 20.
Wilcox A: Invited Commentary: The perils of birth weight – a lesson from directed acyclic graphs. Am J Epidemiol. 2006, 164: 1121-1123. 10.1093/aje/kwj276

- 21.
Lord FM: A paradox in the interpretation of group comparisons. Psychol Bull. 1967, 68: 304-5. 10.1037/h0025105

- 22.
Lord FM: Statistical adjustments when comparing preexisting groups. Psychol Bull. 1969, 72: 337-8. 10.1037/h0028108. 10.1037/h0028108

- 23.
Glymour MM, Weuve J, Berkman LF, Kawachi I, Robins JM: When is baseline adjustment useful in analysis of change? An example with education and cognitive change. Am J Epidemiol. 2005, 162: 267-278. 10.1093/aje/kwi187

- 24.
Hand D: Deconstructuring statistical questions. J R Stat Soc Ser A Stat Soc. 1994, 157: 317-56. 10.2307/2983526. 10.2307/2983526

- 25.
Campbell DT, Kenny DA: A primer on regression artefact. Guildford: The Guilford Press; 1999.

- 26.
Mohr LB: Regression artifacts and other customs of dubious desert. Eval Program Plann. 2000, 23: 397-409. 10.1016/S0149-7189(00)00029-X. 10.1016/S0149-7189(00)00029-X

- 27.
Reichardt CS: Regression facts and artifacts. Eval Program Plann. 2000, 23: 411-4. 10.1016/S0149-7189(00)00030-6. 10.1016/S0149-7189(00)00030-6

- 28.
Wainer H: Adjusting for differential base rates: Lord's paradox again. Psychol Bull. 1991, 109: 147-51. 10.1037/0033-2909.109.1.147

- 29.
Stigler SM: Statistics on the Table. Cambridge, Massachusetts: Harvard University Press; 1999.

- 30.
Vickers AJ, Altman DG: Analysing controlled trials with baseline and follow up measurements. BMJ. 2001, 323: 1123-4. 10.1136/bmj.323.7321.1123

- 31.
Tu YK, Blance A, Clerehugh V, Gilthorpe MS: Statistical power for analyses of changes in randomized controlled trials. J Dent Res. 2005, 84: 283-287.

- 32.
Lewis JW, Escobar LA: Suppression and enhancement in bivariate regression. Statistician. 1986, 35: 17-26. 10.2307/2988294. 10.2307/2988294

- 33.
Bertrand PV, Holder RL: A quirk in multiple regression: the whole regression can be greater than the sum of its parts. Statistician. 1988, 37: 371-4. 10.2307/2348761. 10.2307/2348761

- 34.
Sharpe NR, Roberts RA: The relationship among sums of squares, correlation coefficients and suppression. Am Stat. 1997, 51: 46-48. 10.2307/2684693. 10.2307/2684693

- 35.
Friedman L, Wall M: Graphical views of suppression and multicollinearity in multiple linear regression. Am Stat. 2005, 127-136.

- 36.
Cohen J, Cohen P: Applied multiple regression/correlation analysis for the behavioural sciences. London: LEA; 1983.

- 37.
Pedhazur EJ: Multiple regression in behavioral research: Explanation and prediction. Fort Worth: Harcourt; 1997.

- 38.
Stocks NP, Davey Smith G: Blood pressure and birth weight in the first year university student aged 18–25. Public Health. 1999, 113: 273-7. 10.1016/S0033-3506(99)00179-1

- 39.
Williams S, Poulton R: Birth size, growth, and blood pressure between the ages of 7 and 26 years: failure to support the fetal origins hypothesis. Am J Epidemiol. 2002, 155: 849-52. 10.1093/aje/155.9.849

- 40.
McNeill G, Tuya C, Campbell DM, Haggarty P, Smith WCS, Masson LF, Cumming A, Broom I, Haites N: Blood pressure in relation to birth weight in twins and singleton controls matched for gestational age. Am J Epidemiol. 2003, 158: 150-5. 10.1093/aje/kwg130

- 41.
Tu YK, Gilthorpe MS, TH Ellison GTH: What is the effect of adjusting for more than one measure of current body size on the relation between birth weight and blood pressure?. J Hum Hypertens. 2006, 20: 646-657. 10.1038/sj.jhh.1002044

- 42.
Horst P: The role of prediction variables which are independent of the criterion. The Prediction of Personal Adjustment. Edited by: Horst P. New York: Social Science Research Council; 1941, 431-6.

- 43.
MacKinnon DP, Krull JL, Lockwood CM: Equivalence of the mediation, confounding and suppression effect. Prev Sci. 2000, 1: 173-81. 10.1023/A:1026595011371

- 44.
Tu YK, Ellison GTH, Gilthorpe MS: Growth, current size and the role of the 'reversal paradox' in the foetal origins of adult disease: an illustration using vector geometry. Epidemiol Perspect Innov. 2006, 3: 9. 10.1186/1742-5573-3-9

- 45.
Von Elm E, Egger M: The scandal of poor epidemiological research. BMJ. 2004, 329: 868-9. 10.1136/bmj.329.7471.868

## Acknowledgements

We are very grateful for the constructive comments of two reviewers. One reviewer brought to our attention of the excellent paper by Cox and Wermuth [9]. YKT conceived the ideas of this study and wrote the first draft. DG and MSG contributed to the discussion of these ideas and writing of the final draft.

## Author information

## Additional information

### Competing interests

The author(s) declare that they have no competing interests.

## Authors’ original submitted files for images

## Rights and permissions

## About this article

#### Received

#### Accepted

#### Published

#### DOI

### Keywords

- Birth Weight
- High Blood Pressure
- Current Weight
- High Birth Weight
- Foetal Origin