 Research
 Open Access
 Open Peer Review
 Published:
Betweencentre differences and treatment effects in randomized controlled trials: A case study in traumatic brain injury
Trials volume 12, Article number: 201 (2011)
Abstract
Background
In Traumatic Brain Injury (TBI), large betweencentre differences in outcome exist and many clinicians believe that such differences influence estimation of the treatment effect in randomized controlled trial (RCTs). The aim of this study was to assess the influence of betweencentre differences in outcome on the estimated treatment effect in a large RCT in TBI.
Methods
We used data from the MRC CRASH trial on the efficacy of corticosteroid infusion in patients with TBI. We analyzed the effect of the treatment on 14 day mortality with fixed effect logistic regression. Next we used random effects logistic regression with a random intercept to estimate the treatment effect taking into account betweencentre differences in outcome. Betweencentre differences in outcome were expressed with a 95% range of odds ratios (OR) for centres compared to the average, based on the variance of the random effects (tau^{2}). A random effects logistic regression model with random slopes was used to allow the treatment effect to vary by centre. The variation in treatment effect between the centres was expressed in a 95% range of the estimated treatment ORs.
Results
In 9978 patients from 237 centres, 14day mortality was 19.5%. Mortality was higher in the treatment group (OR = 1.22, p = 0.00010). Using a random effects model showed large betweencentre differences in outcome (95% range of centre effects: 0.27 3.71), but did not substantially change the estimated treatment effect (OR = 1.24, p = 0.00003). There was limited, although statistically significant, betweencentre variation in the treatment effect (OR = 1.22, 95% treatment OR range: 1.171.26).
Conclusion
Large betweencentre differences in outcome do not necessarily affect the estimated treatment effect in RCTs, in contrast to current beliefs in the clinical area of TBI.
Background
Traumatic brain injury (TBI) is a major health and socioeconomic problem throughout the world. It is the field with one of the greatest unmet needs in medicine and public health [1]. Not only is TBI a major cause of death and disability, incurring great personal suffering to victims and relatives, but it also leads to huge direct and indirect costs to society [2].
Many randomized controlled trials (RCTs) have been performed to investigate the effectiveness of new therapies in TBI, but very few have convincingly demonstrated benefit [3]. Multiple factors may have contributed to this disappointing picture, including RCTs in TBI being too small to detect or refute reliably moderate but clinically important benefits or hazards of treatment [4]. To design trials of sufficient size to detect moderate treatment effects, participation of multiple centres is required.
Considerable betweencentre differences in patient outcome have been reported in TBI [5–7]. Recently it was shown that a 3.3fold difference between centres in the odds of having an unfavourable outcome exist (p < 0.001), which was not explained by random variation or patient characteristics [8].
Many clinicians in the field of TBI believe that such betweencentre differences in outcome influence the chances of demonstrating a treatment effect in RCTs [7, 9]. The aim of this study is to assess the effect of betweencentre differences on estimates of the treatment effect in a large RCT in TBI.
Methods
Data
We used the individual patient data of the MRC CRASH trial. The CRASH trial (corticosteroid randomisation after significant head injury) is a large, international, randomised placebocontrolled trial of the effect of early administration of 48 h infusion of corticosteroids (methylprednisolone) on risk of death and disability after head injury. Patients from 239 centres in 48 countries were enrolled between April 1999 and May 2004, when the steering committee stopped recruitment because of a higher 14 day mortality rate in the treatment group [10].
Analysis
We first assessed whether there were differences in outcome between the centres in the CRASH trial, using a random effect logistic regression model (Appendix 1). In this model the outcome of a patient is only determined by the centre that treats the patient. Since some centres only treat a small number of patients, part of the betweencentre differences are caused by random variation. The random effect model estimates the betweencentre differences beyond random variation. The betweencentre differences are expressed as τ^{2}, which is the variance of the random effects.
Part of the differences between centres may be caused by the fact that centres are from a particular country. To separate betweencentre differences from betweencountry differences we extended the random effect model with a country level.
Because part of the betweencentre effect may be explained by differences in patient characteristics, we adjusted the betweencentre differences in outcome for age, Glasgow Coma Scale (GCS) and pupil reactivity at admission. These are the three main generally accepted prognostic factors in TBI [11, 12]. Age and GCS (a scale from 115) where treated as continuous variables and pupil reactivity as a binary variable (both pupils reactive versus one or both unreactive). So now the outcome of a patient is determined by patient characteristics and centre.
The differences between centres in outcome were expressed in a 95% range of odds ratios for centres compared to the average [13]. To avoid confusion with the odds ratio of the treatment effect we refer to this range as the 95% centre effect range.
Next we estimated the treatment effect with and without taking the betweencentre differences into account. We first analyzed the univariate effect of the treatment on 14 day mortality with usual fixed effect logistic regression. Centre effects were ignored, which is a common approach also in multicentre trials. We considered this as the reference strategy.
We furthermore use a random effect model to estimate the treatment effect. The outcome is also determined by the centre, so the treatment effect is adjusted for between centredifferences. This approach assumes a uniform treatment effect across centres. This means we expect the treatment to have equal effects in each centre. As a second approach we used a random effect logistic regression model with interaction between centre and treatment to asses whether the treatment effect varied between the centres. The variation in estimated treatment effect was expressed in a 95% range of the estimated treatment effect across centres. We compared the estimates of the treatment effect and the pvalues in the two approaches with the reference strategy.
The random effect estimates of the individual centres for both outcome and treatment effect were plotted with 95% posterior intervals.
Statistical analysis where performed in R statistical software 2.7.2 using the Design and lme4 libraries (R Foundation for Statistical Computation, Vienna). Random effect models were fitted with Adaptive Gaussian Quadrature with 10 qpoints.
This particular analysis did not need ethical approval.
Results
Descriptives
In total 10,008 patients were included in the RCT. We excluded 30 patients with missing 14 day outcome, leaving 9978 patients from 237 centres for the analyses. After 14 days 1,948 (19.5%) of the patients had died, with higher mortality in the treatment group. (Table 1)
Betweencentre differences
There was a large difference between centres in outcome (τ^{2} _{outcome, centre} = 0.447, p < 0.00001). The corresponding 95% range of centre effects was 0.27 3.71 (Table 2). This means that in centres with the lowest mortality (2.5^{th} percentile) the odds of dying was 0.27 times the average, while in the centres the highest mortality (97.5^{th} percentile) the odds of dying was 3.71 times the average. After adjustment for age, GCS and pupil reactivity the betweencentre in outcome increased to τ^{2} _{outcome, centre} = 0.620 (p < 0.00001) with a corresponding 95% range of centre effects of 0.21 4.68. Figure 1 shows the estimated adjusted odds ratios for mortality for each centre, compared to the average, with 95% posterior intervals.
Part of the differences in outcome between centres were actually differences between countries. When taking into account that centres are from a particular country, the range of betweencentre differences decreased to 0.392.58 (τ^{2} _{outcome, centre  country} = 0.235, p < 0.00001). The range of betweencountry differences was 0.26 to 3.88 (τ^{2} _{outcome, centre  country} = 0.470, p < 0.00001).
Treatment effect
In the reference strategy, the univariate fixed effect logistic regression odds ratio (OR) for treatment was 1.22 (p = 0.0001, Table 3).
Our first approach of adjusting for the betweencentre heterogeneity resulted in an OR for the treatment effect of 1.24 (p = 0.00003). With our second approach we estimated a varying treatment effect between the centres. The mean OR was 1.22 (p = 0.00029). The treatment effect heterogeneity was small, but statistically significant (τ^{2} _{treaetment effect} = 0.02, p < 0.00001). The corresponding 95% range of the estimated treatment effects across centres was 1.171.26 (Figure 2)
Discussion
Although we found large betweencentre differences in outcome in the CRASH trial, taking these into account did not substantially change the estimated treatment effect. Neither did we see major differences in treatment effect by centre. This study provides no support for the hypothesis that betweencentre differences in outcome affect the chances of demonstrating a treatment effect in RCTs, in contrast to current beliefs in this clinical area [7, 9].
Considering differences between centres in outcome and in estimated treatment effect could be of importance from two perspectives. First, betweencentre heterogeneity in the treatment effect between may indicate limited generalizability, which is of importance for example when registering a drug in a particular country. In our study there was no clinically meaningful heterogeneity in overall treatment effect. Although the betweencentre differences in the treatment effect were statistically significant, the 95% range was small (1.171.26). Clearly, determining generalizability is not solely a statistical issue but requires a clinical judgement to the extent to which the trial results might apply to another population.
Some trials have estimated the heterogeneity of the treatment effect between centres or countries or regions, but did not use random effect modelling. The PLATO study (The Study of Platelet Inhibition and Patient Outcomes) compared two platelet inhibitors (Ticagrelor versus Clopidogrel) for prevention of cardiovascular events in patients with acute coronary syndrome. The overall treatment effect was a hazard ratio (HR) of 0.84 in favour of Ticagrelor. The treatment effect was also tested in four different geographic regions separately; AsiaAustralia (N = 1,714), CentralSouth America (N = 1,237), EuropeMiddle EastAfrica (N = 13,859), and North America (N = 1,814). In Europe the estimated HR was 0.80 (95% CI: 0.720.90). The HRs in AsiaAustralia, CentralSouth America were 0.80 and 0.86, both non statistically significant. The estimated HR in North America was however 1.25 (95% CI: 0.931.67). The authors state that "the difference in results between patients enrolled in North America and those enrolled elsewhere raises the questions of whether geographic differences between populations of patients or practice patterns influenced the effects of the randomized treatments, although no apparent explanations have been found."
This interpretation shows the importance to distinct statistical from clinical reasoning. Although the statistical analysis showed significant differences between geographic regions in the PLATO trial, which could be an indication of limited generalizability, the authors have no biological or mechanistic explanation for the heterogeneity of the treatment effect and no heterogeneity was expected on beforehand. In such a situation were region specific estimates of the treatment effect are desired, or when heterogeneity in the treatment effect is expected, we would recommend to use a random effect model to estimate the betweenregion differences in treatment effect. On the other hand, a limited number of centres, countries or regions, complicates estimation of the heterogeneity in treatment effect.
Second, it is thought that heterogeneity between centres might reduce statistical power to detect the treatment effect.^{9} Providing that a trial is large enough, randomization will ensure that the intervention and control group are similar with regard to known and unknown confounders [10]. As expected, our study showed that taking into account betweencentre differences did not affect statistical significance.
Several explanations can be given for our findings. First, differences in outcome between centres in RCTs may be caused by patient characteristics, which we adjusted for in this analysis. We may not expect that patient characteristics result in differences in treatment effect between centres if the treatment is assumed to work for all patients included in the trial. Secondly there may be differences in care. If these only affect the baseline event rate (e.g. fewer ICU capacity) the treatment effect is not likely to be influenced. In contrast there could be differences in care interacting with the treatment, e.g. if time to hospital arrival is structurally longer in some places, an acute treatment may be less effective. If such an interaction is expected, it would usually be captured in inclusion criteria, such as inclusion within a certain time after injury. In our study we found large differences in outcome between the centres but limited variability in the treatment effect. In other words, there was no substantial interaction between centre and treatment, although such an interaction might have been expected since the CRASH trial comprised an acute treatment and was conducted in low to high income countries. This is also an important finding from the perspective of standardisation of care in trials, which some consider very important [9]. Our study suggests that if nonstandardized care only influences the absolute risk and does not interact with the treatment, there is no reason to put much effort in standardizing care.
We consider our results to be applicable to drug interventions, which work on physiological mechanisms. Trials investigating a more complex intervention such as surgery or a complex treatment strategy may be more sensitive to differences in quality of care. The effect of outcome difference on treatment effect is not expected to be related to the magnitude of the treatment effect. We recognize that further studies are required to confirm or refute these findings for other types of interventions and for other diseases. Moreover it is crucial to think in advance on the mechanism of the treatment, and whether heterogeneity or homogeneity of the treatment effect by centre is expected.
In this study we have assessed heterogeneity of the treatment effects on a relative scale, but we can also use an absolute scale (risk difference). We found that there is no heterogeneity on the relative scale, despite heterogeneity in the absolute risks per centre. This combination implies that there is heterogeneity in treatment effects on an absolute scale, which is important to realize when considering treatment for individuals [14].
The demonstration of hetero or homogeneity in treatment effects by country or centre in the single study is conceptually the same as demonstration hetero or homogeneity is a metaanalysis. The CRASH trial could be seen as a prospective metaanalysis of 40 trials in 40 different countries. A simple way showing the heterogeneity in treatment effects would be to present the results of a forest plot metaanalysis and test for heterogeneity. This was done for the CRASH trial (data not shown), also not indicating heterogeneity.
Our finding that betweencentre differences were not explained by patient characteristics corresponds to previous studies in TBI.^{8} Part of the betweencentre differences were actually betweencounty differences. This could be an indication of centredifferences being caused by structural differences between countries such as availability of resources and organisation of trauma care. The exact explanation of outcome differences between centres and countries requires further study.
Our study has some limitations. First, we did not consider differences in data quality between the centres, which might affect the estimated treatment effect [7]. Second, the CRASH might be considered an exception in the sense that the treatment was harmful. However, it is unlikely that our results would depend on the direction of the treatment effect.
Conclusion
Our study shows that there were large between centre differences in the CRASH trial, which had no clinically meaningful effect on the estimated treatment effect. Betweencentre differences do not necessarily affect the chances of demonstrating a treatment effect, which supports the conduct of large, multicentre trials.
Appendix 1
Random effect logistic regression with random intercept for centre
withY_{ij} the outcome for patient i in centre j, β_{0} the intercept, u_{0j} the random intercept for the centre, and e_{0ij} the residuals. The random intercepts are assumed to be normally distributed with τ^{2} _{0j} = var(u_{0j}).
Random effect logistic regression with random intercepts for centre and country
With u_{0k} the random intercept for the country, and e_{0ijk} the residuals. The random intercepts are assumed to be normally distributed with τ^{2} _{0j} = var(u_{0j}) and τ^{2} _{0kj} = var(u_{0k}).
Random effect logistic regression with random intercept for centre, including patient characteristics
with patient characteristics x_{ij}
Range of the centre effects
Fixed effect logistic regression
with x_{ij} the treatment and β_{1} the treatment effect.
Random effect logistic regression with random intercept for centre, including treatment
with x_{ij} the treatment and β_{1} the treatment effect, and random intercept u_{0j}
Random effect logistic regression with random slope of the treatment effect per centre
with u_{1j} as the random slope. The random slopes are assumed to be normally distributed with τ^{2} _{1j} = var(u_{1j})
Random effect logistic regression with random intercept for centre and random slope of the treatment effect per centre
Range of the estimated treatment effect across centres
Abbreviations
 TBI:

Traumatic Brain Injury
 RCT:

Randomized Controlled Trial
 OR:

Odds Ratio
 CRASH:

Corticosteroid Randomisation After Significant Head Injury
 GCS:

Glasgow Coma Scale
 HR:

Hazard Ratio
 ICU:

Intensive Care Unit
References
 1.
Maas AI, Stocchetti N, Bullock R: Moderate and severe traumatic brain injury in adults. Lancet Neurol. 2008, 7 (8): 72841. 10.1016/S14744422(08)701649.
 2.
Finkelstein EA, Corso PS, Miller TR: The incidence and economic burden of injuries in the United States. 2006, New York: Oxford University Press
 3.
Maas AI, Marmarou A, Murray GD, Teasdale SG, Steyerberg EW: Prognosis and clinical trial design in traumatic brain injury: the IMPACT study. J Neurotrauma. 2007, 24 (2): 2328. 10.1089/neu.2006.0024.
 4.
Dickinson K, Bunn F, Wentz R, Edwards P, Roberts I: Size and quality of randomised controlled trials in head injury: review of published studies. BMJ. 2000, 320 (7245): 130811. 10.1136/bmj.320.7245.1308.
 5.
Maas AI, Murray G, Henney H, Kassem N, Legrand V, Mangelus M, Muizelaar JP, Stocchetti N, Knoller N, Pharmos TBI investigators: Efficacy and safety of dexanabinol in severe traumatic brain injury: results of a phase III randomised, placebocontrolled, clinical trial. Lancet Neurol. 2006, 5 (1): 3845. 10.1016/S14744422(05)702532.
 6.
Clifton GL, Drever P, Valadka A, Zygun D, Okonkwo D: Multicentre trial of early hypothermia in severe brain injury. J Neurotrauma. 2009, 26 (3): 3937. 10.1089/neu.2008.0556.
 7.
Clifton GL, Choi SC, Miller ER, Levin HS, Smith KR, Muizelaar JP, Wagner FC, Marion DW, Luerssen TG: Intercentre variance in clinical trials of head traumaexperience of the National Acute Brain Injury Study: Hypothermia. J Neurosurg. 2001, 95 (5): 7515. 10.3171/jns.2001.95.5.0751.
 8.
Lingsma HF, Roozenbeek B, Bayoue L, Lu J, Weir J, Butcher I, Marmarou A, Murray GD, Maas AI, Steyerberg EW: Large betweencentre differences in outcome after moderate and severe traumatic brain injury in the IMPACT* study. Neurosurgery. 2011, 68 (3): 6017. 10.1227/NEU.0b013e318209333b.
 9.
Kirkpatrick PJ: On guidelines for the management of the severe head injury. J Neurol Neurosurg Psychiatry. 1997, 62 (2): 10911. 10.1136/jnnp.62.2.109.
 10.
Roberts I, Yates D, Sandercock P, Farrell B, Wasserberg J, Lomas G, Lomas G, Cottingham R, Svoboda P, Brayley N, Mazairac G, Laloë V, MuñozSánchez A, Arango M, Hartzenberg B, Khamis H, Yutthakasemsunt S, Komolafe E, Olldashi F, Yadav Y, MurilloCabezas F, Shakur H, Edwards P: CRASH trial collaborators. Effect of intravenous corticosteroids on death within 14 days in 10008 adults with clinically significant head injury (MRC CRASH trial): randomised placebo controlled trial. Lancet. 2004, 364 (9442): 13218.
 11.
MRC CRASH Trial Collaborators, Perel P, Arango M, Clayton T, Edwards P, Komolafe E, Poccock S, Roberts I, Shakur H, Steyerberg E, Yutthakasemsunt S: Predicting outcome after traumatic brain injury: practical prognostic models based on large cohort of international patients, et al. BMJ. 2008, 336 (7641): 4259.
 12.
Steyerberg EW, Mushkudiani N, Perel P, Butcher I, Lu J, McHugh GS, Murray GD, Marmarou A, Roberts I, Habbema JD, Maas AI: Predicting outcome after traumatic brain injury: development and international validation of prognostic scores based on admission characteristics. PLoS Med. 2008, 5 (8): e16510.1371/journal.pmed.0050165.
 13.
Timbie JW, Normand SLT: Profiling value of hospital care following AMI. Statist. Med. 2008, 27: 13511370. 10.1002/sim.3082.
 14.
Rothwell PM, Mehta Z, Howard SC, Gutnikov SA, Warlow CP: Treating individuals 3: from subgroups to individuals: general principles and the example of carotid endarterectomy. Lancet. 2005, 365 (9455): 25665.
Acknowledgements
This work was supported by the National Institutes of Health [NS01923521]
Author information
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors' contributions
HL and BR performed the statistical analyses and drafted the manuscript, with help of ES. PP and IR provided the data and made important intellectual contributions. ES and AM designed the study and made important intellectual contributions. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Lingsma, H.F., Roozenbeek, B., Perel, P. et al. Betweencentre differences and treatment effects in randomized controlled trials: A case study in traumatic brain injury. Trials 12, 201 (2011) doi:10.1186/1745621512201
Received
Accepted
Published
DOI
Keywords
 Treatment Effect
 Traumatic Brain Injury
 Random Effect Model
 Ticagrelor
 Random Slope
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate. Please note that comments may be removed without notice if they are flagged by another user or do not comply with our community guidelines.