- Letter to the Editor
- Open Access
- Open Peer Review
Inappropriate statistical method in a parallel-group randomized controlled trial results in unsubstantiated conclusions
© Dimova and Allison. 2016
- Received: 27 February 2015
- Accepted: 18 April 2016
- Published: 6 June 2016
The conclusions of Cassani et al. in the January 2015 issue of Nutrition Journal (doi:10.1186/1475-2891-14-5) cannot be substantiated by the analysis reported nor by the data themselves. The authors ascribed the observed decrease in inflammatory markers to the components of flaxseed and based their conclusions on within-group comparisons made between the final and the baseline measurements separately in each arm of the randomized controlled trial. However, this is an improper approach and the conclusions of the paper are invalid. A correct analysis of the data shows no such effects.
- Statistical analyses
- Research integrity
- Research reporting
Randomized controlled trials (RCTs) are vital to estimating and testing the causal effects of treatments but are only as good as their design, execution, analysis, and reporting . Rigorous assessment of the effects of foods such as flaxseed on health outcomes is informative and useful . Hence, Cassani et al.’s  description of an RCT of the effects of flaxseed consumption on body weight and inflammation markers is of interest. Regrettably, however, the authors draw conclusions that cannot be supported by the statistical analyses used. Moreover, discrepancies between the published paper and clinical trial registration raise questions.
Discrepancies between the published paper and clinical trial registration
The study registration on ClinicalTrials.gov shows that the study was completed in 2007 and yet registered in 2014. Moreover, it was designed as a four-arm study, yet the article reports results from only two groups and offers neither explanation for nor mention of this discrepancy. This seems inconsistent with the transparency that clinical trials registration is designed to promote .
The erroneous nature of the analyses
Cassani et al concluded that “flaxseed added to a weight loss diet could be an important nutritional strategy to reduce inflammation markers such as CRP and TNF- α.” The statistical analyses offered in the article, however, were not legitimate tests of the hypotheses in question. The statistical analyses utilized entailed comparisons of the outcomes measured at the end of the study with their pre-intervention levels in each arm separately. The abstract states, “A decrease in inflammatory markers (CRP and TNF-α) was observed after flaxseed intake (mean decrease of 25 % and 46 % for GflaxLC respectively). All groups also showed improvement in levels of total cholesterol, LDL-c, uric acid and adiponectin.” The main reason for conducting an RCT is to provide comparison of an outcome of interest between two or more experimental groups or conditions to which subjects have been randomly assigned, such that expected distributions of subject characteristics before administering the intervention (s) are identical across randomized groups or conditions . Drawing inferences based on the detected (or not) differences within a group is improper and can lead to inaccurate and misleading conclusions . As Bland and Altman  show, this approach can produce type 1 error rates of up to 0.50 for two-group comparisons when the 0.05 alpha level is used, in other words, a possible inflation of the false-positive rate by up to an order of magnitude. Instead, the analysis should include direct comparison of the outcome measured at the end of the study between the groups, usually adjusted for the pre-intervention levels.
By omitting direct between-group comparison of the outcome variables and by relying on the within-group results, Cassani et al.’s article misinforms. Specifically, the article states that a significant decrease in the inflammatory markers CRP and TNF- α was observed only in the flaxseed group. The authors thus concluded that adding flaxseed to a weight loss diet could reduce the inflammation markers CRP and TNF- α. However, no comparison between the two groups was provided. Thus, the authors did not answer the question of whether there was a significant difference in the outcomes between the two interventions.
What does a proper analysis of between-group differences show?
A proper analysis of these data could take several forms, such as a between-groups t-test (or nonparametric test) on the endpoint data or on the change-scores, or a between-groups ANCOVA on the endpoint with baseline values as a covariate, as described by Allison et al. . All of these involve a test between the randomized groups.
Cassani et al. should be commended for making their raw data publically available and for noting this in their article. We downloaded their data, which were then declared deidentified non-human subjects data by the University of Alabama at Birmingham’s Institutional Review Board. We calculated the mean (SD) baseline and final CRP levels to be respectively 2.77 (2.45) and 1.99 (2.07) in the GriceLC group, and 2.04 (1.48) and 1.51 (0.92) in the GflaxLC group. An ANCOVA of the final log-transformed CRP levels adjusted for baseline log-transformed CRP levels suggests no significant difference between the two groups (p=0.94)1. Data for TNF-α were available for only 9 (GriceLC) and 11 (GflaxLC) subjects, and contained 2 apparent outliers of 3446 and 1140, which we assumed to correspond to 34.46 and 11.40. Accordingly, we calculated the mean (SD) baseline and final TNF-α levels to be 25.67 (16.00) and 22.31 (17.17), respectively, in the GriceLC group, and 15.46 (16.39) and 7.67 (10.05), respectively, in the GflaxLC group. An ANCOVA of the final TNF-α levels adjusted for baseline TNF-α levels suggests no significant difference between the two groups (p = 0.09)2. Removal of the outliers also produced nonsignificant results. Alternatively, a rank-sum test of the change-score between the two groups, for both CRP and TNF-α, produces nonsignificant results (p = 0.81 and p = 0.82, respectively). Additionally, it should be pointed out that the difference in the baseline levels between the two groups, and correctness of the data should be assessed, especially given the small sample size.
Those interested in evidence-based recommendations need evidence from sound scientific studies, including soundness of design, execution, analysis, and reporting  without “spin” . According to the Committee on Publication Ethics , “Journal editors should consider retracting a publication if…they have clear evidence that the findings are unreliable [including]… as a result of …miscalculation or experimental error.” Cassani et al.’s  conclusions are not reliable because an invalid statistical testing approach was used, and retraction should be considered.
1Levene’s test for homogeneity of variance of the final log-transformed CRP levels was insignificant, p = 0.18.
2Levene’s test for homogeneity of variance of the final TNF-α levels was insignificant, p=0.29.
The authors report no financial connection to the content of the paper discussed. David B. Allison and/or his institution have accepted funds from food companies, but not ones who, to his knowledge, market flaxseed.
DBA conceived the paper. Both authors drafted sections of the manuscript and edited the entire paper. Both authors read and approved the final manuscript.
1Department of Biostatistics, University of Buffalo, Buffalo, NY 14214, USA. 2Office of Energetics and Nutrition Obesity Research Center, University of Alabama at Birmingham, School of Public Health, Ryals Public Health Building, Room 140J, Birmingham, AL 35294, USA.
Supported in part by NIH grants P30DK056336, R25DK099080, and R25HL124208. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health or any other organization. The authors thank Drs. Peng Li, Brandon George, and Mark B. Cope for constructive input.
Response to “Inappropriate statistical method in a parallel-group randomized controlled trial results in unsubstantiated conclusions”
Dimova and Allison have raised concerns about discrepancies between our published article and the clinical trial registration records. The first relates to the date of the trial registration. Although this study was completed in 2007, our group only started to register our studies in ClinicalTrials.gov in 2014. However, the trial was registered in the database of brazilian studies (CAAE - Certificado de Apresentação para Apreciação Ética - Certificate of Presentation for Ethical Consideration 0161.0.004.000-07) before the work began.
The other discrepancy raised was the absence of two groups in the published study compared with the four groups in the clinicaltrials.gov record. This was a result of suggestions from reviewers.
The reviewer for another journal, to which this article was submitted before submission to the Nutrition Journal, recommended that one of the four groups should be not included in the article. Differences between the subjects in this group and the other three groups were given as the reason for its exclusion. Later, one of the reviewers for the Nutrition Journal requested that another group should be removed because of the different intervention used in this group, compared with the two groups in the published article.
After reading the article by Bland and Altman  and reviewing the statistical analyses, all authors agree that this study does not allow us to conclude that there was decrease in the blood levels of CRP and TNF-alpha that could be attributed to flaxseed. Initially, the reviewer for the Nutrition Journal requested that a between-groups t-test or its non-parametric equivalent should be performed on our data. This was done for all the data described in the Tables, which could be noted in the article as the tests used between-groups mean differences. Unfortunately, only now it was noticed that the same was not done in the data shown in the figures (e.g. CRP, TNF-alpha and isoprostane). However, we would like to emphasize that this error was not observed by the Nutrition Journal, which accepted our article.
We thank Dimova and Allison for commending us for making our data available to the scientific community. Also, we were not “disappointed that their treatment had not produced a significant difference, as well as had not searched for anything that will have the appearance of supporting it”, as pointed out by Bland and Altman .
Finally, by making our data available to the scientific community, we hope that the errors in the statistical analysis will be seen as an honest mistake for which we sincerely apologize.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Casazza K, Allison DB. Stagnation in the clinical, community, and public health domain of obesity: the need for probative research. Clin Obes. 2012;2(3-4):83–5.View ArticlePubMedGoogle Scholar
- Rosen L, Manor O, Engelhard D, Zucker D. In defense of the randomized controlled trial for health promotion research. Am J Public Health. 2006;96(7):1181–6.View ArticlePubMedPubMed CentralGoogle Scholar
- Cassani RS, Fassini PG, Silvah JH, Lima CM, Marchini JS. Impact of weight loss diet associated with flaxseed on inflammatory markers in men with cardiovascular risk factors: a clinical study. Nutr J. 2015;14(1):5. doi:10.1186/1475-2891-14-5.View ArticlePubMedPubMed CentralGoogle Scholar
- ICMJE. Clinical Trial Registration. http://www.icmje.org/recommendations/browse/publishing-and-editorial-issues/clinical-trial-registration.html. Accessed 17 Feb 2015.
- Altman DG, Moher D, Schulz KF. Improving the reporting of randomised trials: the CONSORT Statement and beyond. Stat Med. 2012;31(25):2985–97.View ArticlePubMedGoogle Scholar
- Bland JM, Altman DG. Comparisons against baseline within randomized groups are often used and can be highly misleading. Trials. 2011;12(264):1–7.Google Scholar
- Allison DB, Gorman BS, Primavera LH. The most common questions asked of statistical consultants: our favorite responses and recommended readings. Genet Soc Gen Psychol Monogr. 1993;119:153–85.Google Scholar
- McNutt M. Journals unite for reproducibility. Science. 2014;346(6210):679.View ArticlePubMedGoogle Scholar
- Boutron I, Altman DG, Hopewell S, Vera-Badillo F, Tannock I, Ravaud P. Impact of spin in the abstracts of articles reporting results of randomized controlled trials in the field of cancer: The SPIIN Randomized Controlled Trial. J Clin Oncol. 2014;(36):4120-26 . doi:JCO.2014.56.7503.Google Scholar
- Committee on Publication Ethics. Guidelines for retracting articles. http://publicationethics.org/files/retraction%20guidelines.pdf. Accessed 2 Feb 2015.
- Bland JM, Altman DG. Best (but oft forgotten) practices: testing for treatment effects in randomized trials by separate analyses of changes from baseline in each group is a misleading approach. Am J Clin Nutr. 2015;102:991–4.View ArticlePubMedGoogle Scholar