You are on page 1of 4

AUTHORSHIP AND

CONTRIBUTORSHIP

The Hidden Research Paper


Richard Horton, FRCP Context To determine whether the views expressed in a research paper are accu-
rate representations of contributors’ opinions about the research being reported.

W
HAT HAPPENS WHEN SCI- Methods Purposive sampling of 10 research articles published in The Lancet; quali-
entists disagree? Most tative analysis of answers to 6 questions about the meaning of the study put to con-
times, readers of re- tributors who were listed on the byline of these articles. Fifty-four contributors listed
search papers never on the bylines of the 10 articles were evaluated, and answers to questions were com-
know. However, in 1995, a dispute pared between contributors within research groups and against the published re-
among the writing committee of the search report.
Italian Multicentre Acute Stroke Trial Results A total of 36 (67%) of 54 contributors replied to this survey. Important weak-
spilled out onto the pages of The Lan- nesses were often admitted on direct questioning but were not included in the published
cet.1,2 During peer review, it became article. Contributors frequently disagreed about the importance of their findings, impli-
clear that 2 committee members inter- cations, and directions for future research. I could find no effort to study systematically
preted the results of the trial very dif- past evidence relating to the investigators’ own findings in either survey responses or the
published article. Overall, the diversity of contributor opinion was commonly excluded
ferently from their colleagues. They
from the published report. I found that discussion sections were haphazardly organized
had, for the good of the collaboration, and did not deal systematically with important questions about the study.
self-censored their own views. How-
Conclusions A research paper rarely represents the opinions of those scientists whose
ever, this fragile truce broke down once
work it reports. The findings described herein reveal evidence of (self-)censored criti-
an editor asked for signatures confirm- cism, obscured meanings, confused assessment of implications, and failures to indi-
ing each contributor’s assent for the pa- cate directions for future research. There is now empirical support for the introduction
per to be published. Tognoni and Ron- of structured discussion sections in research papers. Editors might also explore ways
caglioni2 described their disagreement to recover the plurality of contributors’ opinions.
as “unfortunate.” JAMA. 2002;287:2775-2778 www.jama.com
Such disagreements have come to
light before. For example, divided in- ence that is most relevant to publica- contact contributors on the article’s by-
terpretations about a polio outbreak in tion because its disclosure can iden- line and to explain the background and
Israel led to separate commentaries in tify who is accountable for what part nature of the study. Once permission
The Lancet. 3 In an even more pro- of the research and allows the reader had been granted, I wrote to all con-
tracted dispute, a competing manu- to assign credit fairly.” tributors and asked 6 questions about
script based on the same study was However, important as contributor- their work (BOX 1). Contributors were
eventually published 4 years after the ship is, this mechanism of disclosure written to twice after that and tele-
original article appeared.4 Harmony does not take account of the ideas or phoned once to obtain replies.
among authors cannot be relied on. interpretations contributed to the re- Once available replies had been col-
A crisis over definitions of author- search being reported. I wanted to know lected, individual answers to ques-
ship during recent years has led sev- whether the views expressed in a re- tions were compared with one an-
eral medical journals to discard rigid search paper are accurate representa- other among contributors for each
rules for who can or who cannot be an tions of contributors’ opinions. research paper. These answers were also
author. Instead, the idea of the con- compared with the contents of the pub-
tributor has emerged.5 In place of the METHODS lished article. Finally, contributor sec-
assumption that scientists named on the I selected 10 articles published in The tions were examined to discover if there
byline of an article are true authors, Lancet during 2000 (TABLE). This study were any identifiable connections
journals such as The Lancet, BMJ, and had a qualitative design: articles were between stated contributions and the
Annals of Internal Medicine now re- selected purposively with varying num- answers to these 6 questions.
quire contributors to state explicitly bers of contributors, across a range of
Author Affiliation: Dr Horton is Editor of The Lan-
what part they played in the research subject areas, and including a spec- cet, London, England.
being reported. This conceptual shift trum of research methods. I wrote to Corresponding Author and Reprints: Richard Hor-
ton, FRCP, The Lancet, 32 Jamestown Rd, London
was best summarized by Rennie et al5: the corresponding author of each re- NW1 78Y, England (e-mail: richard.horton@lancet
“Contribution is the activity of sci- search article to secure permission to .com).

©2002 American Medical Association. All rights reserved. (Reprinted) JAMA, June 5, 2002—Vol 287, No. 21 2775

Downloaded From: https://jamanetwork.com/ on 10/15/2023


THE HIDDEN RESEARCH PAPER

published article, especially the discus-


Table. Articles Selected for Study From The Lancet by Subject, Number of Contributors, and
Design* sion section, did not clearly separate
Subject No. of Contributors Design Page Nos. these interests.
Helicobacter pylori 2 Cross-sectional 358-362 The strengths of the research were
Biliary atresia 3 Case series 25-29 identified as follows: study design (by
Down syndrome 3 Cohort 165-169 6 contributors), an identified mecha-
Myocardial infarction 4 Systematic review 773-778 nism of action (4), the double-blind na-
Suicide 5 Case-control 9-12 ture of the trial (3), daily patient con-
Critical care 5 Mixed 595-598 tact (3), well-matched controls (2),
Schizophrenia 5 Case-control 614-617 cyclicity of symptoms taken into ac-
Diarrheal disease 7 Time series 442-450 count (2), the large treatment response
Bulimia nervosa 11 Randomized trial 792-797 (2), and the interpretation (2). The first
Folate and heart disease 11 Randomized trial 517-522 3 of these strengths were clearly iden-
*Articles are from volume 355 of The Lancet published during 2000. tified as such in the article.
Similar transparency was not found
I do so because it is this study design for weaknesses. In the published re-
Box 1. Questions Asked of that is central to establishing evidence port, highlighted weaknesses were self-
Contributors of 10 Selected for or against interventions in clinical reporting of symptoms and the risk of
Research Papers practice. Supportive or contradictory a higher motivation to succeed among
In your own words, how would you: findings, together with further issues, study participants. However, on di-
1. Summarize the results of your will be explored by describing the re- rect questioning, small sample size
study? maining replies. (7 contributors), short duration of study
2. Define the strengths of your The trial concerned the efficacy of (4), no long-term follow-up (2), and
study? ondansetron, a peripherally active se- poor generalizability (2) were empha-
3. Define the weaknesses of your rotonin antagonist, in patients with an sized. Concerns about the study, freely
study? eating disorder. Forty-three patients stated by the scientists undertaking this
4. Interpret the results of your study
were screened and 26 were random- research, had not been incorporated
in the context of the totality of
available evidence? ized to receive either ondansetron or into the article.
5. Assess the implications of your re- placebo. The primary outcome mea- The views about interpretation in the
sults? sure was a composite of the number of context of the totality of available evi-
6. Plan further research into the bingeing and vomiting episodes per dence matched those found for the
question under investigation? week. For patients receiving ondanse- summary of findings. That is, contribu-
tron, at 4 weeks the mean number of tors ranged between strongly clinical
episodes was 6.5 (SD, 3.9) per week. (“ondansetron is effective in the treat-
For patients receiving placebo, the mean ment of bulimia nervosa”) and more
RESULTS number was 13.2 (SD, 11.6) per week. pathophysiologic conclusions. Again,
All corresponding authors gave per- When asked to summarize the pa- these distinctions, although clear from
mission for me to contact their co- per, contributors seemed to reply ac- individual replies, were not made in the
contributors. However, one correspond- cording to their underlying interest in published article, where clinical and
ing author, although agreeing that “the the research question. At the ex- mechanistic issues were mixed to-
results of your project will shed light on tremes, for example, one contributor gether in the discussion section.
whether the true strength of a collabo- took a purely clinical view: “Our study The implications of the study find-
rative research group is being fully found that ondansetron significantly re- ings were also poorly expressed in the
achieved,” declined permission for me duced binge eating and vomiting com- published report of the trial, according
to contact more junior members of her pared to pill placebo in women with se- to the responses of individual authors.
research team. She wrote, “ I would ask vere bulimia nervosa.” Another took a The main implications concerned vagal
that you refrain from contacting three more pathophysiologic perspective: nerve research (3 contributors), vagal in-
of the authors on our paper . . . since “Blocking vagal neurotransmission pri- fluences over psychiatric symptoms (3),
they are still under my supervision.” In marily at the gastric level by ondanse- ondansetron as a treatment for bulimia
all, 36 (67%) of 54 contributors con- tron produces a statistically signifi- (2), the therapeutic value of vagal block-
tacted replied to the survey. cant reduction in bulimia symptoms ade (2), and the need for a broader vi-
In reporting these results, I will take in a group of severely ill bulimic pa- sion for research into bulimia (2). Only
one study and describe the responses tients.” Different summaries suggest dif- ondansetron as a treatment was high-
of the contributors in detail. The study ferent interests and perhaps different lighted in the final article. Indeed, ac-
I will focus on is a randomized trial, and motivations for doing this work. The cording to a senior author, “the most im-
2776 JAMA, June 5, 2002—Vol 287, No. 21 (Reprinted) ©2002 American Medical Association. All rights reserved.

Downloaded From: https://jamanetwork.com/ on 10/15/2023


THE HIDDEN RESEARCH PAPER

portant implication” of the trial was that Confusion was also common when sociations could be made, mostly be-
the results “would help remove the nega- implications of new research were being cause contributor statements lacked
tive social connotations associated with considered. For example, in the folate sufficient descriptive detail.
this disorder.” Nowhere was this impli- and vitamin B6 randomized trial, one con- In reviewing these 10 published ar-
cation mentioned in the article pub- tributor concluded that the results “pro- ticles, the most frustrating aspect of
lished in The Lancet. vide some justification” for treating pa- comparing survey responses with pub-
Finally, in considering lines for tients at high risk of atherothrombotic lished reports was the chaotic nature of
future research, several possibilities disease with folic acid. Another contribu- discussion sections. There was no clear
were identified: the physiology of tor simply considered that this “first or consistent approach by contribu-
bulimia (4 contributors), comparisons (little) piece of evidence . . . should be tors to the discussion of their results.
of ondansetron with other treatments seen as an encouragement for other tri- Limitations were frequently omitted,
(3), the inclusion of patients with less als” only. In a systematic review of stress clinical interpretations were often
severe conditions in subsequent trials hyperglycemia and risk of death after mixed with mechanistic reflections, and
(2), and a longer study duration (2). myocardial infarction, one contributor repetition of key results was common.
None of these ideas was discussed in drew a diagnostic conclusion: “a simple,
the published article. early available, and cheap plasma glu- COMMENT
Many of these omissions and pat- cose identifies patients at a high risk for The results of this qualitative study
terns of reporting were found in the in hospital complications and death.” An- show that a research paper rarely rep-
other articles studied (data not shown). other drew a treatment lesson: “clini- resents the full range of opinions of
However, there were exceptions. For ex- cians should recognise hyperglycaemia those scientists whose work it claims
ample, in a randomized trial of folic acid as an important prognostic marker and to report. I have found evidence of cen-
plus vitamin B6 to lower plasma homo- take an aggressive therapeutic ap- sored criticism; obscured views about
cysteine concentrations and perhaps to proach for patients who have elevated the meaning of research findings; in-
ameliorate atherosclerosis, the weak- blood glucose readings at the time of complete, confused, and sometimes bi-
nesses cited in the survey responses [myocardial infarction].” The article it- ased assessment of the implications of
(small size, use of surrogate measures, self does not explore this range of opin- a study; and frequent failure to indi-
and short study duration) were all dis- ion and takes a more conservative line, cate directions for future research. Some
cussed in the published article. The strik- emphasizing glucose as a risk factor only. papers have more complete evalua-
ing fact, therefore, was the inconsis- Contributors differed in their views tions of findings than others. What was
tency across this sample of articles. For about future research. In response to a striking was the inconsistency in pub-
instance, in a study of how El Niño af- direct question, contributors to a pa- lished evaluations, especially regard-
fects diarrheal diseases in Peruvian chil- per on risk factors for suicide sug- ing weaknesses. The strengths of this
dren, several contributors pointed out gested looking at sex, family history of work are its qualitative design, which
that only one El Niño event had been illness, age, socioeconomic back- produced a rich data set, and a purpo-
studied. This weakness was not dis- ground, psychiatric diagnosis, and life sive sampling technique that con-
cussed in the article. Similarly, al- events. Yet none of these ideas was dis- firmed the findings across a range of
though wide confidence intervals were cussed in the published report. In a subject areas and study designs.
cited as a weakness in a study of cancer study of Helicobacter pylori transmis- This work also had several limita-
in individuals with Down syndrome, this sion among siblings, readers were given tions. First, these data are preliminary.
weakness was not highlighted in the ar- no direction about where future re- The sample of articles was small and
ticle published in The Lancet. search might be directed. However, in came from one journal only. The risk of
The question that yielded the most their survey responses, the contribu- bias is substantial. Second, since I sur-
uniformly disappointing response con- tors suggested work on the natural his- veyed contributors after publication of
cerned interpretation in the context of tory of the infection, factors (espe- their studies, I could not rule out the pos-
the totality of available evidence. In nei- cially those in the family) that influence sibility that contributors discussed their
ther the survey responses nor the pub- the dynamics of infection, and a focus responses with one another before re-
lished articles were any efforts made to on early childhood years. plying. Third, variance of opinion was
describe systematically evidence that re- The Lancet introduced contributors’ determined by one person (R.H.) using
lated to the investigators’ own find- descriptions of the parts each person a nonvalidated survey instrument. Mul-
ings. Anecdotal reporting of other work played in the research being reported tiple independent assessments, perhaps
was the norm in both settings. The con- in 1997. These descriptions are writ- adopting a quantitative scale, could im-
sistent failure by scientists to provide ten by the authors themselves. In this prove the validity of these findings.
a more rigorous overview of past evi- study, I relied on these self-reports to A scientific research paper is an ex-
dence when considering their own find- find links between stated contribu- ercise in rhetoric7; that is, the paper is
ings has been pointed out before.6 tions and survey responses. No such as- designed to persuade or at least con-
©2002 American Medical Association. All rights reserved. (Reprinted) JAMA, June 5, 2002—Vol 287, No. 21 2777

Downloaded From: https://jamanetwork.com/ on 10/15/2023


THE HIDDEN RESEARCH PAPER

tors themselves. Who determines what sidered in a structured discussion is


Box 2. Proposal for is written and why? Despite the intro- shown in BOX 2.
Structured Discussion in duction of contributors’ sections to re- Future research might aim to con-
Medical Research Papers search reports,8 this question remains firm these findings in a larger sample
Summary of Key Findings unanswered. from multiple clinical and nonclinical
The gaps identified in published re- journals, perhaps using a quantitative
Primary outcome measure(s)
Secondary outcome measure(s) search reports reveal not only the range scale or a more formal method of lin-
Results as they relate to a prior of opinions among contributors, but guistic analysis.13 More interestingly, an
hypothesis also the weaknesses of editorial proce- ethnographic approach14 would re-
dures. In particular, the omission of veal how papers are actively put to-
Strengths and Limitations
of the Study limitations from the discussion sec- gether. Whatever route this work takes,
tions must be judged a potential fail- current definitions of “contributor”
Study question
Study design ure of journal peer review. should be widened to include the ap-
Data collection What more could editors do to praisal and interpretation of research—
Analysis recover the plurality of contributors’ what is thought as well as what is done.
Interpretation opinions? The discussion section is a Author Contributions: Richard Horton conceived, de-
Interpretation and Implications neglected part of the research paper.9 signed, conducted, analyzed, and wrote the current
The 6 questions I asked in the survey study.
in the Context of the Totality Acknowledgment: I thank Mrs Hopelyn Goodwin,
of Evidence described herein seem to set a mini- who provided secretarial assistance.
Is there a systematic review mum standard for addressing the cen-
tral scientific issues concerning the va- REFERENCES
to refer to?
If not, could one be reasonably done lidity and meaning of a piece of 1. MAST-I Group. Randomised controlled trial of strep-
tokinase, aspirin, and combination of both in treat-
here and now? research. A first step would be to en- ment of acute ischaemic stroke. Lancet. 1995;346:
What this study adds to the sure that all 6 questions are answered 1509-1514.
available evidence 2. Tognoni G, Roncaglioni MC. Dissent: an alterna-
explicitly in the discussion section, with tive interpretation of MAST-I. Lancet. 1995;346:
Effects on patient care and health the full range of contributors’ opin- 1515.
policy ions being offered. To enable authors 3. Slater PE, Orenstein NA, Morag A, et al. Poliomy-
Possible mechanisms elitis outbreak in Israel in 1988: a report with two com-
to answer these questions satisfacto- mentaries. Lancet. 1990;335:1192-1198.
Controversies Raised rily, longer papers may be necessary. 4. Rennie D. The Cantekin affair. JAMA. 1991;266:
by This Study 3333-3337.
Editors should probably relax their 5. Rennie D, Yank V, Emanuel L. When authorship
Future Research Directions usual word limits for research papers fails: a proposal to make contributors accountable.
JAMA. 1997;278:579-585.
For this particular research or consider publishing expanded ver- 6. Clarke M, Chalmers I. Discussion sections in re-
collaboration sions of the paper on the Web. In- ports of controlled trials published in general medical
Underlying mechanisms deed, one could go further. These data journals. JAMA. 1998;280:280-282.
7. Horton R. The rhetoric of research. BMJ. 1995;
Clinical research provide empirical support for struc- 310:985-987.
tured discussions. I have raised this pos- 8. Yank V, Rennie D. Disclosure of researcher con-
tributions: a study of original research articles in The
sibility before,7 and other editors have Lancet. Ann Intern Med. 1999;130:661-670.
substantially supported and devel- 9. Horton R. The unstable medical research paper.
J Clin Epidemiol. 1997;50:981-986.
vey to the reader a particular point of oped this proposal.10,11 The results re- 10. Docherty M, Smith R. The case for structuring the
view. When one probes beneath the ported herein indicate that more care- discussion of scientific papers. BMJ. 1999;318:1224-
1225.
surface of the published report, one will ful organization of the discussion 11. Altman DG, Schulz KF, Moher D, et al. The re-
find a hidden research paper that re- section of a research paper might pro- vised CONSORT statement for reporting ran-
veals the true diversity of opinion vide the framework for not only a fairer domised trials: explanation and elaboration. Ann In-
tern Med. 2001;134:663-694.
among contributors about the mean- and more accurate representation of 12. Purcell GP, Donovan SL, Davidoff F. Changes to
ing of their research findings. For contributors’ views, but also a more manuscripts during the editorial process. JAMA. 1998;
280:227-228.
both readers and editors, the views complete analysis of the data being pre- 13. Kontos J, Malagardi I. Information and knowl-
expressed in a research paper are gov- sented. The importance of omissions in edge extraction from medical texts. Stud Health Tech-
nol Inform. 2000;57:260-269.
erned by forces that are clear to no- research reports has been described.12 14. Latour B. Science in Action. Boston, Mass: Har-
body, perhaps not even to the contribu- A proposal for the elements to be con- vard University Press; 1987.

2778 JAMA, June 5, 2002—Vol 287, No. 21 (Reprinted) ©2002 American Medical Association. All rights reserved.

Downloaded From: https://jamanetwork.com/ on 10/15/2023

You might also like