Marušić A, Fatović-Ferenčić S (2012). Adoption of the double dummy trial design to reduce observer bias in testing treatments.
Email to someoneTweet about this on TwitterShare on FacebookPin on PinterestShare on LinkedIn

© Ana Marušić, Department of Research in Biomedicine and Health, University of Split School of Medicine, Split, Croatia (; Stella Fatović Ferenčić, Department for History and Philosophy Sciences, Division for the History of Medicine, Croatian Academy of Sciences and Arts, Zagreb, Croatia (

Cite as: Marušić A, Fatović-Ferenčić S (2012). Adoption of the double dummy trial design to reduce observer bias in testing treatments. JLL Bulletin: Commentaries on the history of treatment evaluation (


Although the use of placebo controls and blind assessment to decrease observer bias in clinical trials was introduced at the end of the nineteenth century, it was not until the second half of the 20th century, coincident with the rapid proliferation of pharmaceutical drug trials, that placebo controls became more widely used (Kaptchuk 2011). In comparisons of drugs that are administered by different routes, however, the preparation of the placebo interventions (dummy treatments) becomes more complicated: to control for both delivery methods, the trial needs to have adequate control groups for both treatments – an approach referred to as the ‘double dummy’ trial design.

Bibliographic recognition of the double dummy trial design in the 1970s

Our interest in the double dummy trial design was prompted by a report published in 1975 by a group of rheumatological researchers in Zagreb, Croatia, led by Dr Theodor Dürrigl (Dürrigl et al. 1975). The report by Dürrigl and his colleagues is the earliest yielded by a search of the PubMed database using the search strategy “double dummy”[All Fields] AND (“1970/01/01″[PDAT] : “1979/12/31″[PDAT]. Up to that time, corticosteroids had been used in treating rheumatoid arthritis because of their anti-inflammatory and immunosuppressive effects (Kirwan et al. 1999). As steroids had serious adverse effects, non-steroid anti-inflammatory drugs (NSAID) were introduced to treat chronic rheumatoid arthritis to reduce the need for corticosteroid therapy. In the 1970s, research focus was on the synthesis of NSAIDs with high activity and high tolerability (Sallmann 1986). Diclofenac sodium (Voltaren) was released by Ciba-Geigy in 1973 (Ciba-Geigy. Company history. as a non-steroid, non-pyrazole compound with few side effects and significant anti-inflammatory and analgesic activities in animal models (Krupp et al. 1973) and clinical studies (Ono et al. 1972).

Professor Dürrigl and his team at the Institute for Rehabilitation of Rheumatic Patients of the School of Medicine in Zagreb, Croatia (then Yugoslavia), were among the first to test the efficacy and tolerability of diclofenac sodium against other NSAIDs (indomethacin in their study). The challenge presented for the design of this trial was that the two drugs looked different: diclofenac sodium was provided as enteric-coated tablets, whereas indomethacin was available as capsules. To address this problem, the researchers randomized the patients into three groups receiving either (i) diclofenac sodium and indomethacin placebo, or (ii) indomethacin and diclofenac sodium placebo, or (iii) both dummy preparations. They described the trial design as follows:

The trial design was a between-patient comparison of a 14-day treatment with either diclofenac sodium (25 mg t.i.d.), indomethacin (25 mg t.i.d.) or placebo. In order to preserve the double-blindness of the trial a double-dummy technique was used owing to the different appearance of the enteric-coated tablets of diclofenac sodium and indomethacin capsules. Each patient was allocated blindly to one of the three treatment groups. After each week of therapy the number of tablets and capsules of trial medication was recorded.

Out of 50 patients included in the trial, 48 completed the treatment. After 7 and 14 days of treatment, diclofenac sodium was judged superior to both indomethacin and placebo in alleviating symptoms (duration of morning stiffness, joint tenderness and severity of rheumatoid condition assessed by patients or physicians). Both active treatments were well tolerated (Dürrigl et al. 1975).

It is interesting that the investigators planned for possible confounding from the effects of the ‘rescue’ medications that patients would take, as needed, if the relief provided by the trial preparations proved inadequate. To address this potential source of bias, they provided the patients with rescue analgesics and counted (at two time points) the number of tablets they had used. There was no statistically significant difference in the proportion of patients in the trial comparison groups who had used rescue medications, although, on average, patients on placebo took half a ‘rescue’ tablet more than the two treatment groups.

Later in the year that the study by Dürrigl and his colleagues (1975) was published, the use of the double dummy trial design was reported to reduce observer bias in a comparison of two routes of administering diazepam as a premedication prior to anaesthesia (Assaf et al. 1975). A further nine articles published in the 1970s reported use of the double dummy trial design (Harding and Heath 1976; Beaver and Feise 1977; Gam et al. 1977; Murphy 1977; Wettrell 1977; Donald and Molla 1978; Rhind et al. 1978; Blohmé and Waldenström 1979; Mirakhur et al. 1979).

The introduction of the double dummy trial design in the 1960s

Careful reading of these 1970s studies, however, revealed that the Croatian study was not the earliest to report the use of the double dummy trial design. The earliest reports that we have identified so far were two published in 1964 and 1965. Percy and colleagues (1964), at the Royal Victoria Infirmary in Newcastle-upon-Tyne, UK, compared indomethacin and phenylbutazone in the treatment of a variety of rheumatic diseases. Their description of a double dummy design is clear:

As it was not possible to produce identical tablets of both drugs, an inert tablet corresponding to each active compound was made and the four resultant tablet types (active Indomethacin; dummy Indomethacin; active phenylbutazone; dummy phenylbutazone) were so dispensed that a week’s supply of each drug was given together with dummy tablets identical with the other drug.

The following year, Dudley Hart and Boardman (1965), at the Westminster Hospital in London, UK, compared indomethacin and phenylbutazone in the treatment of rheumatoid arthritis. Their article also clearly describes a double dummy design:

To provide double-blind conditions they received active indomethacin and dummy phenylbutazone in one month, and in the other active phenylbutazone and dummy indomethacin.

In 1969, the report of a comparison of indomethacin and phenylbutazone by Wright et al. also carries a clear explanation of a double dummy trial design (Wright et al. 1969):

The trial was conducted in three periods of 4 weeks, during each of which the patient received either red phenylbutazone tablets (100 mg. each) and yellow placebo capsules, or yellow indomethacin capsules (50 mg. each) and red placebo tablets, or yellow placebo capsules and red placebo tablets.

Although they did not use the term ‘double dummy’, Figure 1 [not shown in this article] in a paper published by the same authors two years later (McGuire and Wright 1971) makes clear that the two preparations compared (indomethacin as a capsule and phenybutazone as a tablet) were given using the double dummy trial design.

The National Library of Medicine did not use the term ‘double dummy’ in indexing any of the articles published before the 1970s. Although some of them used the term ‘dummy’, search using ‘dummy’ as a keyword did not identify any of the trials that we found by following leads from reports published in the 1970s. It thus seems that double dummy method originated in rheumatological research in the 1960s, but its introduction may have been even be even earlier.

Some reflections by Professor Dürrigl

We contacted Professor Dürrigl to ask him how he came to use the double dummy trial design. He recalled that he had been frustrated by the fact that, although there were many preparations for use in rheumatic diseases, none was ideal and it was difficult to know how to chose among them. He remembered that it was help from the pharmaceutical industry’s statistics experts that led him and other clinicians to implement the double dummy trial design. Although the statisticians were confident that their company’s products had advantages, they were eager to test them using rigorous study design. Professor Dürrigl remembered the collaboration being successful and productive.

Theodor Dürrigl’s research was driven by his wish to reduce the significant burden and poor quality of life of the large number of people with rheumatological disease. In his recently published memoirs he wrote:

I want and I hope that the terms ‘rheuma’ and ‘rheumatism’, and also ‘rheumatology’, will one day become obsolete because the cause (or causes) of all those diseases we have called ‘rheumatic’ for centuries will finally be discovered! Then the doctors who treat these diseases will give them proper names and, what is more important, find possibilities for their certain cure (Dürrigl 2006).

Our understanding of the aetiology of rheumatic diseases remains far from complete, despite the many biological drugs currently in use. However, the work of enthusiastic and conscientious rheumatological researchers in the sixties and seventies has left us with an important methodological legacy for tackling observer bias, which remains an important cause of biased estimates of treatment effects (Savović et al. 2012).

This James Lind Library commentary has been republished in the Journal of the Royal Society of Medicine 2013;106:196-198. Print PDF


Assaf RA, Dundee JW, Gamble JA (1975). The influence of the route of administration on the clinical action of diazepam. Anesthesia 30:152-8.

Beaver WT, Feise GA (1977). A comparison of the analgesic effect of oxymorphone by rectal suppository and intramuscular injection in patients with postoperative pain. J Clin Pharmacol 17:276-91.

Blohmé G, Waldenström J (1979). Glibenclamide and glipizide in maturity onset diabetes. A double-blind cross-over study. Acta Med Scand 206:263-7.

Donald JF, Molla AL (1978). A controlled study comparing a three times daily dose of chlordiazepoxide with a single night-time dose of Trancopal in the control of anxiety, using a double-blind, double-dummy technique. J Int Med Res 6:105-10.

Dudley Hart F, Boardman PL (1965). Indomethacin and phenylbutazone: a comparison. BMJ 2:1281-1284.

Dürrigl T (2006). Memories, biography and bibliography [in Croatian]. Zagreb : Croatian Rheumatological Society.

Dürrigl T, Vitaus M, Pucar I, Miko M (1975). Diclofenac sodium (Voltaren): Results of a multi-centre comparative trial in adult-onset rheumatoid arthritis. J Int Med Res 3:139-44.

Gam F, Helles A, Andersen B (1977). Hormonal treatment of climacteric complaints. A controlled, double-blind comparative study between 2 different types of sequential estrogen-gestagen preparations [in Danish]. Ugeskr Laeger 139:2808-12.

Harding SM, Heath S (1976). Intranasal steroid aerosol in perennial rhinitis: comparison with an antihistamine compound. Clin Allergy 6:369-72.

Hart FD, Boardman PL (1965) Indomethacin and phenylbutazone: a comparison. Br Med J 2:1281-4.

Kaptchuk TJ (2011). A brief history of the evolution of methods to control of observer biases in tests of treatments. JLL Bulletin: Commentaries on the history of treatment evaluation. Available at: (

Kirwan JR, Bálint G, Szebenyi B (1999). Anniversary: 50 years of glucocorticoid treatment in rheumatoid arthritis. Rheumatology 38:100-2.

Krupp PJ, Menassé-Gdynia R, Sallmann A, Wilhelmi G, Ziel R, Jaques R (1973). Sodium (o-((2,6-dichlorophenyl)-amino)-phenyl)-acetate (GP 45 840), a new non-steroidal anti-inflammatory agent. Experientia 29:450-2.

McGuire RJ, Wright V (1971) Statistical approach to indices of disease activity in rheumatoid arthritis. Ann Rheum Dis 30:574-80.

Mirakhur RK, Dundee JW, Connolly JD (1979).Studies of drugs given before anaesthesia. XVII: anticholinergic premedicants. Br J Anaesth 51:339-45.

Murphy JE (1977). A comparative trial of Anafranil, Pertofran and an Anafranil/Pertofran combination. J Int Med Res 5(1 Suppl):16-23.

Ono K, Okuda K, Kimura S (1972). Clinical evaluation of a non-steroidal analgesic and anti-inflammatory agent: GP 45,840 for the pain after tooth extraction. Shikai Tenbo (Prosp Dental Field) 39:941.

Percy JS, Stephenson P, Thompson M (1964) Indomethacin in the treatment of rheumatic diseases. Ann Rheum Dis 23:226-31.

Rhind VM, Leatham PA, Wright V, Akbar FA, Downie WW (1978). A comparative study of floctafenine (Idarac) and ibuprofen in the treatment of osteoarthrosis. J Int Med Res 6:11-3.

Sallmann AR (1986). History of diclofenac. Am J Med 80:29-33.

Savović J, Jones HE, Altman DG, Harris RJ, Jüni P, Pildal J, Als-Nielsen B, Balk EM, Gluud C, Gluud LL, Ioannidis JPA, Schulz KF, Beynon R, Welton NJ, Wood L, Moher D, Deeks JJ, Sterne JAC (2012).  Influence of reported study design characteristics on intervention effect estimates from randomized controlled trials. Ann Intern Med 157:429-38.

Wettrell K (1977). Intraocular pressure reduction during treatment with pilocarpine and systemic propranolol. A comparative double-masked study. Albrecht Von Graefes Arch Klin Exp Ophthalmol 204:1-7.

Wright V, Walker WC, McGuire RJ (1969). Indomethacin in the treatment of rheumatoid arthritis. A controlled trial comparing indomethacin, phenylbutazone, and placebo. Ann Rheum Dis 28:157-62.