How effective is cbt for depression

The study found when CBT was given, in addition to usual care that included antidepressants, it was effective in reducing depressive symptoms and improving quality of life over the long term – on average 46 months – for patients whose depression had not responded to medication. These benefits were found, on average, 40 months after the end of therapy.

Over the course of 46 months, 43 per cent of those who had received CBT had improved, reporting at least a 50 per cent reduction in symptoms of depression, compared with 27 per cent who continued with their usual care alone. Importantly, the study also found that this type of ‘high intensity’ CBT was a cost effective treatment from the perspective of the health service.

A psychological treatment that teaches people skills for life needs to show enduring effects. It is really important therefore that this study shows that for people with largely longstanding, complex depression, CBT leads to enduring effects over four years compared with usual care in the NHS. This is a key finding with one of the longest follow-ups of a large scale trial to date.

– Dr Willem Kuyken, Director of the Oxford Mindfulness Centre and Professor of Clinical Psychology at the University of Oxford’s Department of Psychiatry

The research was conducted by academics at the University of Bristol, along with colleagues from the Universities of Exeter, Oxford, Glasgow, and University College London.

The paper, ‘Long-term effectiveness and cost-effectiveness of cognitive behavioural therapy as an adjunct to pharmacotherapy for treatment resistant depression in primary care: follow-up of the CoBalT trial’ was published in The Lancet Psychiatry.

The research was funded by the National Institute for Health Research Health Technology Assessment (NIHR HTA).

Read the full paper published in The Lancet Psychiatry here.

Key Points

Question Which cognitive behavior therapy delivery format is most effective and acceptable for the treatment of acute depression?

Findings In this network meta-analysis of 155 trials involving 15 191 patients, no statistically significant differences in effectiveness were found among individual, group, telephone, and guided self-help treatment formats, although acceptability may be somewhat lower for guided self-help format. Unguided self-help therapy was not more effective than care as usual.

Meaning For acute symptoms of depression, group, telephone-administered, and guided self-help (internet-based or not) cognitive behavior therapy appeared to be effective and may be considered as alternatives to individual therapy.

Abstract

Importance Cognitive behavior therapy (CBT) has been shown to be effective in the treatment of acute depression. However, whether CBT can be effectively delivered in individual, group, telephone-administered, guided self-help, and unguided self-help formats remains unclear.

Objective To examine the most effective delivery format for CBT via a network meta-analysis.

Data Sources A database updated yearly from PubMed, PsycINFO, Embase, and the Cochrane Library. Literature search dates encompassed January 1, 1966, to January 1, 2018.

Study Selection Randomized clinical trials of CBT for adult depression. The 5 treatment formats were compared with each other and the control conditions (waiting list, care as usual, and pill placebo).

Data Extraction and Synthesis PRISMA guidelines were used when extracting data and assessing data quality. Data were pooled using a random-effects model. Pairwise and network meta-analyses were conducted.

Main Outcomes and Measures Severity of depression and acceptability of the treatment formats.

Results A total of 155 trials with 15 191 participants compared 5 CBT delivery formats with 2 control conditions. In half of the studies (78 [50.3%]), patients met the criteria for a depressive disorder; in the other half (77 [49.7%]), participants scored above the cutoff point on a self-report measure. The effectiveness of individual, group, telephone, and guided self-help CBT did not differ statistically significantly from each other. These formats were statistically significantly more effective than the waiting list (standardized mean differences [SMDs], 0.87-1.02) and care as usual (SMDs, 0.47-0.72) control conditions as well as the unguided self-help CBT (SMDs, 0.34-0.59). In terms of acceptability (dropout for any reason), individual (relative risk [RR] = 1.44; 95% CI, 1.09-1.89) and group (RR = 1.38; 95% CI, 1.06-1.80) CBT were significantly better than guided self-help. Guided self-help was also less acceptable than being on a waiting list (RR = 0.63; 95% CI, 0.52-0.75) and care as usual (RR = 0.72; 95% CI, 0.57-0.90). Sensitivity analyses supported the overall findings.

Conclusions and Relevance For acute symptoms of depression, group, telephone, and guided self-help treatment formats appeared to be effective interventions, which may be considered as alternatives to individual CBT; although there were few indications of significant differences in efficacy between treatments with human support, guided self-help CBT may be less acceptable for patients than individual, group, or telephone formats.

Introduction

Randomized clinical trials have shown that cognitive behavior therapy (CBT) is effective in treating depression1,2 and at least as effective as any other type of psychotherapy.1,3 Although CBT was developed as an individual therapy,4 it is also available as a group, telephone-administered, and self-help therapy in which patients work through a standardized protocol independently. The protocol can be in book format or available on the internet. Self-help therapy can either be guided (ie, involving a professional therapist) or unguided (ie, providing no professional guidance to the patient using the materials).

Many meta-analyses have examined these CBT delivery formats.5-14 Overall, the effects of individual, group, telephone-administered, and guided self-help (internet based or not) formats seem to be comparable in magnitude to the control conditions, whereas the unguided self-help format seems to make a small but substantial difference and to be less effective than the individual, group, telephone, and guided self-help formats.

However, conventional pairwise meta-analyses can only compare 2 treatment formats at a time, and only through network meta-analysis can we examine all formats simultaneously in a single analysis. Because network meta-analysis can combine direct and indirect evidence, it can assess the relative effectiveness of the different CBT delivery formats. To date, no previous network meta-analysis has examined these treatment formats.

Methods

Identification and Selection of Studies

We used a database of randomized clinical trials examining the psychological treatment of depression. The database is continuously updated and was developed through a comprehensive literature search from January 1, 1966, to January 1, 2018 by two of us (P.C. and E.K.), of PubMed, PsycINFO, Embase, and the Cochrane Library. The search used a combination of index and text words indicative of depression and psychotherapies and set filters for randomized clinical trials (eAppendix A in the Supplement shows the full search string for PubMed). All records were screened by 2 independent researchers (P.C. and E.K.), and all studies that could possibly meet the inclusion criteria according to one of the researchers were retrieved as full text. The decision to include or exclude a study was also made by 2 independent researchers (P.C. and E.K.). Disagreements were solved through discussion. The network meta-analysis is registered in the PROSPERO as CRD42017064442.

We defined CBT as a therapy in which cognitive restructuring was one of the core components.1-3 Cognitive restructuring is aimed at evaluating, challenging, and modifying a patient’s dysfunctional beliefs.

We included studies in which CBT was examined in the following formats: individual, group, telephone, guided self-help (administered through the internet or other media, such as a book), and unguided self-help (internet-based or not). For the control groups, we included waiting list, care as usual, and pill placebo. The group who received care as usual while waiting was classified under the waiting list control condition. Studies were included if they compared one treatment format with another format or with one of the control conditions. Studies in which the means and SDs (or SEs or 95% CIs of means) were not reported were excluded.

Depression could be established with a diagnostic interview or with a score above a cutoff point on a self-report measure. Comorbid mental or somatic disorders were included. Studies that included patients with anxiety also were excluded, as were studies on patients with depression and comorbid substance use disorders, studies on inpatients, and studies involving adolescents or children. We also excluded maintenance studies aimed at people who had already (partly) recovered after an earlier treatment, studies in which mixed treatment formats were used (eg, both individual and group therapy), and studies on third-wave therapies. We did not set a maximum or minimum on the length of treatment.1

Risk of Bias and Data Extraction

Two independent researchers (P.C. and E.K.) assessed the validity of included studies using 4 criteria of the risk-of-bias assessment tool from the Cochrane Collaboration15: adequate generation of allocation sequence; concealment of allocation to conditions; prevention of knowledge of the allocated intervention (masking of assessors); and dealing with incomplete outcome data, which was assessed as positive when intention-to-treat analyses (ie, meaning all randomized patients were included) were conducted. Disagreements were solved through discussion.

In addition, we coded participant characteristics (ie, depressive disorder or high score on a self-rating scale, recruitment method, and target group), the number of treatment sessions, and the country in which the study was conducted.

Outcomes

We selected 1 outcome measure for each study that indicated the severity of depression using an algorithm: Hamilton Rating Scale for Depression16; Beck Depression Inventory I or II17,18; another clinician-rated instrument; another self-report instrument, with priority for the Patient Health Questionnaire 919; the Center for Epidemiologic Studies—Depression Scale20; and the Hospital Anxiety and Depression Scale—Depression.21 Acceptability of the treatment formats was operationalized as study dropout for any reason during the acute-phase treatment.22

Statistical Analysis

We conducted a series of pairwise meta-analyses for all direct comparisons using a random-effects pooling model. As an assessment of the homogeneity of effect sizes, we calculated the I2 statistic, which is an indicator of heterogeneity in percentages, as well as τ2. We calculated 95% CIs around the I2 statistic using the noncentral χ2–based approach within the Heterogi module for Stata (StataCorp).23,24 We tested for publication bias using Egger test of the intercept to quantify the bias captured by the funnel plot and to test whether it was statistically significant.

The comparative effectiveness was evaluated using the network meta-analysis methodology of combining direct and indirect evidence for all relative treatment effects. First, we summarized the geometry of the network of evidence using network plots.25 Second, we conducted a network meta-analysis of the comparative efficacy or acceptability using the contrast-based network meta-analysis methods.26 Given the expected clinical and methodological heterogeneity of treatment effects among the studies, we adopted the random-effects model.27 Comparative standardized mean differences (SMDs) and relative risks (RR) were reported with their 95% CIs and 95% prediction intervals (PrI). The PrI indicates the range in which the true effect size of 95% of all populations will fall. The ranking of treatment formats was estimated according to the surface under the cumulative ranking curve, which is based on the estimated random-effects models.25

In examining the transitivity assumption, we created a table of important trial and patient characteristics to verify if potential effect modifiers were similarly distributed across the comparisons in the network. We checked the consistency of the network using local and global inconsistency tests. The local inconsistency test evaluates the loop inconsistency of all the triangle loops on the network.28 The global inconsistency is a goodness-of-fit test using the design-by-treatment interaction model of Higgins et al.29 If any relevant sources of bias were found, we performed sensitivity analyses of how these factors were associated with the overall results.

Furthermore, we conducted a multivariate meta-regression analysis of the possible sources of heterogeneity with the same variables that were used to examine the transitivity assumption. We also examined the results at follow-up and focused on 3 to 12 months of follow-up (only a few studies examined longer follow-up periods). If a study reported several outcomes between 3 and 12 months, we selected the effect size at the latest follow-up period.

We conducted a series of 3 sensitivity analyses: one in which we included only the studies with a low risk of bias, one in which we excluded outliers (the 95% CI around the effect size did not overlap with the pooled effect size), and one in which we included only internet-based guided and unguided self-help formats (and excluded other types of self-help CBT).

We assessed the certainty of evidence in network estimates of the main outcome using the Grading of Recommendations Assessment, Development, and Evaluation framework.30

Most analyses were conducted in Stata/SE, version 14.2 for Mac (StataCorp). The meta-regression analyses of small sample bias were conducted in OpenBUGS, version 3.2.3 (OpenBUGS Foundation), and the Grading of Recommendations Assessment, Development, and Evaluation ratings were conducted in CINeMA.31

Results

Selection, Inclusion, and Characteristics of Studies

After examining a total of 19 982 abstracts (15 598 after removal of duplicates), we retrieved 2343 full-text papers for further consideration and excluded 2181 papers. The PRISMA flowchart describing the inclusion process, including the reasons for exclusion, is presented in eAppendix B in the Supplement. In total, 155 studies with 15 191 patients met the inclusion criteria. Two studies had 2 CBT interventions that met the inclusion criteria32,33 and underwent separate comparisons. Selected characteristics of the included studies are given in eAppendix C and the references in eAppendix D in the Supplement.

The 155 studies included 57 individual, 45 group, 46 guided self-help, 10 telephone, and 21 unguided self-help arms as well as 71 waiting list, 71 care as usual, and 2 pill placebo arms. In half of the studies (78 [50.3%]), patients met the criteria for a depressive disorder; in the other half (77 [49.7%]), participants scored above the cutoff point on a self-report measure. Patients were partly recruited from the community in 77 studies (49.7%), exclusively from clinical samples in 33 studies (21.3%), and through other methods in 45 (29.0%). Seventy-eight studies (50.3%) were aimed at unselected adults, 31 (20.0%) at patients with comorbid general medical disorders, 13 (8.4%) at older adults, 11 (7.1%) at women with postpartum depression, and 22 (14.2%) at other specific target groups. Most studies (133 [85.8%]) were conducted in Western countries.

With regard to risk of bias, 92 studies (59.4%) reported an adequate sequence generation, 85 (54.8%) reported allocation to conditions by an independent or third party, 37 (23.9%) used blinded outcome assessors, and 107 (69.0%) used only self-report outcomes. In 102 studies (65.8%), intent-to-treat analyses were conducted. Sixty-one studies (39.4%) met all quality criteria, 58 (37.4%) met 2 or 3 of the criteria, and the remaining 36 (23.2%) met no or only 1 criterion.

Network Plot

The network is shown in Figure 1. Table 1 shows the number of studies for each comparison. In the network plot, the nodes and edges are weighted according to the number of available treatment formats and comparisons. Overall, the network was well connected. The most examined comparisons were between individual, group, and guided self-help formats as well as the waiting list and care as usual control conditions. Relatively few direct comparisons between individual, group, and guided self-help CBT were available. Telephone CBT was not strongly attached to the network, with a relatively small number of trials comparing it with individual therapy as well as with the waiting list and care as usual control conditions. Pill placebo was compared with only individual CBT in 2 trials and not with any other format or control condition; thus, no closed loops were found with pill placebo. Unguided self-help was compared with guided self-help CBT in a small number of studies as well as with the waiting list and care as usual control conditions but not with individual CBT. The contribution plot, showing the percentage of contributions from the direct comparisons for the mixed and indirect estimates, is presented in eAppendix E in the Supplement.

Pairwise Meta-analyses

Table 1 shows the results of the pairwise meta-analyses, consisting of 6 comparisons with more than 10 included studies (see the forest plots in eAppendix F in the Supplement). Individual, group, telephone, and guided self-help formats were more effective than the waiting list and care as usual control conditions. Furthermore, group therapy was statistically significantly less effective than individual therapy; unguided self-help therapy was significantly less effective than guided self-help but was more effective than being on a waiting list; and individual therapy was significantly more effective than taking pill placebo. None of the other comparisons was statistically significant (possibly owing to low power).

Heterogeneity was moderate to very high in all comparisons with more than 10 studies (range of I2 statistic, 51%-92%) except for unguided self-help CBT compared with the waiting list control condition. Egger test was statistically significant for all 6 comparisons, including the waiting list or care as usual control condition (except the comparisons between guided self-help vs care as usual, telephone vs care as usual, and unguided self-help vs waiting list), suggesting substantial publication bias.

Network Meta-analysis

The main results of the network meta-analysis are presented in Figure 2. Indirect evidence could be calculated for the 9 comparisons for which no direct evidence was available. Individual, group, guided self-help, and telephone CBT were statistically significantly more effective than the unguided self-help CBT (SMD, 0.34-0.59) as well as the waiting list (SMD, 0.87-1.11) and care as usual (SMD, 0.47-72) control conditions. We found no statistically significant differences between individual, group, guided self-help, and telephone CBT except for a small but statistically significant superiority of group CBT over guided self-help CBT (SMD, 0.25). Unguided self-help CBT was more effective than the waiting list control condition (SMD, 0.52) but not more effective than the care as usual (SMD, 0.13) control condition. Pill placebo is not included in Figure 2 because it was examined in only 2 studies, and none of the results was statistically significant, which may be attributed to a lack of statistical power (see the results in eAppendix G in the Supplement).

Visual inspection of the distribution of potential effect modifiers (eAppendix H in the Supplement) indicated that these potential effect modifiers were similarly distributed across the comparisons in the network, suggesting no significant evidence against the transitivity assumption.

Consistency factors were examined using the loop-specific approach (eAppendix I in the Supplement). The highest inconsistency factor was found for the loop of group CBT, unguided self-help, and waiting list. However, no inconsistency factors were found to be statistically significant, although this finding cannot be considered as evidence of the absence of inconsistency because of low power in some of the loops, especially in the presence of large heterogeneity in pairwise comparisons. The design-by-treatment interaction model did not indicate global inconsistency in the network (τ225 = 23.70; P for the null hypothesis of consistency in the network = .54).

The certainty of evidence for each network estimate is reported in eAppendix J in the Supplement. Among the major comparisons, certainty of evidence was moderate for individual compared with guided self-help CBT and for guided self-help compared with telephone CBT, but it was low for the comparisons between individual CBT and care as usual control condition, group and guided self-help CBT, as well as telephone CBT and waiting list or care as usual control condition.

Acceptability

The outcomes of the network meta-analysis for acceptability are shown in Figure 2. No indications of statistically significant differences were found between individual, group, and telephone CBT. Guided self-help was statistically significantly less acceptable than individual (RR = 1.44; 95% CI, 1.09-1.89) and group (RR = 1.38; 95% CI, 1.06-1.80) therapies as well as the care as usual (RR = 0.72; 95% CI, 0.57-0.90) and waiting list (RR = 0.63; 95% CI, 0.52-0.75) control conditions. Unguided self-help was less acceptable than being on a waiting list (RR = 0.75; 95% CI, 0.62-0.91). Individual and telephone CBT were more acceptable than pill placebo (eAppendix G in the Supplement), but placebo was only examined in 2 studies and thus should be considered with caution.

Ranking of Treatment Formats

The results of the analyses on the ranking of treatment formats (surface under the cumulative ranking curve) are shown in Table 2 separately for the efficacy and for the acceptability. In the Figure 3 forest plot, the treatment formats are ranked, with care as usual as the reference group. Individual (77.6%; SMD, –0.63), group (90.5%; SMD, –0.72), telephone (76.8%; SMD, –0.63), and guided self-help (55.8%; SMD, –0.47) formats ranked best with small differences. For acceptability, telephone CBT had the highest ranking, followed by individual, group, and unguided self-help therapies. Guided self-help format ranked considerably lower than the other treatment formats (1.6%; RR = 1.39, increasing dropouts in comparison with care as usual).

Long-term Effectiveness

The results of the network meta-analysis at 3 to 12 months of follow-up are presented in eAppendix K in the Supplement. The network was not well populated, with 14 of the 21 comparisons having 2 or fewer effect sizes (7 comparisons had no effect size) and with only 2 comparisons having more than 10 effect sizes. Only 3 studies were available for telephone CBT and 2 studies for the waiting list control condition. The design-by-treatment interaction model did not indicate global inconsistency in the network (τ213 = 10.70; P for the null hypothesis of consistency in the network = .64). The results of the network meta-analysis suggested positive and statistically significant effectiveness of individual, group, guided self-help, and telephone CBT compared with the care as usual control condition. Telephone CBT was also more effective than the waiting list and unguided self-help but less effective than individual CBT. However, because of the small number of comparisons, especially on telephone CBT, all of these findings have to be considered with caution.

The results of the meta-regression analyses are reported in eAppendix L in the Supplement. Only 1 predictor was found to be statistically significant. Because of the correlational nature of these findings, the large number of analyses conducted, and the relatively high P values for the statistically significant result (ie, P > .01), these results should be interpreted with caution.

Sensitivity Analyses

In the first sensitivity analysis (limited to studies with low risk of bias; eAppendix M in the Supplement), the network was not well populated. The results were, however, comparable to the overall results, with statistically significant differences between individual, group, guided self-help, and telephone CBT on the one hand and the care as usual control condition on the other hand. Individual, group, and guided self-help (but not telephone) formats were significantly more effective than the waiting list control condition. Unguided self-help CBT was significantly more effective than the waiting list but not care as usual. In the second sensitivity analysis (with the outliers excluded; eAppendix N in the Supplement), the results were again comparable to the main analyses.

In the third sensitivity analysis, we included only internet-based guided and unguided CBT and excluded other types of self-help therapy (see eAppendixes O and P in the Supplement for effectiveness and acceptability). Again, the results were comparable to findings in the main analyses, with individual, group, guided self-help, and telephone CBT being statistically significantly more effective than care as usual, waiting list, and unguided self-help. Acceptability was significantly lower in guided self-help compared with other treatment formats.

Discussion

The results of this network meta-analysis suggest that individual, group, telephone-administered, and guided self-help (internet-based or not) treatment formats have comparable effectiveness in the treatment of depression and that the effectiveness does not differ statistically significantly across formats. The effect sizes of these treatment formats compared with the care as usual control condition were moderate or large when compared with the waiting list control condition. Although guided self-help CBT was as effective as individual, group, and telephone CBT, it was less acceptable as the other formats. This finding was statistically significant when guided self-help was compared with individual and group CBT.

We also found that unguided self-help (without any involvement of a therapist) was statistically significantly less effective than that of individual, group, telephone, and guided self-help CBT. Unguided CBT was more effective than the waiting list but not the care as usual control condition. All results were broadly confirmed in several sensitivity analyses.

Although these findings are in line with results of previous meta-analytic research confirming the comparable effectiveness of individual, group, telephone, and self-help CBT, this current network meta-analysis (which includes the largest number of studies in the field, to our knowledge) takes advantage of all direct and indirect comparisons simultaneously, thus making the estimates more precise and consistent. To our knowledge, this network meta-analysis is the first to compare alternative delivery formats at the same time and on a common metric. An earlier meta-analysis found that individual therapies were more effective than group therapy.10 Our pairwise comparisons between individual and group therapies supported this finding, but in the network meta-analysis, no significant difference was found. This result suggests that a small difference may exist between individual and group therapy but that more high-quality research is needed to examine this issue.

It is not clear why the acceptability of guided self-help CBT was lower compared with that for the other formats. Maybe the absence of direct contact with a professional makes it easier to stop the treatment because there is less personal-relationship pressure to continue with the treatment or the study. However, lower acceptability would then also be expected to happen in unguided CBT, which we did not find. More research is needed to examine this issue.

Limitations

This study has several limitations that should be taken into consideration when interpreting the results. First, not enough studies on placebo were available from which to draw any conclusions about comparisons with placebo. Furthermore, relatively few studies examined the effectiveness of telephone CBT, although they suggested that telephone CBT was comparable to individual and group CBT both in terms of efficacy and acceptability. Second, substantial heterogeneity was found in several of the examined comparisons. We defined CBT as a psychological treatment in which cognitive restructuring was one of the core elements. However, most treatments also included other components, such as behavioral activation, problem solving, mindfulness, and social skills training. The variety of components included in these treatments differed widely, which may have contributed to the statistical heterogeneity and certainly to the clinical heterogeneity. We did not find indications of significant inconsistency, however.

Third, we could examine whether our findings were valid across different categories of baseline severity because baseline severity was measured with many different instruments. We found no indication that the transitivity assumption was violated in this study, but it is important to conduct more research on the differences between groups of patients, such as those who use internet-based treatments and those who do not as well as those who use guided self-help and those who use unguided interventions. Fourth, although network meta-analyses make optimal use of all available data, the indirect evidence is not directly based on randomized clinical trials.34 Fifth, we found indications of publication bias in several analyses, although our analyses suggested comparable results after adjustment for publication bias, possibly because such bias existed only between active and control conditions and therefore did not affect the relative efficacy among the active interventions. The same was true for risk of bias, with the sample of studies having considerable risk of bias and with sensitivity analyses suggesting comparable results in studies with low risk of bias.

Conclusions

This study suggests that group, telephone, and guided self-help treatments are effective interventions that may be considered as alternatives to individual CBT. Applying effective and acceptable CBT in a range of different formats will make CBT easier to implement, disseminate, and deliver across different settings and diverse patient populations. These results should inform future clinical guidelines worldwide for the management of depression.

Article Information

Accepted for Publication: January 23, 2019.

Published Online: April 17, 2019. doi:10.1001/jamapsychiatry.2019.0268

Corrections: This article was corrected on July 17, 2019, to fix an error in Figure 3; and on November 24, 2021, to fix a numeric error in Results and plotted values in Figure 3B.

Corresponding Author: Pim Cuijpers, PhD, Department of Clinical, Neuro and Developmental Psychology, Amsterdam Public Health Research Institute, Vrije Universiteit Amsterdam, Van der Boechorststraat 7-9, 1081 BT Amsterdam, the Netherlands ([email protected]).

Author Contributions: Dr Cuijpers had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.

Concept and design: Cuijpers, Karyotaki, Cipriani.

Acquisition, analysis, or interpretation of data: All authors.

Drafting of the manuscript: Cuijpers.

Critical revision of the manuscript for important intellectual content: All authors.

Statistical analysis: Cuijpers, Noma, Karyotaki, Cipriani.

Administrative, technical, or material support: Cipriani.

Conflict of Interest Disclosures: Dr Noma reported personal fees from Boehringer Ingelheim, Kyowa Hakko Kirin, and ASKA Pharmaceutical outside of the submitted work. Dr Cipriani reported support from the National Institute for Health Research (NIHR) Oxford Cognitive Health Clinical Research Facility, professorship grant RP-2017-08-ST2-006 from the NIHR Research, and grant BRC-1215-20005 from the NIHR Oxford Health Biomedical Research Centre. Dr Furukawa reported personal fees from Meiji Seika Pharma, grants and personal fees from Mitsubishi-Tanabe, personal fees from MSD, and personal fees from Pfizer outside of the submitted work and a pending patent to 2018-177688. No other disclosures were reported.

Disclaimer: The views expressed herein are those of the authors and do not necessarily reflect those of the UK National Health Service, the National Institute for Health Research, or the UK Department of Health.

References