Between-and within-therapists variability in the therapeutic

Between-Therapist and Within-Therapist Differences
in the Quality of the Therapeutic Relationship: Effects
on Maladjustment and Self-Critical Perfectionism
m
David C. Zuroff, Allison C. Kelly, and Michelle J. Leybman
McGill University
m
Sidney J. Blatt
Yale University
m
Bruce E. Wampold
University of Wisconsin – Madison
The relationship between therapeutic outcome and a patient-reported
measure of the Rogerian conditions of positive regard, empathy, and
genuineness was decomposed into between-therapist effects and
within-therapist effects using multilevel modeling. Data were available for 157 depressed outpatients treated by 27 therapists in the
cognitive-behavioral therapy, interpersonal therapy, or placebo with
clinical management conditions of the Treatment of Depression
We thank Scott Baldwin for his advice on conducting the multilevel modeling in SAS.
The NIMH-TDCRP was a multisite program initiated and sponsored by the Psychosocial Treatments
Research Branch, Division of Extramural Research Programs, NIMH. The program was funded by
cooperative agreements to six participating sites: George Washington University, MH 33762; University of
Pittsburgh, MH 33753; University of Oklahoma, MH 33760; Yale University, MH 33827; Clarke Institute
of Psychiatry, MH 38231; and Rush Presbyterian-St. Luke’s Medical Center, MH 35017. The principal
NIMH collaborators were Irene Elkin, Coordinator; Tracie Shea, Associate Coordinator; John P.
Docherty; and Morris B. Parloff. The principal investigators and project coordinators at the three
participating research sites were, at George Washington University, Stuart M. Sotsky and David Glass; at
the University of Pittsburgh, Stanley D. Imber and Paul A. Pilkonis; and at the University of Oklahoma,
John T. Watkins and William Leber. The principal investigators and project coordinators at the three
research sites responsible for training therapists were, at Yale University, Myrna Weissman (now at
Columbia University), Eve Chevron, and Bruce J. Rounsaville; at Clarke Institute of Psychiatry, Brian F.
Shaw and T. Michael Vallis; and at Rush Presbyterian-St. Luke’s Medical Center, Jan A. Fawcett and
Phillip Epstein. Collaborators in the data management and data analysis aspects of the program were
C. James Klett, Joseph F. Collins, and Roderic Gillis of the Veterans Administration Studies Program,
Perry Point, Maryland.
Correspondence concerning this article should be addressed to: David C. Zuroff, McGill University,
Department of Psychology, 1205 Dr. Penfield Ave., Montreal, Quebec, Canada H3A 1B1; e-mail: zuroff@
ego.psych.mcgill.ca
JOURNAL OF CLINICAL PSYCHOLOGY, Vol. 66(7), 681--697 (2010)
& 2010 Wiley Periodicals, Inc.
Published online in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/jclp.20683
682
Journal of Clinical Psychology, July 2010
Collaborative Research Program (Elkin, 1994). Consistent with prior
findings of significant between-therapist variability in outcome (e.g.,
Baldwin, Wampold, & Imel, 2007), patients whose therapists provided
high average levels of the perceived Rogerian conditions across the
patients in their caseloads experienced more rapid reductions in both
overall maladjustment and depressive vulnerability (self-critical perfectionism). Within each therapist’s caseload, differences between
patients in perceived Rogerian conditions had weaker effects. The
results underline the importance of differences between therapists as
determinants of outcome in the treatment of depression. & 2010
Wiley Periodicals, Inc. J Clin Psychol: 66:681–697, 2010.
Keywords: therapeutic relationship; therapist effect; depression;
cognitive behavioral therapy; interpersonal psychotherapy
A growing body of evidence suggests that psychotherapists differ substantially in
their average effectiveness; indeed, variability in outcomes attributable to differences
between therapists appears to be considerably larger than variability attributable to
differences between treatments (Wampold, 2001). In an important early review,
Crits-Christoph and Mintz (1991) examined 10 clinical trials that included 27
treatment conditions for a variety of disorders, concluding that on average about 8%
of the variance in outcomes was attributable to differences between therapists. Two
subsequent studies examined data from randomized clinical trials of treatments for
depression (Blatt, Sanislow, Zuroff, & Pilkonis, 1996) and panic disorder (Huppert
et al., 2001). Although both studies relied on analysis of variance, in which therapists
were treated as a fixed effect, suggestive evidence was obtained that therapists
differed in their average effectiveness.
Recent studies have employed more sophisticated multilevel modeling strategies,
which treat therapists as a random rather than a fixed effect. Kim, Wampold, and
Bolt (2006) reanalyzed data from the cognitive-behavior therapy (CBT) and
interpersonal therapy (IPT) conditions in the Treatment of Depression Collaborative
Research Program (TDCRP; Elkin, 1994), reporting that the proportion of variance
in outcome within the completer sample, attributable to differences between
therapists, ranged from about 8% to 12% for different measures of outcome. In a
similar set of analyses using pill placebo with clinical management (PLA-CM) and
imipramine with clinical management (IMI-CM) conditions in the TDCRP, McKay,
Imel, and Wampold (2006) found that the proportion of variance in outcome
attributable to differences between psychiatrists was 6.7%, using the Hamilton
Rating Scale for Depression (HRSD; Hamilton, 1967), and 9.1%, using the Beck
Depression Inventory (BDI; Beck, Ward, Mendelson, & Erbaugh, 1961).
Recent studies of psychotherapy in naturalistic settings have uncovered therapist
effects similar to those observed in controlled trials. Using multilevel modeling to
examine rates of improvements (slopes) in a university counseling center, Okiishi,
Lambert, Nielsen, and Ogles (2003) found that there was significant variability
across therapists in their clients’ average rate of improvement. This finding was
replicated in a larger sample of students and therapists at the same counseling center
(Okiishi et al., 2006). In a study of outcome in a wide variety of disorders treated in a
managed care context, Wampold and Brown (2005) found that about 5% of the
variance in outcomes was attributable to differences between therapists. Lutz, Leon,
Martinovich, Lyons, and Stiles (2007) also studied a managed care sample, finding
Journal of Clinical Psychology
DOI: 10.1002/jclp
Between-Therapist and Within-Therapist Differences
683
that 17% of the variance in clients’ rates of change could be attributed to differences
between therapists. Dinger, Strack, Leichsenring, Wilmers, and Schauenberg (2008)
found that outcome variance attributable to therapists in a naturalistic study of
inpatient treatment was significant (3%), but smaller than that reported in the
outpatient studies. Despite these consistent therapist effects in studies of diverse
disorders, in numerous forms of treatment, and using multiple outcome measures,
researchers have remained more interested in putative differences between
treatments (e.g., CBT vs. Behavioral Activation) than differences between treatment
providers (Blatt & Zuroff, 2005; Kim et al., 2006; Okiishi et al., 2003).
A number of possible mechanisms may account for the differences between
therapists that lead to differential outcomes. Given the significant relation between
dimensions of the therapeutic relationship and outcome (Norcross, 2002), one
possibility is that therapists differ in their capacity for forming constructive
psychotherapeutic relationships, and these differences in relational capacity give rise
to differences in outcome. Examining this hypothesis requires that the overall effect
of a relationship variable (e.g., the therapeutic alliance) on outcome be decomposed
into the component attributable to average differences between therapists and the
component attributable to differences between patients within a given therapist’s
caseload (Baldwin, Wampold, & Imel, 2007). Conceptually, the first (betweentherapists) component reflects the degree to which the quality of the therapeutic
relationship is attributable to characteristics of the therapist. The second (withintherapists) component reflects both the patient’s general capacity to form a
therapeutic relationship and the specific or interactive effect on the relationship
of the pairing of the patient with a particular therapist. This decomposition into the
therapist’s and patient’s contribution to the therapeutic alliance is effected by
conducting multilevel analyses in which patients are modeled as nested within
therapists, and the within-therapist (level-1) effects are tested separately from the
between-therapist (level-2) effects.
Baldwin et al. (2007) carried out such analyses using data from the Research
Consortium of Counseling and Psychological Services in Higher Education study
(Brownson, 2004). Pretreatment and posttreatment scores on the Outcome
Questionnaire-45 (OQ-45; Lambert et al., 2004) were available for 331 patients seen
by 80 therapists for an average of about seven sessions. The strength of the
therapeutic alliance was assessed using the Working Alliance Inventory (WAI;
Horvath & Greenberg, 1989) administered prior to the fourth therapy session.
Baldwin et al.’s primary finding was that therapist variability in the alliance was
related to outcome; that is, ‘‘therapists whose patients, on average, rated their alliance
high, also had better outcomes than therapists whose patients, on average, rated their
alliance low’’ (p. 846). Patient variability in the alliance was not significantly related
to outcome, suggesting that the causal pathways from alliance to outcome primarily
reflect the therapist’s capacity for establishing a constructive relationship with a range
of patients rather than the patient’s capacity to establish or respond to a constructive
relationship. In summary, Baldwin et al.’s results provide a new line of evidence for
the importance of therapists in determining the outcomes achieved by their patients.
The theoretical import of Baldwin et al.’s (2007) findings warrants efforts to assess
their replicability and generalizability. Their analyses were conducted with data from
a naturalistic study of a heterogeneous population of college counseling clients who
received a heterogeneous array of (mostly) brief interventions. One wonders whether
similar results would be obtained with samples of psychiatric patients with specific
disorders receiving specific, manualized treatments in the context of a clinical trial.
Journal of Clinical Psychology
DOI: 10.1002/jclp
684
Journal of Clinical Psychology, July 2010
The generalizability of Baldwin et al.’s findings over different dimensions of the
therapeutic relationship and different measures of outcome also remains unknown.
The TDCRP data set is an ideal context for investigating these questions, as it
involved the random assignment of carefully diagnosed depressed outpatients to
receive one of several well-defined treatments, and included a wealth of measures of
process and outcome. Thus, we undertook to replicate and extend Baldwin et al.’s
results using publicly available data from the TDCRP.
Several previous analyses of TDCRP data are relevant to the present article.
Zuroff and Blatt (2006) conducted a stringent test of the relations to outcome of
three dimensions of the therapeutic relationship: the therapist contribution to the
alliance, the patient contribution to the alliance, and the aggregate of patientperceived empathy, positive regard, and congruence in the therapist. Empathy,
positive regard, and congruence are the qualities that Carl Rogers (1951, 1957)
considered to be the necessary and sufficient conditions for therapeutic change, and
are frequently referred to as the Rogerian conditions. Patient and therapist
contributions to the therapeutic alliance were assessed using Krupnick et al.’s
(1996) observer ratings of video recordings of the third treatment session; patients’
perceptions of the Rogerian conditions were assessed using the Barrett-Lennard
Relationship Inventory (B-L RI; Barrett-Lennard, 1962) administered after the
second treatment session. Outcome was assessed using both patient-reported and
clinician-rated measures. Early treatment response (from intake to the fourth
treatment session) was partialled from the relationship measures, and the resulting
residualized measures were used to predict subsequent change in outcome measures.
Multilevel modeling consistently demonstrated effects of the residualized B-L RI on
outcome, demonstrating that the impact of perceived Rogerian conditions on
outcome was independent of the degree of early therapeutic change. The most robust
effects were obtained with a composite measure of outcome that combined five
measures of patient-reported and clinician-rated symptoms and overall life
adjustment (Blatt, Zuroff, Bondi, Sanislow, & Pilkonis, 1998). The analyses reported
here, therefore, focussed on the B-L RI as a predictor of this composite measure of
outcome.
The Rogerian conditions and the therapeutic alliance are conceptually distinct
dimensions of the therapeutic relationship (Hatcher & Barends, 2006; Horvath &
Bedi, 2002), as the former captures the patient’s perceptions of a facet of the
therapist’s behavior and the latter addresses the active collaboration between the
therapist and patient. Measures of perceived Rogerian conditions and the working
alliance are only weakly correlated (Zuroff & Blatt, 2006). Consequently, the present
analyses are not a strict replication of Baldwin et al.’s analyses, but rather an
exploration of another facet of the therapeutic relationship.
A comprehensive assessment of outcome should include measures of reduction in
underlying vulnerabilities as well as symptom and adjustment-focused measures
(Blatt & Zuroff, 2005; Blatt, Zuroff, Hawley & Auerbach, 2010). Fortunately, the
architects of the TDCRP included the Dysfunctional Attitudes Scale (DAS;
Weissman & Beck, 1978) at each assessment point during treatment. The DAS
was designed to measure cognitive vulnerability to depression. Research with the
Perfectionism subscale of the DAS and related measures such as the Self-Criticism
scale of the Depressive Experiences Questionnaire (DEQ; Blatt, D’Afflitti, &
Quinlan, 1976), has shown that self-critical perfectionism (SC-PFT) both confers
vulnerability to depression (Blatt, 2004; Zuroff, Mongrain, & Santor, 2004) and
interferes with the treatment of depression (Blatt & Zuroff, 2005). Blatt and Zuroff
Journal of Clinical Psychology
DOI: 10.1002/jclp
Between-Therapist and Within-Therapist Differences
685
(2005) found that, across the four treatment conditions of the TDCRP, higher levels
of the Rogerian conditions were associated with greater reduction in SC-PFT, but
they did not distinguish the effects of therapist variability and patient variability in
the B-L RI. We, therefore, extended Blatt and Zuroff’s and Baldwin et al.’s (2007)
investigations by examining the contributions of between-therapist and withintherapist variability in perceived Rogerian conditions to the reduction of SC-PFT.
Because of differences between the TDCRP and Research Consortium datasets,
our statistical procedures differed from those of Baldwin et al. (2007). Outcome data
were available in the TDCRP at five points from the beginning to the end of
treatment, whereas the Research Consortium collected only pretreatment and
posttreatment data. Baldwin et al., therefore, focused on prediction of posttreatment
scores, adjusted for pretreatment scores, by between-therapist and within-therapist
alliance scores. We instead modelled rates of change over time (slopes) in the
dependent variables (cf. Lutz et al., 2007; Zuroff & Blatt, 2006), and examined the
degree to which these slopes were determined by the between-therapist and withintherapist B-L RI scores.
We included the two psychotherapy conditions in the TDCRP (CBT and IPT) as
well as the PLA-CM condition. The PLA-CM condition was included because
clinical management was designed to provide the basic elements of supportive
psychotherapy (Elkin, Parloff, Hadley, & Autry, 1985). Additionally, previous
analyses of variables related to the therapeutic relationship found few differences
between the formal psychotherapies and the CM conditions (e.g., Blatt, Zuroff,
Quinlan, & Pilkonis, 1996; Zuroff et al., 2000). Patients in the IMI-CM condition
were omitted from the analyses for two reasons. First, the administration of
imipramine introduced a treatment element that clearly could not be construed as
psychotherapy. Second, the fact that each psychiatrist treated patients in both the
IMI-CM and PLA-CM conditions would have made statistical modeling very
difficult, as treatment providers were nested within the CBT and IPT conditions but
crossed the two CM conditions.
Based on Baldwin et al.’s (2007) findings, we hypothesized that more rapid
reductions in both the composite maladjustment measure and SC-PFT would be
achieved by patients whose therapists were perceived to offer, on average, higher
levels of the Rogerian conditions. We thought it was possible that the more sensitive
tests afforded by the repeated measurements in the TDCRP would reveal additional
within-therapists effects of the B-L RI, that is, that patients who experienced higher
levels of the Rogerian conditions than their therapist’s average patient would report
more rapid improvement.
Method
Participants in the TDCRP were outpatients with nonbipolar, nonpsychotic, major
depressive disorders. Two hundred and fifty patients were randomly assigned to
treatments, 239 patients began treatment, and 162 were defined as completers,
having received at least 12 treatment sessions over at least a 15-week period.
Inclusion and exclusion criteria, sample characteristics, treatment procedures, and
assessment procedures have been described previously (Elkin et al., 1985; Elkin et al.,
1989; Shea et al., 1992). Patients scored 14 or higher on the 17-item version of the
HRSD and met Research Diagnostic Criteria for a current episode of definite major
depression. Patients in the IMI-CM condition were excluded from the present
analyses, as were patients with missing data for the B-L RI at session two. One
Journal of Clinical Psychology
DOI: 10.1002/jclp
686
Journal of Clinical Psychology, July 2010
therapist in the CBT condition saw only one patient, and that dyad was excluded as
well. These criteria resulted in a sample of 157 (43 male, 114 female) patients, with 53
receiving IPT, 54 receiving CBT, and 50 receiving PLA-CM. They had an average
age of 35.1 (SD 5 8.60). Their racial/ethnic backgrounds were 86.6% white, 10.2%
black, 2.6% Hispanic, and 0.6% other. Their religions were 33.8% Catholic, 40.8%
Protestant, 5.7% Jewish, 1.3% Moslem, 3.2% other, and 15.3% unaffiliated. Their
mean occupational status was 4.60 (SD 5 1.25), on a 0–7 scale anchored by never
employed and executive/professional.
These 157 patients were treated by 27 MD-level or PhD-level therapists who saw
between 3 and 11 patients each. Twenty of the 27 therapists were male, and 18 of the
27 were MDs. IPT was provided by eight therapists; CBT was provided by nine
therapists; and PLA-CM was provided by 10 therapists. The average age of the
therapists was 41.51 (SD 5 8.23) years, and their average years of experience was
11.33 (SD 5 7.09).
All therapists received training and met competency criteria in the treatment they
provided. The treatments were manualized and monitored by experts. Tapes of
sessions were reviewed to assure adherence to treatment protocols, and therapists
received consultation during the study (Elkin, 1994). The manuals for clinical
management in the medication conditions provided ‘‘guidelines for providing
support and encouragement to the patient and giving direct advice when necessary.
This CM component thus approximates a ‘‘minimal supportive therapy’’ condition’’
(Elkin et al., 1985, p. 311).
Evaluations were conducted at intake (0 weeks), 4 weeks, 8 weeks, 12 weeks, and
16 weeks. At each assessment, patients completed self-report measures and were
interviewed by a PhD-level clinical evaluator (CE).
Measures
Maladjustment. The principal outcome measures in the TDCRP were as follows:
the BDI, the HRSD, the Hopkins Symptom Checklist (HSCL-90; Derogatis,
Lipman, & Covi, 1973), the Global Adjustment Scale (GAS; Endicott, Spitzer,
Fleiss, & Cohen, 1976), and the Social Adjustment Scale (SAS; Weissman & Paykel,
1974). Despite varying in both item content and rater (self vs. CE), residual change
scores on the five outcome measures loaded on a single factor that accounted for
75.5% of the variance (Blatt, Zuroff et al., 1996). We used a Composite
Maladjustment Index (CMI; Blatt et al., 1998), that was based on z-scores for
each of the five variables calculated using the pooled mean over the five observation
points and the pooled within-time period standard deviation. The CMI was
computed for each observation point by taking the mean of the five z-scores with
appropriate reversals. The CMI has been used in several reanalyses of the TDCRP
(Shahar, Blatt, Zuroff, & Pilkonis, 2003; Shahar, Blatt, Zuroff, Krupnick, & Sotsky,
2004; Zuroff et al., 2000; Zuroff & Blatt, 2006). Because of the higher reliability
afforded by aggregating correlated measures, more sensitive tests of effects are
possible using this composite. Moreover, parameter estimates are likely to be more
precise as a result of reduced random error.
Self-critical perfectionism. The DAS (Weissman & Beck, 1978) was designed to
measure cognitive vulnerability to depression. Consistent with prior research (e.g.,
Cane, Olinger, Gotlib, & Kuiper, 1986), Perfectionism and Need for Approval
dimensions emerged in a principal components analysis with a varimax rotation of
TDCRP intake data (Imber et al., 1990). Eleven items loaded substantially (4.40)
Journal of Clinical Psychology
DOI: 10.1002/jclp
Between-Therapist and Within-Therapist Differences
687
on Need for Approval, and 15 items loaded substantially (4.40) on Perfectionism.
Summing the items with high loadings yielded composites with high internal
consistency, a 5 .91 for Need for Approval and .82 for Perfectionism. Because
perfectionism has been found to be related to outcome at the end of treatment and to
functioning during the follow-up period (Blatt et al., 1998; Zuroff & Blatt, 2002), we
focused on Perfectionism rather than Need for Approval. The two highest loading
items for Perfectionism were as follows: ‘‘If I do not do as well as other people, it
means that I am an inferior human being’’ and ‘‘If I fail at my work, then I am a
failure as a person.’’ In the present article, we refer to the DAS Perfectionism scale as
self-critical perfectionism (SC-PFT) rather than simply perfectionism. Recent
research has demonstrated that perfectionism is a multidimensional construct, and
that the DAS-SC scale loads on the self-critical perfectionism factor, the more
maladaptive of the two major factors of perfectionism (Dunkley & Kyparissis, 2008;
Powers, Zuroff, & Topciu, 2004). The mean score for SC-PFT was 49.34
(SD 5 17.22). In comparison, Dunkley and Kyparissis’ (2008) community sample
of employed adults reported a mean score of 38.81 (SD 5 15.15).
Perceived Rogerian conditions in the therapeutic relationship. The BarrettLennard Relationship Inventory (Barrett-Lennard, 1962) includes four subscales
that assess the patient’s perception of the extent to which the therapist provides
positive regard, empathy, unconditional regard, and congruence. The B-L RI has
been shown to predict outcome in psychotherapy and counselling (Barrett-Lennard,
1986; Gurman, 1977). We used the B-L RI from the second treatment session.
Blatt, Sanislow et al. (1996) found that the four scales of the B-L RI formed a
single factor and constructed a measure of the perceived Rogerian conditions by
summing scores on the three high-loading scales, Empathy, Level of Regard, and
Congruence. Each subscale comprises 16 items rated on a 6-point scale. Cronbach’s
a for the composite of 48 items was .95. Examples of items include: ‘‘y nearly
always knows what I mean’’ (Empathy), ‘‘y feels a true liking for me’’ (Level of
Regard), and ‘‘y is comfortable and at ease in our relationship’’ (Congruence). The
mean B-L RI score was 20.78 (SD 5 11.66).
The mean score over the patients in each therapist’s caseload was designated the
Between-Therapist B-L RI (BT-BLRI). The SD of the BT-BLRI was 5.73, with a
range from 14.19 to 10.59. Each patient’s B-L RI was centered by subtracting from
it the mean score for that patient’s therapist. The resulting centered score was
designated the Within-Therapist B-L RI (WT-BLRI). To simplify calculations of
simple slopes in the multilevel models, the BT-BLRI scores were centered over the 27
therapists. WT-BLRI was negatively related to SC-PFT at intake, r(155) 5 .23,
po.01.
Results
Our principal data analytic strategy was multilevel modeling, using PROC MIXED,
version 9.1 (SAS Institute, 2004) and maximum likelihood estimation. The
dependent variables were the CMI and SC-PFT, each measured at five time points
(0 weeks, 4 weeks, 8 weeks, 12 weeks, and 16 weeks). CMI scores were multiplied
by 100 for ease of presentation. Descriptive statistics for the CMI and SC-PFT
are presented in Table 1. The effect size r was calculated for each significant effect
using the formula: r 5 [F/(F1df)]1/2 (Rosnow & Rosenthal, 1996). Cohen (1988)
characterized r 5 .10 as a ‘‘small’’ effect and r 5 .30 as a ‘‘medium’’ effect.
Journal of Clinical Psychology
DOI: 10.1002/jclp
688
Journal of Clinical Psychology, July 2010
Table 1
Means, Standard Deviations, and Sample Sizes for Composite Maladjustment Index and SelfCritical Perfectionism
CMI
Time
0 weeks
4 weeks
8 weeks
12 weeks
16 weeks
SC-PFT
M
SD
N
M
SD
N
113.20
33.44
11.18
25.67
67.64
58.74
87.57
98.34
96.10
94.75
157
144
129
114
113
49.50
46.86
43.32
43.87
39.94
17.26
17.52
17.19
17.41
16.59
157
143
126
114
112
Note: CMI 5 composite maladjustment index; SC-PFT 5 self-critical perfectionism.
The amount of between-therapist and within-therapist variance in B-L RI scores
was estimated using a multilevel model with a random intercept for therapist and no
fixed effect predictors (Snijders & Bosker, 1999). The relative size of betweentherapist and within-therapist variance was assessed using the intraclass correlation
(ICC), which is defined as the ratio of between-therapist variance to the sum of the
between-therapist and within-therapist variance. The ICC for the B-L RI was .098,
indicating that about 10% of the variance in patients’ perceptions of the Rogerian
conditions could be accounted for by differences between the therapists. Thus, the
majority of the variance was within-therapists, but between-therapists variability was
present as well.
Predictive models for changes in the CMI and SC-PFT were developed using a
sequential strategy (Wallace & Green, 2002), beginning by evaluating the fixed
effects part of the model while assuming a simple structure for the random effects
part. The initial models included a random intercept for patients and allowed the
error variance at each time point to vary (a UN(1) structure in PROC MIXED
notation). In analyses for the CMI, time was log-transformed (i.e., time 5 log [11
weeks]) to achieve a nearly linear relationship with time. The relationship between
SC-PFT and time (i.e., week of treatment) was close to linear, so time was
untransformed in those analyses. In both analyses, a negative slope for time
indicated change in the desired direction, decreased maladjustment or decreased
SC-PFT. The fixed effects in the initial models were as follows: treatment condition,
BT-BLRI, WT-BLRI, and the interactions of these three predictors with time.
Interactions with time were of particular interest, because they would indicate that
the rate of improvement was predicted by treatment condition or aspects of the
therapeutic relationship. The Condition Time effect was nonsignificant for both
the CMI and SC-PFT. We, therefore, deleted the effects involving Condition.
We also tested for the presence of cross-level interactions, that is, BT-BLRI WT-BLRI Time effects. In no case was this interaction significant, so all models
described below omit the cross-level interaction.
Our second step was to explore alternative error covariance structures. It is
important to find the most appropriate error structure because the fixed and random
components of mixed models are linked; misspecification of one can lead to
inaccurate estimates in the other (Wallace & Green, 2002). In particular, we tested
whether the model fit was improved according to the AIC criterion when: (a) a
random intercept for therapist was added, (b) a random slope for time was added, or
(c) the UN(1) error covariance structure was replaced by either an autoregressive
Journal of Clinical Psychology
DOI: 10.1002/jclp
Between-Therapist and Within-Therapist Differences
689
(AR[1]) or autoregressive with heterogeneous error variance (ARH[1]) structures.
BT-BLRI and WT-BLRI were left in raw score units in these analyses; consequently,
the reported slopes indicate the effect of a difference of one unit in raw scores on the
BLRI. We also explored whether the effect of patient variability in Rogerian
conditions varied depending on the therapist; that is, we added a random effect for
therapist for the WT-BLRI Time interaction. No evidence was found for
significant variance in that parameter, so this random effect was omitted from all
models.
Composite Maladjustment Index
The final model for the CMI included a random slope for log-transformed time
in addition to the random intercept for patients and the UN(1) error covariance
structure. Significant effects were found for both the BT-BLRI Time
(F(1, 497) 5 9.79, po.01, effect size r 5 .281) and WT-BLRI Time interactions
(F(1, 497) 5 4.73, po.05, effect size r 5 .195). The regression parameters for the
interaction effects were, respectively,
1.56 (SE 5 .50) and
.65 (SE 5 .30).
Although the parameter for the between-therapist effect was twice as large as the
parameter for the within-therapist effect, that difference only approached
significance, t(497) 5 1.60, po.12. To interpret the interactions, we calculated
slopes of the CMI as a function of time (i.e., the rates at which the CMI decreased)
for patients whose therapists fell above or below the mean (11 or 1) for BT-BLRI
and for patients whose WT-BLRI fell above or below (11 or 1) the mean for their
therapist. As expected, patients whose therapists were characterized by high mean
scores on the B-L RI improved more quickly (slope 5 55.68, SE 5 2.87, po.001)
than patients whose therapists were characterized by low mean scores on the B-L RI
(slope 5 52.56, SE 5 2.94, po.001). As well, patients whose B-L RI scores were
higher than the average of the other patients for their therapist improved more
rapidly (slope 5 54.77, SE 5 2.86, po.001) than did patients with B-L RI scores
that were below the average for other patients of their therapist (slope 5 53.46,
SE 5 2.89, po.001).
Results expressed in terms of the effects of B-L RI units on slopes per unit of logtime are difficult to understand and distant from clinical realities. To illustrate the
impact of differences in Rogerian conditions more directly, we calculated estimated
values for the CMI at each time point for patients with above average (110) or
below average ( 10) therapists. We focused on differences of 10 because it is close to
the SD of the raw B-L RI in this population and also represents nearly the full range
of BT-BLRI scores. It can be seen from Figure 1 that high scores on the BT-BLRI
and, to a lesser extent, the WT-BLRI, predicted more rapid declines in the CMI. The
difference in the decrease in predicted CMI scores from intake to termination for the
patients of therapists with low versus high average levels of perceived Rogerian
conditions was 103.4 units on the CMI, that is, a full standard deviation of the CMI
at the beginning of therapy.
Self-Critical Perfectionism
The final model for SC-PFT replaced the UN(1) error covariance structure with an
ARH(1) structure. The autoregressive correlation was .42, po.001. A significant
effect was found for BT-BLRI Time (F(1, 492) 5 7.60, po.01, effect size r 5 .249),
but not for WT-BLRI Time (p4.50). The regression parameters for the interaction
effects were, respectively, .035 (SE 5 .013) and .004 (SE 5 .008), and differed
Journal of Clinical Psychology
DOI: 10.1002/jclp
690
Journal of Clinical Psychology, July 2010
Figure 1. Estimated values for the Composite Maladjustment Index over the course of treatment for
patients with low ( 10) vs. high (110) scores on the BT-BLRI and WT-BLRI. BT-BLRI 5 BetweenTherapists Barrett-Lennard Relationship Inventory. WT-BLRI 5 Within-Therapists Barrett-Lennard
Relationship Inventory.
significantly, t(492) 5 2.12, po.05. As expected, patients whose therapists were
characterized by high mean scores on the B-L RI showed a more rapid reduction in
vulnerability (slope 5 .662, SE 5 .072, po.001) than patients whose therapists were
characterized by low mean scores on the B-L RI (slope 5 .593, SE 5 .074, po.001).
These results are illustrated in Figure 2. The difference in SC-PFT change from
intake to termination for the patients of the more Rogerian versus the less Rogerian
therapists was 11.06 SC-PFT units, or about two-thirds of the SD of SC-PFT at
intake.
Because of the association of SC-PFT with measures of depression (Zuroff, Blatt,
Sanislow, Bondi, & Pilkonis, 1999), it is conceivable that the results for SC-PFT were
not independent findings, but simply reflected shared variance with the CMI. We,
therefore, repeated the analyses, including the CMI as a time-varying covariate. The
same pattern of results was obtained. The BT-BLRI Time effect remained
significant (F(1, 491) 5 4.43, po.05, effect size r 5 .190), although the regression
parameter was somewhat smaller ( .025, SE 5 .012). The regression coefficients for
the BT-BLRI Time and WT-BLRI Time effects remained significantly different,
t(491) 5 1.99, po.05.
Because initial severity moderated some of the primary TDCRP analyses (Elkin
et al., 1989), the preceding analyses for the CMI and for SC-PFT were repeated,
Journal of Clinical Psychology
DOI: 10.1002/jclp
Between-Therapist and Within-Therapist Differences
691
Figure 2. Estimated values of Self-Critical Perfectionism over the course of treatment for patients with
low ( 10) vs. high (110) scores on the BT-BLRI. BT-BLRI 5 Between-Therapists Barrett-Lennard
Relationship Inventory.
introducing initial severity as assessed by the HRSD and then initial severity as
assessed by the GAS as potential moderators. In neither case did the triple
interaction of severity BT-BLRI Time or the triple interaction of Severity WTBLRI Time achieve statistical significance. Thus, there was no evidence that the
between-therapist or within-therapist components of the B-L RI had different effects
on rate of improvement for initially more severely or less severely depressed patients.
Discussion
The fact that therapeutic outcome is predicted by dimensions of the therapeutic
relationship is well-established (Norcross, 2002), and in particular, such relationships
have been extensively examined within the TDCRP (e.g., Krupnick et al., 1996;
Zuroff & Blatt, 2006). The importance of the present findings lies in the explication
of why these associations exist. Our analyses were an attempt to replicate and extend
Baldwin et al.’s (2007) finding that the relation between the therapeutic alliance and
outcome is explainable in terms of differences between therapists in the average
strength of the alliances they establish. Three principal results emerged. First, we
found that between-therapist variability in patients’ perceptions of the Rogerian
conditions was related to overall clinical improvement, as indexed by the CMI.
Second, we found that between-therapist variability in perceptions of the Rogerian
conditions was related to reduced vulnerability to depression, as indexed by SC-PFT.
These findings applied equally across the CBT, IPT, and PLA-CM conditions, and
they applied equally to patients with low and high levels of initial severity. Finally,
there was a smaller but statistically significant effect demonstrating that better
outcomes on the CMI were achieved by patients who experienced a higher level of
the Rogerian conditions than the average patient seen by their therapist. Thus, the
present analyses demonstrate that previously reported associations between
Rogerian conditions and outcome in the treatment of depression are primarily,
but not exclusively, explainable in terms of differences between therapists. That
conclusion underlines the importance of researchers’ devoting increased attention to
therapists as key determinants of outcome (Baldwin et al., 2007; Wampold, 2001).
Journal of Clinical Psychology
DOI: 10.1002/jclp
692
Journal of Clinical Psychology, July 2010
Between-Therapist Effects
Between-therapist differences in aspects of the therapeutic relationship emerged as
predictors of outcome in both Baldwin et al. (2007) and the present analyses, despite
numerous methodological differences between the studies. Thus, there appears to be
robust evidence that variability between therapists in the capacity to establish
constructive therapeutic relationships is a key predictor of outcome. The present
analyses extend Baldwin et al.’s findings, by focusing on a different dimension of the
therapeutic relationship (Rogerian conditions vs. therapeutic alliance) and by
examining changes in vulnerability as well as changes in maladjustment.
The proportion of between-therapist variance in B–L RI scores was about 10% in
the TDCRP, and the proportion of between-therapist variance in working alliance
scores was about 6% in the Research Consortium dataset (S.A. Baldwin, personal
communication, April 16, 2009). The association of outcome and between-therapist
variability is all the more striking given that the large majority of variance in
therapeutic relationship measures appears to be within-therapist. In other words, it is
the small fraction of between-therapist variance that does most of the ‘‘work’’ in
predicting outcome. This may be one reason for the generally modest size of most
reported correlations between measures of the therapeutic relationship and outcome
(Martin, Garske, & Davis, 2000); these overall correlations confound stronger
between-therapist effects with weaker within-therapist effects. The degree to which
the total correlation masks the between-therapist effect is related to the ratio of
patients to therapists; the more patients per therapist, the greater the extent to which
the total correlation will be attenuated.
Within-Therapist Effects
Baldwin et al. (2007) did not find a significant effect of within-therapist differences in
therapeutic alliance on outcome, whereas we found a significant effect of withintherapist differences in perceived Rogerian conditions on maladjustment. It is
possible that the discrepancy between the two studies simply reflects the greater
power afforded by our repeated measures design. Alternatively, the within-therapist
effect on the CMI might be specific to the patient population (depressed outpatients)
or the predictor variable (Rogerian conditions) used in the TDCRP. Depressed
individuals are sensitive to cues of rejection (Ayduk, Downey, & Kim, 2001), and so
perceiving one’s therapist as displaying unconditional positive regard might be
especially therapeutic for individuals with major depression. No within-therapist
effect of perceived Rogerian conditions was found for the SC-PFT; this might reflect
the slower pace at which the vulnerability of SC-PFT changes compared with
maladjustment (Hawley, Ho, Zuroff, & Blatt, 2006). A within-therapist effect on
vulnerability measures such as SC-PFT might become apparent in longer-term
treatments than the ones provided in the TDCRP.
The relatively weak or absent effects of within-therapist measures are somewhat
counter-intuitive and puzzling. One possible explanation is that between-therapist
differences are measured more reliably and validly because they are derived by
aggregating over multiple patients. Within-therapist differences reflect ratings by a
single patient at a single point in time and may contain irrelevant variance related to
patients’ response biases in using the rating scales (Baldwin et al., 2007). Because
between-therapist measures contain more valid variance, they would be expected to
yield stronger effects. A second possibility is based on the contextual model of
psychotherapy (Wampold, 2001), which suggests that it is the therapeutic context as
Journal of Clinical Psychology
DOI: 10.1002/jclp
Between-Therapist and Within-Therapist Differences
693
a whole, rather than its individual constitutive elements, that is responsible for
therapeutic change. It may be that between-therapist differences in relationship
measures serve as markers of differences in the global therapeutic context
(Wampold, 2001); that is, therapists who are consistently perceived as high in the
Rogerian conditions may also be the therapists who are able to orchestrate the full
range of contextual factors (e.g., fostering hope, providing a coherent and
meaningful treatment rationale, and providing a treatment that is consistent with
the rationale) into a transformative therapeutic experience. If between-therapist
measures are indices of the overall therapeutic context, rather than merely one part
of the context, they would be expected to be strongly related to outcome.
Generalizability
Well-designed and comprehensive as it was, the specific design features of the
TDCRP limit the generalizability of our results. Those features include the
short-term, manualized nature of the treatments, the limited range of well-trained,
well-supervised doctoral-level therapists, the homogeneous sample of depressed
outpatients, and the random assignment of patients to conditions. Because each of
these features differs in the Research Consortium dataset analyzed by Baldwin et al.
(2007), we believe that the core findings are likely to be highly generalizable.
Nevertheless, there is certainly a need for studies that investigate the impact of
between-therapist and within-therapist components of relationship variables in other
psychiatric disorders and in other forms of treatment.
Implications
Research and training in mental health services remain focused on differences
between treatments rather than differences between treatment providers (Blatt &
Zuroff, 2005; Kim et al., 2006). What are the implications of acknowledging that
between-therapist differences are important (arguably, more important) sources of
differences in patients’ outcomes?
First, more research effort needs to be devoted to determining the characteristics
that distinguish therapists who, in general, tend to be experienced as providing low
or high levels of the Rogerian conditions. A wide variety of characteristics could be
pertinent, including personality traits, interpersonal styles, interpersonal skills, and
preferred therapeutic strategies and interventions. Ackerman and Hilsenroth’s (2003)
review of predictors of the alliance can provide guidance for investigations of
therapist variability in providing Rogerian conditions. They found that several
therapist attributes (e.g., flexible, respectful, warm, and open) and several types of
interventions (e.g., reflection, support, affirming, accurate interpretation, and
facilitating affective expression) were associated with more positive alliances.
Dispositional qualities of the therapist such as the Big Five traits (Chapman,
Talbot, Tatman, & Britton, 2009) and compassion (Gilbert, 2009) are also plausible
antecedents of the capacity to provide high levels of the Rogerian conditions.
Second, psychotherapy researchers need to include multiple therapists in their
designs whenever possible so as to permit the separation of between-therapist and
within-therapist effects. Moreover, researchers should bear in mind that the
conceptual distinction between within-therapist and between-therapist effects, and
the statistical machinery for separating them, do not apply solely to therapeutic
relationship variables. They can be applied to almost any variable related to
psychotherapy process, including, for example, clients’ adherence to treatment,
Journal of Clinical Psychology
DOI: 10.1002/jclp
694
Journal of Clinical Psychology, July 2010
clients’ motivation for treatment, clients’ interpersonal behavior towards their
therapists, and therapists’ behavior towards their clients.
Finally, for graduate programs charged with training therapists and certifying
their competence, the key implication is that trainees’ average levels of therapeutic
relationship variables should be targets of training and ongoing monitoring.
Training programs should view these outcomes as at least equally important as the
acquisition of the specific skill sets associated with particular schools of therapy.
References
Ackerman, S.J., & Hilsenroth, M.J. (2003). A review of therapist characteristics and
techniques positively impacting the therapeutic alliance. Clinical Psychology Review, 23,
1–33.
Ayduk, O., Downey, G., & Kim, M. (2001). Rejection sensitivity and depressive symptoms in
women. Personality and Social Psychology Bulletin, 27, 868–877.
Baldwin, S.A., Wampold, B.E., & Imel, Z.E. (2007). Untangling the alliance-outcome
correlation: Exploring the relative importance of therapist and patient variability in the
alliance. Journal of Consulting and Clinical Psychology, 75, 842–852.
Barrett-Lennard, G.T. (1962). Dimensions of therapist response as causal factors in
therapeutic change. Psychological Monographs, 76 (43, Whole No. 562).
Barrett-Lennard, G.T. (1986). The Relationship Inventory now: Issues and advances in
theory, method and use. In L.S. Greenberg and W.M. Pinsof (Eds.), The psychotherapeutic
process: A research handbook (pp. 439–476). New York: Guilford.
Beck, A.T., Ward, C.H., Mendelson, M., Mock, J., & Erbaugh, J. (1961). An inventory for
measuring depression. Archives of General Psychiatry, 4, 561–571.
Blatt, S.J. (1974). Levels of object representation in anaclitic and introjective depression.
Psychoanalytic Study of the Child, 29, 107–157.
Blatt, S.J. (2004). Experiences of depression: Theoretical, clinical and research perspectives.
Washington, DC: American Psychological Association.
Blatt, S.J., D’Afflitti, J.P., & Quinlan, D.M. (1976). Experiences of depression in normal
young adults. Journal of Abnormal Psychology, 85, 383–389.
Blatt, S.J., Zuroff, D.C., Hawley, L.L., & Auerbach, J.S. (2010). Patient personality, the
therapeutic relationship, and change in mental representation: Fundamental determinants
of sustained therapeutic change. Psychotherapy Research, 9, 37–64.
Blatt, S.J., Quinlan, D.M., Pilkonis, P.A., & Shea, T. (1995). Impact of perfectionism and need
for approval on the brief treatment of depression: The National Institute of Mental Health
Collaborative Research Program revisited. Journal of Consulting and Clinical Psychology,
63, 125–132.
Blatt, S.J., Sanislow, C.A., Zuroff, D.C., & Pilkonis, P.A. (1996). Characteristics of effective
therapists: Further analyses of data from the NIMH TDCRP. Journal of Consulting and
Clinical Psychology, 64, 1276–1284.
Blatt, S.J., & Zuroff, D.C. (2005). Empirical evaluation of the assumptions in identifying
evidence based treatments in mental health. Clinical Psychology Review, 25, 459–486.
Blatt, S.J., Zuroff, D.C., Bondi, C.M., Sanislow, C.A., & Pilkonis, P. (1998). When and how
perfectionism impedes the brief treatment of depression: Further analyses of the NIMH
TDCRP. Journal of Consulting and Clinical Psychology, 66, 423–428.
Blatt, S.J., Zuroff, D.C., Quinlan, D.M., & Pilkonis, P. (1996). Interpersonal factors in brief
treatment of depression: Further analyses of the NIMH treatment of depression
collaborative research program. Journal of Consulting and Clinical Psychology, 64,
162–171.
Journal of Clinical Psychology
DOI: 10.1002/jclp
Between-Therapist and Within-Therapist Differences
695
Brownson, C. (2004). The Research Consortium of Counseling and Psychological Services in
Higher Education: Report of activities. Retrieved May 11, 2007, from http://www.utexas.
edu/student/cmhc/research/rescon.html
Cane, D.B., Olinger, J., Gotlib, I.H., & Kuiper, N.A. (1986). Factor structure of the
Dysfunctional Attitude Scale in a student population. Journal of Clinical Psychology, 42,
307–309.
Chapman, B.P., Talbot, N., Tatman, A.W., & Britton, P.C. (2009). Personality traits and the
working alliance in psychotherapy trainees: An organizing role for the five factor model?
Journal of Social and Clinical Psychology, 28, 577–596.
Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Hillsdale, NJ:
Erlbaum.
Crits-Christoph, P., & Mintz, J. (1991). Implications of therapist effects for the design and
analysis of comparative studies of psychotherapies. Journal of Consulting and Clinical
Psychology, 59, 20–26.
Derogatis, L.R., Lipman, R.S., & Covi, L. (1973). SCL-90: An outpatient psychiatric rating
scale-Preliminary report. Psychopharmacology Bulletin, 9, 13–28.
Dinger, U., Strack, M., Leichsenring, F., Wilmers, F., & Schauenburg, H. (2008). Therapist
effects on outcome and alliance in inpatient psychotherapy. Journal of Clinical Psychology,
64, 344–354.
Dunkley, D.M., & Kyparissis, A. (2008). What is DAS self-critical perfectionism really
measuring? Relations with the five-factor model of personality and depressive symptoms.
Personality and Individual Differences, 44, 1295–1305.
Elkin, I. (1994). The NIMH Treatment of Depression Collaborative Research Program: Where
we began and where we are now. In A.E. Bergin and S.L. Garfield (Eds.), Handbook of
psychotherapy and behavior change (4th ed.) (pp. 114–135). New York: Wiley.
Elkin, I., Parloff, M.B., Hadley, S.W., & Autry, J.H. (1985). NIMH Treatment of Depression
Collaborative Research Program: Background and research plan. Archives of General
Psychiatry, 42, 305–316.
Elkin, I., Shea, M.T., Watkins, J.T., Imber, S.D., Sotsky, S.M., Collins, J.F., et al. (1989).
NIMH Treatment of Depression Collaborative Research Program: General effectiveness of
treatments. Archives of General Psychiatry, 46, 971–982.
Endicott, J., Spitzer, R.L., Fleiss, J.L., & Cohen, J. (1976). The Global Assessment Scale: A
procedure for measuring overall severity of psychiatric disturbance. Archives of General
Psychiatry, 33, 766–771.
Gilbert, P. (2009). The compassionate mind: A new approach to facing the challenges of life.
London: Constable Robinson.
Gurman, A.S. (1977). The patient’s perception of the therapeutic relationship. In
A.S. Gurman & A.M. Razin (Eds.), Effective psychotherapy: A handbook of research
(pp. 503–543). New York: Pergamon.
Hamilton, M. (1967). Development of a rating scale for primary depressive illness. British
Journal of Social and Clinical Psychology, 6, 278–296.
Hatcher, R.L., & Barends, A.W. (2006). How a return to theory could help alliance research.
Psychotherapy: Theory, Research, Practice, Training, 43, 292–299.
Hawley, L.L., Ho, M.R., Zuroff, D.C., & Blatt, S.J. (2006). The relationship of perfectionism,
depression, and therapeutic alliance during treatment for depression: Latent difference
score analysis. Journal of Consulting and Clinical Psychology, 74, 930–942.
Horvath, A.O., & Bedi, R.P. (2002). The alliance. In J.C. Norcross (Ed.), Psychotherapy
relationships that work: Therapist contributions and responsiveness to patients (pp.
37–69). London: Oxford University Press.
Horvath, A.O., & Greenberg, L.S. (1989). Development and validation of the Working
Alliance Inventory. Journal of Counseling Psychology, 36, 223–233.
Journal of Clinical Psychology
DOI: 10.1002/jclp
696
Journal of Clinical Psychology, July 2010
Huppert, J.D., Bufka, L.F., Barlow, D.H., Gorman, J.M., Shear, M.K., & Woods, S.W.
(2001). Therapists, therapist variables, and cognitive-behavioral therapy outcome in a
multicenter trial for panic disorder. Journal of Consulting and Clinical Psychology, 69,
747–755.
Imber, S.D., Pilkonis, P.A., Sotsky, S.M., Elkin, I., Watkins, J.T., Collins, J.F., et al. (1990).
Mode-specific effects among three treatments of depression. Journal of Consulting and
Clinical Psychology, 58, 352–359.
Kim, D.M., Wampold, B.E., & Bolt, D.M. (2006). Therapist effects in psychotherapy:
A random-effects modeling of the National Institute of Mental Health Treatment of
Depression Collaborative Research Program data. Psychotherapy Research, 16, 161–172.
Krupnick, J.L., Sotsky, S.M., Simmens, S., Moyer, J., Elkin, I., Watkins, J., et al. (1996). The
role of the therapeutic alliance in psychotherapy and pharmacotherapy outcome: Findings
in the National Institute of Mental Health Treatment of Depression Collaborative
Research Program. Journal of Consulting and Clinical Psychology, 64, 532–539.
Lambert, M.J., Morton, J.J., Hatfield, D., Harmon, C., Hamilton, S., Reid, R.C., et al. (2004).
Administration and scoring manual for the Outcome Questionnaire (OQ-45.2). Orem, UT:
American Professional Credentialing Service.
Lutz, W., Leon, S.C., Martinovich, Z., Lyons, J.S., & Stiles, W.B. (2007). Therapist effects in
outpatient psychotherapy: A three-level growth curve approach. Journal of Counseling and
Clinical Psychology, 54, 32–39.
Martin, D.J., Garske, J.P., & Davis, M.K. (2000). Relation of the therapeutic alliance with
outcome and other variables: A meta-analytic review. Journal of Counseling and Clinical
Psychology, 68, 438–450.
McKay, K.M., Imel, Z.E., & Wampold, B.E. (2006). Psychiatrist effects in the psychopharmacological treatment of depression. Journal of Affective Disorders, 92, 287–290.
Norcross, J.C. (Ed.). (2002). Psychotherapy relationships that work: Therapist contributions
and responsiveness to patients. New York: Oxford University Press.
Okiishi, J.C., Lambert, M.J., Eggett, D., Nielsen, L., Dayton, D.D., & Vermeersch, D.A.
(2006). An analysis of therapist treatment effects: Toward providing feedback to individual
therapists on their clients’ psychotherapy outcome. Journal of Clinical Psychology, 62,
1157–1172.
Okiishi, J., Lambert, M.J., Nielsen, S.L., & Ogles, B.M. (2003). Waiting for supershrink: An
empirical analysis of therapist effects. Clinical Psychology and Psychotherapy, 10, 361–373.
Powers, T.A., Zuroff, D.C., & Topciu, R. (2004). Covert and overt expressions of self-criticism
and perfectionism and their relation to depression. European Journal of Personality, 18,
61–72.
Rogers, C.R. (1951). Client-centered counselling. Boston: Houghton-Mifflin.
Rogers, C.R. (1957). The necessary and sufficient conditions of therapeutic personality
change. Journal of Consulting and Clinical Psychology, 21, 95–103.
Rosnow, R.L., & Rosenthal, R. (1996). Computing contrasts, effect sizes, and counternulls on
other people’s published data: General procedures for research consumers. Psychological
Methods, 1, 331–340.
SAS Institute. (2004). SAS/STAT user’s guide, Version 9. Cary, NC: Author.
Shahar, G., Blatt, S.J., Zuroff, D.C., Krupnick, J.L., & Sotsky, S.M. (2004). Perfectionism
impedes social relations and response to brief treatment for depression. Journal of Social
and Clinical Psychology, 23, 140–154.
Shahar, G., Blatt, S.J., Zuroff, D.C., & Pilkonis, P. (2003). Role of perfectionism and
personality disorder features in response to brief treatment for depression. Journal of
Consulting and Clinical Psychology, 71, 629–633.
Shea, M.T., Elkin, I., Imber, S.D., Sotsky, S.M., Watkins, J.T., Collins, J.F., et al. (1992).
Course of depressive symptoms over follow-up: Findings from the National Institute of
Journal of Clinical Psychology
DOI: 10.1002/jclp
Between-Therapist and Within-Therapist Differences
697
Mental Health Treatment of Depression Collaborative Research Program. Archives of
General Psychiatry, 49, 782–787.
Snijders, T., & Bosker, R. (1999). Multilevel analysis: An introduction to basic and advanced
multilevel modeling. London: Sage.
Wallace, D., & Green, S.B. (2002). Analysis of repeated-measures designs with linear mixed
models. In D.S. Moskowitz & S.L. Hershberger (Eds.), Analyses of repeated measures
data: Advances and methods (pp. 103–134). Hillsdale, NJ: Erlbaum.
Wampold, B.E. (2001). The great psychotherapy debate: Models, methods, and findings.
Mahwah, NJ: Erlbaum.
Wampold, B.E., & Brown, G.S. (2005). Estimating variability in outcomes attributable to
therapists: A naturalistic study of outcomes in managed care. Journal of Consulting and
Clinical Psychology, 73, 914–923.
Weissman, A.N., & Beck, A.T. (1978). Development and validation of the Dysfunctional
Attitude Scale: A preliminary investigation. Paper presented at the meeting of the
American Educational Research Association, Toronto, ON.
Weissman, M.M., & Paykel, E.S. (1974). The depressed woman: Study of social relationships.
Chicago: University of Chicago Press.
Zuroff, D.C., & Blatt, S.J. (2002). Vicissitudes of life after the short-term treatment of
depression: Roles of stress, social support, and personality. Journal of Social and Clinical
Psychology, 21, 473–496.
Zuroff, D.C., & Blatt, S.J. (2006). The therapeutic relationship in the brief treatment of
depression: Contribution to clinical improvement and enhanced adaptive capacity. Journal
of Consulting and Clinical Psychology, 74, 130–140.
Zuroff, D.C., Blatt, S.J., Sanislow, III, C.A., Bondi, C.M., Pilkonis, P.A. (1999). Vulnerability
to depression: Reexamining state dependence and relative stability. Journal of Abnormal
Psychology, 108, 76–89.
Zuroff, D.C., Blatt, S.J., Sotsky, S.M., Krupnick, J.L., Martin, D.J., Sanislow, C.A., et al.
(2000). Relation of therapeutic alliance and perfectionism to outcome in brief outpatient
treatment of depression. Journal of Consulting and Clinical Psychology, 68, 114–124.
Zuroff, D.C., Mongrain, M., & Santor, D.A. (2004). Conceptualizing and measuring
personality vulnerability to depression: Comment on Coyne and Whiffen (1995).
Psychological Bulletin, 130, 489–511.
Journal of Clinical Psychology
DOI: 10.1002/jclp