azu_etd_12589_sip1_m

PEER-ADMINISTERED INTERVENTIONS FOR DEPRESSION:
A META-ANALYTIC REVIEW
by
Amanda E. B. Bryan
_____________________
Copyright © Amanda E. B. Bryan 2013
A Dissertation Submitted to the Faculty of the
Department of Psychology
In Partial Fulfillment of the Requirements
For the Degree of
DOCTOR OF PHILOSOPHY
In the Graduate College
THE UNIVERSITY OF ARIZONA
2013
2
THE UNIVERSITY OF ARIZONA
GRADUATE COLLEGE
As members of the Dissertation Committee, we certify that we have read the dissertation
prepared by Amanda Bryan
entitled Peer-Administered Interventions for Depression: A Meta-Analytic Review
and recommend that it be accepted as fulfilling the dissertation requirement for the
Degree of Doctor of Philosophy
____________________________________________________________Date: 3/8/2013
Hal Arkowitz
____________________________________________________________Date: 3/8/2013
John Allen
___________________________________________________________ Date: 3/8/2013
David Sbarra
___________________________________________________________ Date: 3/8/2013
Richard Bootzin
Final approval and acceptance of this dissertation is contingent upon the candidate's
submission of the final copies of the dissertation to the Graduate College.
I hereby certify that I have read this dissertation prepared under my direction and
recommend that it be accepted as fulfilling the dissertation requirement.
____________________________________________________________Date: 3/8/2013
Dissertation Director: Hal Arkowitz
3
STATEMENT BY AUTHOR
This dissertation has been submitted in partial fulfillment of requirements for an
advanced degree at the University of Arizona and is deposited in the University Library
to be made available to borrowers under rules of the Library.
Brief quotations from this dissertation are allowable without special permission, provided
that accurate acknowledgment of source is made. Requests for permission for extended
quotation from or reproduction of this manuscript in whole or in part may be granted by
the copyright holder.
SIGNED: Amanda E. B. Bryan
4
ACKNOWLEDGEMENTS
Many thanks to my advisor, Hal Arkowitz, who has been a supportive and
inspiring guide and mentor. Thanks also to my committee members, John Allen, Dave
Sbarra, and Dick Bootzin, each of whom has been a wonderful and influential teacher to
me over the last several years. My sincerest appreciation to research assistants Karlee
Frye, Sean Gallagher, and Joseph Santa Cruz for their generous and capable work on
coding the studies included in this analysis. Finally, warmest thanks to my entire family
for their love, patience, and support as I’ve found my circuitous way through graduate
school.
5
DEDICATION
For Andrew, and the many years of discovery that await us.
6
TABLE OF CONTENTS
LIST OF FIGURES……………………………………………………………..8
LIST OF TABLES……………………………………………………………....9
ABSTRACT…………………………………………………………………….10
1. BACKGROUND AND RATIONALE……………………………………....12
Effectiveness of nonprofessional mental health services………………….......13
Prior meta-analyses of peer interventions for depression….............................15
Why peer interventions?.....................................................................................17
The present study…...........................................................................................19
2. METHOD…......................................................................................................20
Study selection…...............................................................................................20
Data extraction…..............................................................................................21
Effect size calculation…....................................................................................22
Weighted mean effect size calculation…...........................................................25
Moderation analysis…......................................................................................27
Alternate metrics of outcome…........................................................................28
Follow-up data….............................................................................................28
Publication bias…............................................................................................29
3. RESULTS…....................................................................................................32
Study characteristics…....................................................................................32
Depression measurement….............................................................................32
Observed effect sizes…....................................................................................33
7
TABLE OF CONTENTS – CONTINUED
Moderation models….....................................................................................35
Follow-up effect sizes…..................................................................................37
Outcomes using alternate metrics…...............................................................38
Impact of providing peer support…...............................................................39
Publication bias…..........................................................................................39
4. DISCUSSION…............................................................................................41
Summary and interpretation of findings….....................................................41
Clinical significance of findings….................................................................44
Methodological limitations….........................................................................45
Future directions…........................................................................................46
FIGURES…......................................................................................................49
TABLES…........................................................................................................61
APPENDIX A: CODING FORM.....................................................................73
APPENDIX B: METHODOLOGICAL QUALITY SCORE (MQS)
CODING SYSTEM…............................................................77
REFERENCES….............................................................................................78
8
LIST OF FIGURES
Figure 1:
Flow of studies.............................................................................49
Figure 2:
Forest plot of effect sizes comparing
PAI to active therapy conditions..................................................50
Figure 3:
Forest plot of effect sizes comparing
PAI to TAU/WL conditions.........................................................51
Figure 4:
Forest plot of effect sizes of PAI conditions
(change from pre- to post-treatment)...........................................52
Figure 5:
Moderators of standardized mean gain (SMG) for
PAI conditions (change from pre- to post-treatment)..................53
Figure 6:
Moderators of standardized mean difference (SMD)
comparing PAI to active therapy conditions...............................54
Figure 7:
Moderators of standardized mean difference (SMD)
comparing PAI to TAU/WL conditions......................................55
Figure 8:
Methodological quality is not significantly
associated with pre-post effect size.............................................56
Figure 9:
Methodological quality is not significantly
associated with between-groups effect size
(a) PAI compared with active therapy conditions......................57
(b) PAI compared with TAU/WL...............................................58
Figure 10:
Funnel plot of effect sizes comparing
PAI to active therapy conditions.................................................59
Figure 11:
Funnel plot of effect sizes comparing
PAI to TAU/WL conditions........................................................60
9
LIST OF TABLES
Table 1:
Descriptive characteristics of clinical trials
involving PAIs..........................................................................61
Table 2:
Characteristics of depression symptom
measurement instruments..........................................................67
Table 3:
Results of included studies.......................................................68
Table 4:
Mean SMD estimates (PAI compared with
active therapy conditions) adjusted for four
models of selection (publication) bias......................................71
Table 5:
Mean SMD estimates (PAI compared with
TAU/WL conditions) adjusted for four
models of selection (publication) bias......................................72
10
ABSTRACT
A variety of psychotherapies have been demonstrated to be efficacious and effective
treatments for depression. The cost of psychotherapy, however, and its low availability
in some contexts pose significant treatment barriers for many depressed individuals.
Based on the idea that peers (i.e., individuals who have successfully recovered from
similar problems) may be uniquely able to provide empathy and support to those
currently receiving treatment, some community mental health centers have implemented
peer treatment models that employ recovered former clients as cost-effective adjunct
providers. The effectiveness of these and other peer-administered interventions (PAIs)
has not been well-established. The current study is a meta-analysis of the existing
outcome research on PAIs for depression. Twenty-six studies were identified as eligible
for inclusion and yielded 30 between-groups effect sizes and 29 pre-post PAI effect sizes.
Study characteristics and methodological quality were coded and random-effects models
were used to calculate and compare mean effect sizes. PAIs produced significant pre-topost treatment reductions in depression symptoms that were comparable to those found in
well-established professionally-administered interventions (.4554). In direct
comparisons, PAIs performed as well as professionally-administered treatments (.0848).
but not significantly better than treatment-as-usual (e.g., periodic physician check-ins or
availability of community mental health services) and wait-list control conditions (.0978).
These findings did not change after adjusting for the moderate degree of publication bias
in the data. Moderation models revealed that professionally-co-administered PAIs
produced significantly worse outcomes than those that were purely peer-administered,
11
and that educational/skills-based PAIs (but not supportive PAIs) produced better
outcomes compared with professional treatments. Limitations of this analysis included
the heterogeneity of the included interventions and the lack of data on mediators and
moderators. Still, these findings suggest that PAIs have promise as effective depression
treatments and are worthy of further study.
12
1. BACKGROUND AND RATIONALE
“You never really understand a person until you consider things from his point of
view... Until you climb inside of his skin and walk around in it.”
― Harper Lee, To Kill a Mockingbird (1960)
Approximately 16.6% of Americans will meet the diagnostic criteria for Major
Depressive Disorder (MDD) during their lives (Kessler et al., 2005). The costs of MDD
both to individuals and to society are substantial, with suicide attempt rates 11 times
higher in individuals with lifetime diagnoses of depression than in individuals without
psychological diagnoses (Kessler, Borges, & Walters, 1999), and an estimated annual
economic burden of $83.1 billion incurred through direct, suicide-related, and lostproductivity costs of depression in the United States (Greenberg et al., 2003). Yet just
over half of individuals who have met DSM criteria for major depression within the last
year have received any treatment, and only about one-fifth have received adequate
treatment, defined as receiving pharmacotherapy for at least 30 days with at least four
physician visits, or at least eight psychotherapy sessions (Kessler et al., 2003).
Psychotherapeutic treatments for depression have demonstrated comparable
efficacy to antidepressant medication and possibly superior outcomes in preventing
relapse (Hollon et al., 2002), but the proportion of depressed individuals who receive
psychotherapy has declined in recent decades, as has the mean number of psychotherapy
sessions for depressed people who do receive it (Olfson et al., 2002). Contributing to this
13
discrepancy is the high cost of psychotherapy, which decreases the rate at which
psychotherapy is utilized in outpatient settings (e.g., Simon et al., 1996), especially
among lower-SES individuals (e.g., Steele, Dewa, & Lee, 2007). The expenses of
training and reimbursement for master’s- and doctoral-level providers contribute to the
costs of therapy, yet it is far from clear that level of training is correlated with treatment
quality or outcome (Stein & Lambert, 1984; Bickman, 1999); thus, exploring areas of
treatment need that could be fulfilled at a lower cost by providers with less training is
worthwhile. Along these lines, it has been suggested (e.g., Pfeiffer et al., 2011) that peer
support services could substantially reduce the barriers to psychosocial treatment of
depression by making services available at low costs, and by circumventing some of the
stigma associated with traditional psychotherapeutic and pharmacological treatments.
Effectiveness of nonprofessional mental health services
Similar to but not synonymous with peer support services, services provided by
paraprofessionals or “lay therapists” (i.e., individuals without professional credentials as
mental health providers, but not necessarily with personal experiences of the problems
they are treating) have shown promise in numerous research studies, suggesting that
individuals with little or no prior training as mental health providers may be able to
contribute effectively to treatments for depression. Paraprofessional therapists may
receive brief, role-specific preparatory training, but do not hold professional credentials
as health or mental health professionals, and generally have not received broad or
extensive training in mental health treatment. Den Boer, Wiersma, Russo, and van den
14
Bosch (2005) meta-analyzed controlled studies of paraprofessional treatment of anxiety
and depression, and found that the outcomes achieved by paraprofessionals delivering a
variety of cognitive-behavioral and supportive interventions were not statistically
different from those achieved by professionals, and were significantly better than waitlist
and placebo control conditions. In a recent review of cognitive-behavioral treatments for
anxiety and depression (Montgomery, Kunik, Wilson, Stanley, & Weiss, 2010), the
authors identified four studies that had compared professional and paraprofessional
outcomes and concluded that “paraprofessionals generally achieved comparable overall
outcomes” (p. 57). These and other summaries (e.g., Durlak, 1979; Hattie, Sharpley, &
Rogers, 1984; Berman & Norton, 1985; Christensen & Jacobson, 1994) have concluded it
is possible that high-quality treatment can be delivered by individuals who are not trained
as therapists.
Taking nonprofessional treatment services a step further, some community mental
health organizations have begun to integrate peer “consumer-provided” care – that is,
services delivered by current or former clients – into their treatment programs with
promising results. Like paraprofessionals, consumer providers typically have neither
professional mental health credentials nor a broader training background in a mental
health field; they, too, may receive brief, role-specific preparatory training, or they may
simply rely on their own treatment experiences for preparation. Chinman, Rosenheck,
Lam, and Davidson (2000) evaluated a case management program for seriously mentally
ill individuals in which some case managers were consumer providers and others were
not mental health care consumers; they found no significant interaction between time and
15
case manager type on clinical outcomes over 12 months. In a broader investigation of
consumer-provided services nationwide, Goldstrom and colleagues (2006) reported on a
survey conducted by the Substance Abuse and Mental Health Services Administration
(SAMHSA) of mutual support groups, self-help organizations “run by and for mental
health consumers,” and consumer-operated mental health services. They found that these
groups exceeded in number traditional mental health organizations, and that they served
upwards of half a million people per year. Regarding the effectiveness of these services,
Doughty and Tse (2011) reviewed 29 studies of consumer-led mental health services and
concluded that they produced outcomes similar to the outcomes of traditional mental
health services. Taken together, these findings suggest that peer-administered mental
health interventions are widely used and possibly effective in improving outcomes.
Prior meta-analyses of peer interventions for depression
In the more specific realm of depression, two recent meta-analyses have evaluated
the literature on interventions for depression that focus on the development of
relationships with peers (as opposed to with mental health professionals). Pfeiffer and
colleagues (2011) analyzed “peer support interventions” for depression, which were
defined as interventions that “placed individuals with current depression in regular
contact with at least one other person with either current or prior depression” (p. 30).
They included 10 controlled studies, and concluded that outcomes of peer support
interventions did not differ significantly from outcomes of group CBT, and that the
effects of peer support interventions compared with usual care were similar in size to the
16
effects of CBT compared with usual care in other published analyses. Mead and
colleagues (2010) evaluated the effect on depression symptoms of “befriending,” defined
as “an intervention that introduces the client to one or more individuals whose main aim
is to provide the client with additional social support through the development of an
affirming, emotion-focused relationship over time” (p. 96). They identified 24 studies to
include in their analyses, and found that befriending had a modest effect on depression
symptoms compared with usual care or no treatment, and was less effective than alternate
interventions, including cognitive-behavioral and family-based interventions.
In addition to their somewhat different results, each of these analyses has features
that complicate interpretation. First, in both analyses, treatments that were administered
or facilitated by a mental health professional were included as long as the treatment also
involved some degree of interaction with nonprofessional peers. This makes it difficult
to interpret the extent to which peer support actually had an effect on symptoms,
especially because neither analysis examined professional facilitation as a potential
moderator of treatment outcome. Research has shown (e.g., Toro et al., 1988) that the
presence of a mental health professional in a mutual help group alters the behaviors of
group members and their perceptions of the group.
Second, the inclusion of studies was not comprehensive in either analysis: some
studies were either missed or excluded for reasons that are unclear, and internet-based
interventions were not included in either analysis in spite of other evidence that
computer-based treatments for depression produce effect sizes similar to face-to-face
treatments (e.g., Andersson & Cuijpers, 2009). Additionally, each of these analyses
17
included only controlled studies, yet a substantial minority of the published studies in this
area have been uncontrolled; therefore, many relevant findings were excluded. Although
controlled studies produce effect sizes that are easier to interpret than pre-post effect sizes
(see p. 23), pre-post comparisons may also be informative when uncontrolled studies
comprise much of a body of literature, because there may be systematic differences
between controlled and uncontrolled studies that could affect outcome. Researcher
allegiance is a hypothetical example: community organizations with a strong allegiance
to peer support as a treatment approach may also be less likely than university researchers
to have the resources to conduct controlled studies.
Why peer interventions?
Why might one expect interventions administered by non-therapist peers to
effectively reduce depression symptoms? One reason is that peer-administered
interventions may provide a relationship characterized by an especially high degree of
empathy. Although there is a dearth of evidence regarding the effect of empathy on
depression symptoms in everyday interactions, it has been demonstrated several times
that therapist empathy is associated with better therapy outcome (see Greenberg, Elliot,
Watson, & Bohart, 2001, for a brief meta-analysis), and there is evidence that therapist
empathy causally affects outcome in cognitive therapy for depression (Burns & NolenHoeksema, 1992). This suggests that receiving empathy improves symptoms, at least in
the context of therapy. Yet it is not clear that therapists are better equipped than peers to
provide a high degree of empathy. Hassenstab and colleagues (2007) showed that
18
therapists scored higher than non-therapist control subjects (matched on age, gender,
intelligence, and education level) on self-report measures of cognitive empathy (i.e., the
ability to understand others’ emotions, thoughts, or intentions), but lower than controls on
emotional empathy (i.e., one’s own emotional responses to others’ affective states),
suggesting that therapists may actually sacrifice some empathy in favor of emotional
control.
On the other hand, peers who are identified specifically because of their similar
experiences may be uniquely equipped to provide empathy. Several studies have
demonstrated that individuals are more empathetic in response to others’ distress if they
themselves have experienced a similar distressing situation. For example, Batson and
colleagues (1996) presented subjects with vignettes that described an adolescent
experiencing distress about either severe acne or interpersonal rejection. They found that
women, but not men, reported experiencing more empathy if they had personally
experienced the same type of distress as the adolescent in the vignette they had read.
Others have reported the similar effects in women only across different targets of
empathy, including fear of darkness, fear of abandonment, and loss of a pet (Eklund,
Andersson-Straberg, & Hansen, 2009), and pregnancy (Hodges, Kiel, Kramer, Veach, &
Villanueva, 2010). Furthermore, it appears that therapists are no exception: they reported
experiencing greater empathy in response to fictional clients if they were able to generate
“reference points” from their own experiences that were in some way similar to the
clients’ (Hatcher et al., 2005).
19
The present study
The research summarized above suggests that (1) peers may be especially wellsuited to provide social support characterized by a high degree of empathy; (2) receiving
empathy may reduce depression symptoms; and (3) previous evaluations of research on
peer-support interventions for depression have not been comprehensive and have
included studies that were administered primarily by professionals. These findings
jointly support the need for the present analysis, which meta-analytically aggregates the
published effects of peer-administered interventions on depression symptoms using (a) a
highly specific definition of “peer-administered” that explicitly qualifies any involvement
by health or mental health professionals, and (b) a comprehensive search strategy to
identify studies that have been excluded from previous analyses. Additionally, the
analysis reported here includes examinations of several possible moderators of treatment
outcome, and evaluates the clinical significance of the findings. Finally, the effects (if
any) of administering treatment to peers on well-being will be examined if a sufficient
number of studies report data on this.
20
2. METHOD
Study selection
For this analysis, a peer-administered intervention (PAI) was defined as an
intervention administered by one or more lay individuals who have experienced problems
similar to those being treated, with minimal or no assistance from a health or mental
health professional. Interventions meeting this definition might include a support group
for bereaved spouses facilitated by someone who is widowed; an internet support forum
for depression in which depressed individuals provide support to each other; and a
program in which mothers with prior perinatal depression have regular contact with
mothers who are currently experiencing perinatal depression. Under this definition, it is
acceptable for peer facilitators to have received supervision or training from a
professional in preparation to administer the intervention, and/or ongoing supervision
from a professional during the course of administering the intervention. Interventions
that were primarily administered by a peer, but had a professional present in a supportive
role, were also included in this analysis, and the effect of having a professional present
was examined statistically (see Results section).
To be included in this analysis, studies were required to report outcomes in which:
(a) at least one condition involved a PAI as defined above; (b) depression symptoms were
measured as an outcome; (c) participants were adult outpatients with diagnosed
depression or elevated depression scores on a well-validated symptom measure; and (d)
the report was published in a peer-reviewed journal and was available in English.
Samples were not required to be defined by a diagnosis with depression; studies of
21
interventions for chronic illness populations or other stressed populations could be
included, but only if those samples met minimum clinical cutoffs for probable depression
on well-established measures. In order to allow for the most comprehensive review
possible, uncontrolled and quasi-experimental studies were included if they reported, at a
minimum, pre- and post-treatment outcomes for a PAI condition.
Figure 1 shows the progression of the study selection process. Searches of the
PsycINFO and PubMed online databases were conducted in November 2011 using the
search terms “(Peer OR mutual OR paraprofessional) AND depress*”. The abstract of
each search return was reviewed, and articles that were clearly ineligible (e.g., because
they did not report on an intervention study) were discarded, leaving a list of “potentiallyeligible” studies. Reference sections of these studies were then hand-searched to identify
any additional potentially-eligible studies that the database searches may have missed.
Finally, the complete text of each potentially-eligible study was reviewed and its status as
eligible or ineligible was determined. For articles that reported on an eligible study but
did not include sufficient statistical information to calculate effect size, the needed data
were requested from corresponding authors.
Data extraction
A sample coding form, which lists all coded variables, and the Methodological
Quality Scale, which raters also completed for all studies, are included as Appendices A
and B, respectively.
22
Each study was independently coded by two raters: the author (Bryan) and a
trained undergraduate research assistant. Interrater discrepancies were resolved by
discussion to decide values for the final dataset. Interrater reliability for the continuous
Methodological Quality Score (MQS) was calculated using Spearman’s rank correlation
coefficient (Spearman, 1904), calculated as
i
i
( xi
( xi
x)( yi
x) 2
i
( yi
y)
y) 2
where xi and yi are each rater’s raw ratings converted to ranks. The MQS showed good
interrater reliability (ρ = .82).
Interrater reliability for categorical variables was calculated using Cohen’s kappa
(Cohen, 1960), calculated as
Pr(a) Pr(e)
1 Pr(e)
where Pr(a) is the relative observed agreement among raters, and Pr(e) is the hypothetical
probability of chance agreement. Kappas for categorical variables were all at least
moderately high, ranging from .71-.89.
Effect size calculation
Standardized mean gains. Pre-post effect sizes (standardized mean gains, or
SMGs) were calculated for all PAI conditions in controlled and uncontrolled studies,
defined as
23
X T2
ES sg
X T1
sp
ES sg2
2(1 r )
n
SEsg
,
2n
,
1
,
SEsg2
wsg
where X T 1 is the mean at Time 1, X T 2 is the mean at Time 2, n is the common sample
size at Time 1 and Time 2, r is the correlation between Time 1 and Time 2 scores, and
s p is the pooled standard deviations of Time 1 and Time 2 means, calculated as
sp
s 2T 1
s 2T 2 / 2
As Westen and Morrison (2001) note, interpretation of the SMG is limited for many of
the same reasons as its components: effects of the treatment are likely to be correlated
with placebo effects, the effects of time’s passage, and the effects of common factors. At
the same time, it may be useful to compare SMGs of PAI conditions with pre-post effect
sizes from other published meta-analyses of empirically supported therapies for
depression – interpreting these comparisons judiciously, of course.
Standarized mean differences. For studies that included randomized control
groups, between-groups effect sizes were calculated as standardized mean differences,
defined as (Lipsey & Wilson, 2001)
ES sm
X G1
X G2
sp
,
24
where X G1 the posttest mean for Group 1 (comparison group in this analysis), X G 2 is
the posttest mean for Group 2 (PAI group in this analysis), and s p is the pooled posttest
standard deviations of the two group means, defined as
sp
2
(ntx 1) s tx2 (nctrl 1) s ctrl
.
(ntx 1) (nctrl 1)
Because standardized mean differences tend to be upwardly biased in samples of fewer
than 20 participants (Hedges, 1981), all standardized mean differences were corrected
using Hedges’ correction factor, and these corrected (unbiased) effect size estimates were
used in all analyses:
'
ES sm
SEsm
1
3
ES sm ,
4N 9
n G1 n G 2
n G1 n G 2
wsm
'
( ES sm
)2
,
2( n G 1 n G 2 )
1
2
SEsm
For this analysis, two sets of SMDs were calculated: those comparing a PAI
condition to another active intervention (e.g., a professionally-administered
psychotherapy group), and those comparing a PAI condition to a treatment-as-usual
(TAU) or wait-list (WL) condition. The decision to combine TAU and WL control
conditions into a single meta-analytic comparator was based on the observation that the
distinction between the two categories was often blurry or impossible to make in a
25
systematic way. For example, participants in chronic illness samples almost invariably
continued to receive standard medical treatment during the course of the study, even if
they were in a wait-list condition for the PAI or other psychosocial interventions; these
standard medical treatments may or may not have involved “therapeutic” interactions
with health care professionals. Similarly, across sample types, conditions described as
“wait-list control” often included periodic check-ins or “case management” visits that
may have had therapeutic value. Conversely, in conditions described as TAU or
“standard care,” often this simply meant that ongoing care of some kind was available to
participants, not necessarily that it was utilized during the study period. The difficulty of
distinguishing between TAU and WL conditions in this sample was corroborated by low
interrater reliability between the two (κ = .25). Thus, rather than trying to categorize
based on incomplete data or arbitrary differences, the two were combined as a single
category. This had the added benefit of increasing statistical power for comparisons
between PAIs and TAU/WL conditions.
Weighted mean effect size calculation
In order to summarize effect sizes across studies, weighted mean effect size
estimates were calculated separately for between-groups ESs (SMDs) and within-groups
ESs (SMGs) using the formula
ES
( wi ES i )
wi
26
where wi is the inverse variance weight for ESi. Upper and lower limits of the 95%
confidence intervals around mean effect sizes were calculated as:
ES
z (1
)
(SEES )
where SEES is the standard error of the mean effect size, calculated as
1
.
wi
SEES
To determine whether the effect sizes included in these means all estimate the same
population effect size (e.g., Hedges, 1982, Rosenthal & Rubin, 1982), the heterogeneity
statistic Q was computed for each mean effect size:
Q (
2
i
wi ES )
(
wi ES i ) 2
wi
Q falls in a chi-square distribution with k – 1 degrees of freedom, where k is the number
of effect sizes included in its calculation (Hedges & Olkin, 1985). If Q is significant for a
given mean effect size (i.e., the null hypothesis of homogeneity is rejected), this indicates
that the dispersion of effect sizes around that mean is greater than would be expected
from sampling error alone (e.g., that differences in particular study characteristics
account for some of the variance in effect size). In these cases, it may be appropriate to
use random effects models, which use an adjusted inverse variance weight based on the
assumption that in addition to subject-level sampling error, there is an additional
randomly-distributed study-level source of error.
27
Moderation analysis
First, fixed-effects (ordinary least squares) models were calculated for all
hypothesized moderators using David Wilson’s MetaReg and MetaF macros for SPSS
(Wilson, 2010). Fixed-effects models partition effect-size variance into two categories:
between-groups variance (i.e., variance explained by the moderator), and within-group
variance (i.e., residual unexplained variance, which is assumed to be subject-level
sampling error). If Qw, the heterogeneity statistic for the residual variance, is significant,
this indicates that modeling only systematic variance and subject-level sampling error (as
in a fixed-effects model) is insufficient (i.e., fails to meet the model’s assumptions that
these are the only sources of variability), and a mixed-effects model may be appropriate
(Lipsey & Wilson, 2001).
Because fixed-effects models produced significant Qw values for all of the mean
effect sizes (see Results section), we turned to mixed-effects models using method-ofmoments estimation procedures, which Wilson’s (2010) macros can also calculate.
Mixed-effects models assume “that there is a remaining unmeasured (and possibly
unmeasurable) random effect in the effect size distribution in addition to sampling error.
That is, variability in the effect size distribution is attributed to systematic (modeled)
between-study differences, subject-level sampling error, and an additional random
component” (Lipsey & Wilson, 2001, p. 124). Mixed-effects models have lower
statistical power to detect moderation effects and result in wider confidence intervals than
fixed-effects models; they are, however, less prone to Type I errors, and their
assumptions are not violated by heterogeneity as the assumptions of fixed-effects models
28
are (Lipsey & Wilson, 2001). Thus, they were deemed more appropriate for testing
moderation in this set of data.
Alternate metrics of outcome
In addition to conventional effect size estimates, two alternate metrics of outcome
were examined to the extent that they were reported or available in order to determine
whether any treatment effects were clinically (not just statistically) significant. Clinical
significance has been defined as “the extent to which therapy moves someone outside the
range of the dysfunctional population or within the range of the functional population”
(Jacobson & Truax, 1991, p. 12), and is an important consideration when determining
whether detected effects represent meaningful changes in symptomology. First,
percentage of participants considered “improved” at treatment termination was coded
when reported. Second, posttreatment symptomology (i.e., mean scores on outcome
measures) was compared to published or widely-used guidelines for what constitutes
clinical elevation on the respective measures of depression symptoms.
Follow-up data
When available, follow-up data were examined to evaluate the durability of
treatment effects. SMGs and SMDs were calculated in the same manner as for pre-post
data (with SMGs indicating effect size between post-test and follow-up) When a study
29
reported on multiple follow-up time points (e.g., three- and six-month follow-ups), data
from the last available (e.g., the six-month follow-up) were used here.
For each set of follow-up effect sizes, two variables were tested as possible
moderators using the same procedures as described previously. First, the length of the
follow-up period was tested, as it was possible that shorter follow-ups would indicate
better maintenance simply because there was less time for symptoms to recur (and not
because interventions had a particularly enduring effect). Second, effect size at
posttreatment was tested as a moderator to determine whether PAIs that were more
effective in the short-term also produced more enduring benefits.
Publication bias
Several methods were considered for examining publication bias. The two most
commonly-reported indicators of publication bias, the funnel plot and the fail-safe N,
each have features that make reliable and valid interpretation difficult (Sutton, 2009),
rendering them less-than-ideal as standalone indicators. Funnel plots (Light & Pillemer,
1984) are scatterplots of effect size on the x-axis and standard error on the y-axis; in data
without publication bias, these plots are generally pyramid-shaped and symmetrical.
Interpretation of funnel plots typically relies on visual inspection for asymmetry.
Although funnel plots can be useful for developing hypotheses about publication bias,
interpreting them objectively and/or quantitatively presents challenges; furthermore,
when study characteristics moderate effect sizes, funnel plots can be misleading because
they effectively superimpose effect sizes drawn from multiple populations (Vevea &
30
Woods, 2005). The fail-safe N (Rosenthal, 1979; Orwin, 1983) is “the number of studies
with an effect size of zero needed to reduce the mean effect size to a specified or criterion
level” (Lipsey & Wilson, 2001, p. 166). This method has been criticized for its
assumption that all missing studies have effect sizes of zero, which is unlikely to be
reliably true (Vevea & Woods, 2005). Additionally, there is no theoretically-based
criterion for determining a threshold number of missing studies as indicative of
publication bias, and such a number may not be particularly informative anyway; as Field
and Gillett (2010) write, “…any fail-safe N method addresses the wrong question: it is
usually more interesting to know the bias in the data one has and to correct for it than to
know how many studies would be needed to reverse a conclusion” (p. 686). One
commonly-used method for correcting for publication bias, the iterative “trim-and-fill”
technique (Duval & Tweedie, 2000), produces a mean effect size corrected for the
number of studies estimated to be missing; this method, however, can lead to
overcorrection, particularly in datasets that contain heterogeneity, because it assumes that
all of the missing studies are those with the smallest effect sizes (Vevea & Woods, 2005).
Following the recommendation of Field and Gillett (2010), the analysis reported
here utilized Vevea and Woods’s (2005) weight-function sensitivity analysis to estimate
and correct for publication bias. Generally speaking, weight-function models estimate
the influence of different study characteristics on the likelihood of publication and make
corrections that are weighted according to these influences, thus avoiding the assumption
that missing studies are simply those with the smallest effect sizes. Typically, weightfunction models require very large samples sizes (k > 100; e.g., Vevea & Hedges, 1995),
31
but Vevea and Woods (2005) offer a modification for use with smaller-scale metaanalyses that “impose[s] a set of fixed weights determined a priori and chosen to
represent a specific form and severity of biased selection” (p. 432). This procedure
provides corrected effect size estimates for four different models of selection bias,
allowing the user to judge the most likely conditions of publication bias in his data (e.g.,
based on visual inspection of funnel plots) and select the model that most closely
represents those conditions. The models reported in this analysis were calculated using
the macros provided by Field and Gillett (2010).
32
3. RESULTS
Study characteristics
Search procedures identified 26 studies meeting inclusion criteria for which
sufficient data to calculate at least one effect size were either reported or obtainable from
authors. Included studies were published between 1988 and 2011 and reported on a total
of 29 PAI conditions. Twenty studies included at least one control group; in two of these
studies, however, participants were not randomly assigned to conditions, and only prepost data from the PAI conditions from those studies were included in this analysis. In
all, this yielded 30 effect sizes of PAI compared with another condition, and 29 pre-post
PAI effect sizes.
In all included studies, participants’ average pre-treatment depression scores met
minimum clinical cutoffs for likely diagnosable depression. In nine studies, elevation in
depression symptoms was the primary defining feature of the sample. Eleven studies’
samples were defined primarily as having a chronic illness, and the remaining six were
identified primarily by the experience of a severe psychosocial stressor. Additional study
characteristics are summarized in Table 1.
Depression measurement
Table 2 describes the measures of depression symptoms that were used in
included studies, including frequency of use and threshold scores for probable
depression.
33
Observed effect sizes
Results of included studies are summarized in Table 3. Effect sizes are shown in
Figures 2-4.
Standardized mean gains (pre-post effect sizes). It should be noted that, although
improvement in depression symptoms is indicated by a decreased score on all included
depression measures, for the sake of consistency all signs were reversed so that a positive
SMG indicates that symptoms improved from pre- to post-treatment.
The mean SMG for all PAI conditions using a fixed-effects model was .2194 (N =
29, p < .0001, 95% CI: .2003 - .2385), indicating that on average, depression scores
decreased between pre- and post-treatment in PAI conditions. However, the
heterogeneity statistic was highly significant (Q = 890.84, df = 28, p < .0001), suggesting
that the effect sizes comprising this mean do not all estimate the same population mean.
Using a random-effects model to account for the heterogeneity increased the estimated
mean SMG to .4554 (p < .0001, 95% CI: .3391 - .5717). As a standalone effect size
indicator, this estimate must be interpreted cautiously.
Standardized mean differences (between-groups effect sizes). It should be noted
that the PAI condition was considered “Group 2” for all SMD calculations; therefore, a
positive SMD indicates that PAI conditions produced fewer symptoms (i.e., better
outcomes) at post-treatment than control conditions.
Using a fixed-effects model, the mean SMD comparing PAI conditions to active
control conditions (i.e., other, non-PAI therapies) was .0306 (N = 13, p = .6664, 95% CI:
-.1085 - .1696), indicating that posttreatment depression scores were not significantly
34
different between PAI and active-control conditions. The heterogeneity statistic was
significant (Q = 30.15, df = 12, p = .0027), suggesting greater variability than would be
expected from sampling error if all effect sizes were estimates of the same population
mean; thus, a random-effects model may be more accurate. The mean PAI-vs-active
control SMD using a random-effects model was .0848 (p = .4706, 95% CI: -.1455 .3151), indicating that the effect size did not achieve statistical significance even after
accounting for the heterogeneity in the data.
Using a fixed-effects model, the mean SMD comparing PAI conditions to
TAU/WL conditions was .0913 (N = 17, p = .0235, 95% CI: .0123 - .1703), indicating
that posttreatment depression scores were significantly lower in PAI conditions than in
TAU/WL conditions. Again, the heterogeneity statistic was significant (Q = 72.88, df =
16, p < .0001), suggesting greater variability than would be expected from sampling error
if all effect sizes were estimates of the same population mean; thus, a random-effects
model may be more accurate. The mean PAI-vs-TAU/WL SMD using a random-effects
model was .0978 (p = .2966, 95% CI: -.0859 - .2815), indicating that the effect size did
not maintain statistical significance after accounting for heterogeneity. Given the
direction and small magnitude of the difference in effect size between the fixed- and
random-effects models, this discrepancy in statistical significance is more likely an
artifact of low statistical power than a meaningful indicator about the effect itself; that is,
although the heterogeneity in the data was statistically significant, its impact on effect
size was insufficient to support the statistical power demands of the random-effects
procedure.
35
In summary, these mean effect size estimates suggest that PAI interventions
produced significant reductions in depression symptoms from pre- to post-test, performed
similarly to the other active therapies to which they were compared, and did not
significantly outperform TAU/WL conditions. The significant heterogeneity in all three
estimates, however, suggests the presence of other significant sources of variability (e.g.,
moderators) that need to be considered in a valid interpretation.
Moderation models
Hypothesized moderators were PAI format (group or one-on-one), professional
coadministration of PAI (yes or no), sample type (depression, stressor, or chronic illness),
mutuality (mutual support or one person supporting another), training of peer
administrators (yes or no), professional supervision of peer administrators (yes or no),
PAI content (supportive or psychoeducational), PAI delivery mode (face-to-face,
telephone, or internet), PAI meeting frequency (weekly, more than weekly, or less than
weekly), blindness of assessments (yes or no), researcher allegiance (for or against PAI),
and total methodological quality score (MQS). Each of these constructs was tested as a
possible moderator of SMGs and SMDs, with two exceptions: researcher allegiance, for
which there was insufficient variability to test for moderation (i.e., all but two studies
expressed allegiance to PAI); and blindness of assessments, for which there were too
many missing data to test for moderation (only 56% of controlled studies reported on
blinding).
36
Results of mixed-effects moderation models are displayed in Figures 5-9. Prepost (SMG) models revealed a significant moderation effect of professional
coadministration, such that PAIs that were coadministered by health or mental health
professionals were less effective in reducing depression symptoms between pre- and posttest than PAIs that were entirely peer-administered. There was also a significant omnibus
test for supervision, but this is unlikely to be meaningful since post-hoc tests indicated
that the significant difference was between non-supervised groups and groups in which
supervision status was unspecified. Finally, there was an almost-significant (p = .0546)
omnibus test for sample type, such that pre-post improvements were greater for
depression samples than for stressor samples (with neither differing significantly from
chronic illness samples). There were no other variables that significantly moderated
SMG.
Mean SMD models comparing PAIs with active comparison conditions indicated
a significant moderation effect of PAI content, such that PAIs whose content was
primarily educational performed better than their active comparison groups, whereas
PAIs whose content was primarily supportive did not perform significantly differently
than their active comparison groups. There was also a moderating effect approaching
significance of meeting frequency (p = .0981), such that PAIs meeting weekly performed
better than their active comparison conditions, whereas PAIs with variable or unspecified
meeting frequencies performed no differently than their active comparison conditions.
There were no other significant moderators of SMDs between PAI and active control
conditions.
37
There were no significant moderators of SMDs between PAI and TAU/WL
comparison conditions.
Follow-up effect sizes
Eight studies reported sufficient follow-up data to calculate at least one effect
size. (A ninth study [Tudiver et al., 1992] reported follow-up depression scores, but no
standard deviations or other estimate of variability was available, precluding the
calculation of an effect size.) This resulted in eight follow-up SMGs comparing
outcomes at post-treatment to outcomes at follow-up, and nine follow-up SMDs (five
comparing PAI to active comparison, four comparing PAI to TAU/WL). Follow-up
periods ranged from 3-12 months (M = 6.89, SD = 3.62).
The mean SMG from post-treatment to follow-up was .0016 (p = .99), suggesting
that improvements in depression symptoms were maintained (but did not significantly
increase) after treatment ended. This effect was not moderated either by follow-up length
(p = .9214) or post-treatment SMG (p = .9501).
The mean SMD between PAI and active comparison conditions was .0477 (p =
.6322), indicating that PAI and active comparisons remained equally effective at followup. This effect was not significantly moderated either by follow-up length (p = .3922) or
by post-treatment SMD (p = .9777).
The mean SMD between PAI and TAU/WL conditions was .1226 (p = .0744),
which approached significance and, if significant, would indicate that PAI conditions had
maintained gains better (or deteriorated less) at follow-up compared with TAU/WL
38
conditions. This effect was not significantly moderated by follow-up length (p = .6505),
but it was significantly moderated by SMD at posttest (p = .0012), such that PAIs that
had performed better than TAU/WL comparisons at posttest also performed better at
follow-up.
Outcomes using alternate metrics
Only one of the 26 studies included in this meta-analysis (Bright et al., 1999)
reported an estimate of the percentage of participants who were considered improved
after treatment, precluding the aggregated use of this metric as an indicator of clinical
significance across studies.
Another indicator of clinical significance is whether post-treatment depression
scores fell below the thresholds for clinical elevation for the measures utilized. (Table 2
provides the elevation threshold for each measure included in this analysis.) In theory, at
least, falling below such a threshold would mean that participants improved enough that
their symptom levels at post-treatment were more similar to non-clinical than clinical
population means. Of the 29 PAI conditions for which they were available, posttreatment depression means fell below the elevation threshold for 10 conditions,
indicating that in almost two-thirds of the PAI conditions studied, participants were still
experiencing significant symptoms after receiving treatment. It should be noted that most
of these post-treatment means were well within one standard deviation of their thresholds
(and in many cases, fell mere fractions of points above or below); furthermore, this
indicator is only informative to the extent that variability is low (i.e., that a condition’s
39
mean outcome is a good representation of a typical participant’s outcome in that
condition). Therefore, it should be interpreted cautiously. To the extent that it is
accurate, it suggests that PAIs alone may not have been sufficient – at least in the doses
provided – to reduce symptoms to below a clinical level.
Impact of providing peer support
Only one of the studies included in this meta-analysis (Giese-Davis et al., 2006)
reported on any indicators of well-being for peers providing intervention. In that study,
peer “Navigators” (breast cancer survivors) who provided support to women newly
diagnosed with breast cancer did not show changes in depression symptoms or well-being
over the course of the study (though they did indicate increased dissatisfaction with their
medical teams). The lack of other outcome measures for peer supporters ruled out
aggregate analysis of the impact on peers of providing support.
Publication bias
Visual inspection of the funnel plot for PAI-vs-active therapy effect sizes (Figure
10) suggests a lack, but not a complete absence, of effect sizes on the left side of the
funnel (i.e., negative effect sizes, those that would indicate superiority of other therapy
over PAI). Therefore, the moderate one-tailed selection bias model (Table 4) was the
most likely to fit the data, and indicated that adjusting for publication bias would produce
a mean SMD of -.017 using a fixed-effects model (compared with an unadjusted SMD of
.031), or a mean SMD of -.014 using a random-effects model (compared with an
unadjusted SMD of .083). These adjustments suggest that the true mean SMD may be
40
closer to zero than the observed mean SMD. The observed mean SMD, however, was
not statistically different from zero; thus, the finding does not change when publication
bias is accounted for.
Visual inspection of the funnel plot for PAI vs TAU/WL conditions (Figure 11)
suggests a possible, but not dramatic, lack of effect sizes on the far left side of the funnel
(i.e., negative effect sizes, those that would indicate superiority of TAU/WL over PAI).
Therefore, here too the moderate one-tailed selection bias model (Table 5) was the most
likely to fit the data, and indicated that adjusting for publication bias would produce a
mean SMD of .061 using a fixed-effects model (compared with an unadjusted SMD of
.091), or a mean SMD of .006 using a random-effects model (compared with an
unadjusted SMD of .102). These adjustments suggest that the true mean SMD is closer
to zero than the observed mean SMD (possibly enough closer that it would not be
statistically different from zero).
In summary, it appears that there was a modest degree of publication bias in this
dataset, such that studies showing poorer outcomes for PAIs were less likely to be
published. Adjusting for this bias did not change the interpretation of comparisons
between PAIs and active therapy conditions; the outcomes remained statistically
equivalent. Comparisons between PAI and TAU/WL conditions also remained
equivalent after this adjustment. Therefore, the practical impact of this adjustment is
minimal (see Discussion, pp. 43-44).
41
4. DISCUSSION
Summary and interpretation of findings
The meta-analysis reported here examined the effects of peer-administered
interventions (PAIs) on symptoms of depression, defined as interventions administered
by one or more lay individuals who have experienced problems similar to those being
treated, with minimal or no assistance from a health or mental health professional. This
is the first analysis to comprehensively aggregate outcomes from studies of PAIs for
depression; prior meta-analyses of peer interventions for depression have suffered from
incomplete identification/inclusion of eligible studies and from overly-broad inclusion
criteria (e.g., including interventions that were primarily administered by professionals).
Effect size comparisons indicated that PAIs reduced depression symptoms comparably to
other active interventions (e.g., professionally-administered psychotherapies), but did not
significantly outperform TAU/WL conditions. Overall, these findings suggest that PAIs
may be effective, cost-efficient treatments for depression; their effectiveness, however, is
not unambiguous.
This is also the first meta-analysis to examine potential moderators of PAI
outcomes. Moderation models revealed that PAIs that were co-administered by health or
mental health professionals were less effective in reducing depression symptoms between
pre- and post-treatment than PAIs that were entirely peer-administered. Contrary to the
intuitive perspective that additional expertise should carry additional benefit, this finding
suggests that a “pure” approach to peer intervention may be preferable. It is possible that
the involvement of a professional undermined an important mechanism of peer change;
42
for example, perhaps peers were seen as less knowledgeable when a professional coadministrator was also present, reducing their impact as strong, capable role models of
successful recovery. Given that this moderation effect pertained to pre-post comparisons,
it is also possible that it was an artifact of a third variable unrelated to mechanisms of
therapeutic change, such as researcher allegiance. For example, researchers who
included professionals in their “peer” interventions may have started out with low
confidence in the abilities of peers to be independently therapeutic. Alternately,
researchers who knew that professionals would be involved may have seen less value in
dedicating time and resources toward preparing peers to intervene effectively.
There was also a moderating effect of intervention content, such that PAIs that
were psychoeducational or skills-based (e.g., teaching basic cognitive-behavioral skills)
performed better than other active therapies to which they were compared, whereas PAIs
that were primarily supportive did not perform differently than other active therapies.
There are several plausible interpretations of this finding, including: (1) that gaining
education and skills directly impacted depression symptoms more than receiving support;
(2) that peers administering educational/skills-based interventions were equally
supportive and empathic as those delivering primarily-supportive PAIs, resulting in
additive effects of skills and support; or (3) that peers delivering educational/skills-based
interventions were more likely to have undergone a brief preparatory training program to
prepare them for their roles, which might have resulted in a variety of advantages,
including higher confidence and better interpersonal skills.
43
It is also notable that PAIs performed similarly to other active therapies, yet did
not outperform TAU/WL conditions. Initially, this might seem to suggest that neither
PAIs nor active therapy comparisons were especially helpful. However, examination of
standardized mean gains (pre-post ESs) indicates otherwise: it appears that symptoms
tended to improve substantially between pre- and post-treatment not only for PAIs and
other active therapies, but also for participants in TAU/WL conditions (SMGPAI = .46;
SMGACTCTRL = .55; SMGTAU/WL = .32). This suggests that the failure of PAIs to perform
much better than TAU/WL conditions may be because participants improved regardless
of condition. One possible explanation for the high degree of improvement among
TAU/WL participants is that a substantial minority of included studies involved
depressed chronically ill populations, whose “usual” or “standard” care was likely to
involve regular contact with medical professionals. These contacts may, in some cases,
have been psychologically therapeutic, even if that was not their intended function. A
second explanation pertains to the fact that these analyses are based on study completers;
participants who remained in a study despite being assigned to a control condition may
have been highly motivated to change and/or less distressed to begin with, and thus likely
to improve whether or not they received treatment.
Publication bias appeared to be modestly present in the data, such that betweengroups comparisons that did not favor PAIs were somewhat less likely to appear in the
published literature. When effect sizes were adjusted for the likely degree of publication
bias, PAIs and other active therapies maintained their equivalent effectiveness, and the
difference in outcome between PAI and TAU/WL conditions remained non-significant.
44
Therefore, although it is notable that there was bias in the data, it posed little
complication to the interpretation of the findings.
Clinical significance of findings
The mean pre-post effect size (standardized mean gain) observed across PAI
conditions was statistically greater than zero, but statistical significance does not
necessarily imply clinical significance. What, if anything, does that number say about the
extent to which PAI participants experienced meaningful symptom reduction? There are
multiple ways to address this question.
First, we can calculate how an effect size translates to score reductions on the
symptom measures that were employed. Using the pooled standard deviation observed in
this sample of studies, the .4554 SMG is equivalent to a reduction of 3.54 points on the
BDI-II, or 4.41 points on the CES-D (by far the two most commonly-used measures in
this analysis). To put those numbers in context, we can consider them as minimal
proportions of the threshold for clinical elevation. For the BDI-II, scores ≥14 were
considered elevated; thus, a 3.54-point reduction would constitute a 25.3% minimum
reduction in symptoms. For the CES-D, scores >15 were considered elevated; thus, a
4.41-point reduction would constitute a minimum 29.4% reduction in symptoms. This
provides a conservative estimate of proportional score reductions for PAI participants
(since symptom means often exceeded the threshold considerably). Still, these average
raw score reductions are small in absolute terms – that is, they represent relatively small
changes in actual symptom presentation.
45
Second, we can use as benchmarks the pre-post effect sizes observed in recentlypublished meta-analyses of well-established, empirically-supported therapies for
depression. Large standardized mean gains have been observed in meta-analyses of a
variety of efficacious and possibly-efficacious (Chambless & Hollon, 1998) therapies for
adult unipolar depression, including but not limited to cognitive-behavioral therapy (d =
1.06; Hans & Hiller, 2013), short-term psychodynamic psychotherapy (d = 1.34;
Driessen, Cuijpers, Maat, Abbass, de Jonghe, & Dekker, 2010), and acceptance and
commitment therapy (Hedges’ g = .59; Hofmann, Sawyer, Witt, & Oh, 2010). These
previously-observed standardized mean gains are more than twice those observed in this
sample of PAIs. These comparisons suggest that although PAI participants experience
reduced symptoms, these symptom reductions are inferior to those seen in other, wellestablished psychotherapies for depression.
Methodological limitations
A significant limitation of this analysis was the heterogeneity of the interventions
falling under the PAI umbrella. All had the common feature of being peer-administered,
but they differed substantially in structure, duration, and content. Although every attempt
was made to code these differences and examine their moderating effects, it is possible
that some important moderators were missed or unavailable, or that significant
moderation effects did not achieve significance because of low statistical power.
Furthermore, the lack of clear distinction between TAU and WL conditions, and the
46
resulting need to combine them as a single comparison category are less than ideal, as
there was substantial variety in these conditions.
Another limitation was the lack of available data on potential moderators and
mediators of PAI outcomes. Although methodological features were coded and tested as
moderators, participant-level variables beyond basic demographics were largely
unavailable, let alone measured consistently enough across studies to test metaanalytically. For example, race and ethnicity were too inconsistently reported in these
studies to be tested as moderators, yet it is probable that racial and cultural factors impact
clients’ views on help-seeking generally, and their attitudes toward utilizing peers versus
professionals to manage distress (e.g., Diala, Muntaner, Walrath, Nickerson, LaVeist, &
Leaf, 2001; Sheu & Sedlacek, 2004; Townes, Chavez-Korell, & Cunningham, 2009;
Chen & Mak, 2008). Mechanisms of action underlying PAIs’ effectiveness were, for the
most part, neither tested nor speculated upon in this literature. One obvious example is
empathy: the capacity of peers to empathize with similar others’ distress is a
hypothesized process by which PAIs might be effective, yet too few studies measured
empathy or any other relationship or process-level variables to examine this hypothesis.
Thus, the currently-available literature largely leaves us to speculate about how, why, and
for whom PAIs are beneficial.
Future directions
Given the limitations details above, the literature on PAIs for depression would
benefit from future research examining for whom, under what circumstances, and via
47
what processes PAIs are helpful for depression. For example, in this analysis, there were
not statistically significant pre-post or between-groups differences in outcome for the
different sample types (depressed, stressed, or chronically ill), but there was a consistent
pattern of smaller effect sizes for stressor samples that would be worth investigating
further with a larger sample size. Furthermore, from a practical standpoint, it would be
useful to know whether PAIs function best as standalone interventions or as adjuncts to
traditional treatments (and, if as adjuncts, whether best delivered concomitantly or as
relapse prevention). Finally, in addition to empathy, other likely mediating processes
might include recovery modeling, the provision of hope, and specific techniques or
coping strategies.
Another distinction remains to be addressed. PAIs may be (1) treatments that are
already supported as professionally-administered interventions, but now administered by
peers; or (2) interventions that are specifically designed for peers to administer, without
prior empirical support as professionally-administered interventions. The former may be
more justifiable in terms of research funding; at the same time, interventions designed
specifically for peers may have therapeutic qualities that traditional interventions miss.
This distinction is worthy of attention in future research.
Another area of interest for future research is the effect on peers of delivering a
PAI. Ideally, helping others with problems one has personally overcome would reinforce
recovery or at least provide a subjectively positive experience. There is some evidence
that peer counselors have experienced increased well-being, increased confidence, and
decreases in mild depression symptoms (e.g., Garcia, Metha, Perfect, & McWhirter,
48
1997; de Vries & Petty, 1992; Byrd, 1984), but this evidence is meager and
methodologically limited. Moreover, we know little about how individual differences
among peer supporters might influence these outcomes. For example, for peers who are
currently depressed or at higher risk for relapse, spending time with other depressed
individuals and exerting energy in a helping capacity could be emotionally draining,
increase the probability of relapse, or prolong symptoms. On the other hand, for those
with solid, durable recovery experiences, supporting peers may provide gratification,
connectedness, a sense of purpose, or other positive emotional states that could buffer
against relapse and improve quality of life.
49
FIGURES
Figure 1: Flow of studies
Search terms:
“(Peer OR mutual OR paraprofessional) AND depress*”
(not unique)
PsycINFO:
3344 results
Medline:
1491 results
112 potentiallyeligible studies for
review from
database searches
4723 results excluded
because non-eligibility
was clear from
abstract (e.g., not an
empirical study or not
an intervention study)
+
33 potentiallyeligible studies for
review from
hand-search of
reference sections
145 potentiallyeligible studies
26 studies met
inclusion criteria
20*
controlled
111 studies excluded
because participants
or intervention did
not meet inclusion
criteria
8 studies excluded
because insufficient
data were available
6
uncontrolled
*Includes two non-randomized controlled studies, which
were not used to calculate between-groups differences.
50
Figure 2: Forest plot of effect sizes comparing PAI to active therapy conditions
51
Figure 3: Forest plot of effect sizes comparing PAI to TAU/WL conditions
52
Figure 4: Forest plot of effect sizes of PAI conditions (change from pre- to post-treatment)
53
Figure 5: Moderators of standardized mean gain (SMG) for PAI conditions (change from pre- to post-treatment)
Darker bars denote effect sizes that are significantly different from zero.
** p<.01
* p<.05
† p<.1
54
Figure 6: Moderators of standardized mean difference (SMD) comparing PAI to active therapy conditions
Darker bars denote effect sizes that are significantly different from zero.
** p<.01
* p<.05
† p<.1
55
Figure 7: Moderators of standardized mean difference (SMD) comparing PAI to TAU/WL conditions
Darker bars denote effect sizes that are significantly different from zero.
** p<.01
* p<.05
† p<.1
56
Figure 8: Methodological quality is not significantly associated with pre-post effect size
57
Figure 9a: Methodological quality is not significantly associated with between-groups
effect size (PAI compared with active therapy conditions)
58
Figure 9b: Methodological quality is not significantly associated with between-groups
effect size (PAI compared with TAU/WL)
59
Figure 10: Funnel plot of effect sizes comparing PAI to active therapy conditions
60
Figure 11: Funnel plot of effect sizes comparing PAI to TAU/WL conditions
61
TABLES
Table 1: Descriptive characteristics of clinical trials involving PAIs
1st Author,
Year
Setting/ format
Inclusion criteria
Depression
measure*
PAI(s)
Control
condition(s)
Duration of
intervention
Andersson,
2005
Internet/ group
BDI
Moderated online
discussion group;
therapist-monitored
Online self-help
CBT program +
moderated online
discussion group
M = 10
weeks
Bright,
1999
Outpatient
psychology
dept clinic/
group
facilitated by
prior
participants in a
communitybased self-help
org
Community
mutual aid
organization/
group
Swedish adults
with likely mildto-moderate
unipolar
depression (per
CIDI, and
MADRS-S score
15-30)
Memphis area
adults scoring ≥10
on HRSD and
meeting SCID-NP
criteria for MDE,
dysthymia, or dep
NOS
HRSD
(1) Mutual support
group led by
paraprofessional
(1) CBT group led
by professional
Hong Kong adults
who were still
symptomatic after
a 12-week group
CBT program
CES-D:
Chinese
version
Mutual-aid groups
of about 8
members each
Immediately
postpartum
Ontario women
scoring >9 on
EPDS
EPDS
Usual postpartum
care plus peer
telephone support
Cheung,
2000
Dennis,
2009
Ontario public
health
departments
(hospitals)/ 1:1
telephone with
[BDI]
(2) CBT group led
by paraprofessional
[SCID]
Follow up?
(months
post tx end)
6 months
MQS
10 weekly
90-minute
sessions
None
reported
5
Monthly
meetings over
12 months
(mean #
attended not
reported)
Minimum of
4 contacts; M
= 8.8 contacts
over 12
weeks
None
reported
2
3 months
(note that
29% had
continued
contact
10
8
(2) Mutual support
group led by
paraprofessional
Usual postpartum
care only
62
GieseDavis,
2006
peer w/selfreported hx of
and recovery
from PPD
Not specified/
1:1 telephone
(primarily) with
peer
“Navigator”
survivor of
several years
Not specified/
group
between
posttx and
FU)
California adult
women with
recent b.c. dx (M
= 2.23 mo postdx)
CES-D
Supportive and
educational peer
counseling
San Francisco
area adults
receiving care for
symptomatic
HIV/AIDS
CES-D
National c.b.
organization/
1:1 telephone
with peer w/ hx
of b.c.
recurrence
Not specified/
1:1 telephone
with another
participant
Adults
experiencing a
first recurrence of
b.c. (dx’ed in last
56 days)
CES-D
Group (10-15
participants)
educational
intervention taught
by teams of pair
leaders (1 with
HIV/AIDS)
Peer-delivered
telephone support
Indiana area, lowsupport,
community living,
elderly women
CES-D
Hill,
2006
Internet/ group
forum
Western US adult
rural-living
women with
chronic illness
CES-D
Letourneau,
2011
Home-based
and university/
1:1 in-person
Canadian adult
postpartum
women caring for
EPDS
Gifford,
1998
Gotay,
2007
Heller,
1991
3-6 months of
contact 14x/week (M
= 4 months
None
reported
3
Usual medical care
7 weekly
sessions / 16
total hours
None
reported
7
Usual medical care
3 months/ 4-8
sessions
None
reported
7
(1) Peer contact
(dyad initiator)
(1) Supportive staff
contact
20-30 weeks/
1-2 contacts
per week
None
reported
5
(2) Peer contact
(dyad receiver)
Online,
asynchronous,
peer-led support
group and
educational units
Peer support and
maternal infant
interaction program
(2) Assessment
only
No treatment
22 weeks
None
reported
4
Waiting list
(received 2 weeks
of peer support
12 weeks/ M
= 9 visits
and/or phone
None
reported
7
63
Lieberman,
2005
and telephone
with peer
w/self-reported
hx of and
recovery from
PPD
Internet/ group
Lieberman,
2006
Internet/ group
Lorig,
2009
Community
settings/ group
led by peers
with type-2
diabetes
Outpatient
medical clinic/
group led by
peers with
depression
experience
Ludman,
2007
Marmar,
1988
Not specified/
group led by
peer women
who had lost
husbands
several years
an infant <9
months old,
scoring >12 on
EPDS
after a 12-week
waiting period)
New members to
online selfdirected breast
cancer support
forums
New members to
online selfdirected breast
cancer support
forums
San Francisco
area adults with
type-2 diabetes
CES-D
Washington state
HMO adult
patients with
persistent
depression (SCL90 >.75) after ≥6
months antidep
medication
SCL-90:
depression
subscale
Adult women
whose husbands
died 4-36 months
previously
BDI short
form
CES-D
PHQ-9
Participation in
active peer-support
breast cancer
“bulletin board”
forum
Participation in
active peer-support
breast cancer
“bulletin board”
forum
Educational peerled diabetes selfmanagement
program
Telephone care
management +
peer-led group
(1) Telephone care
management
(2) Usual medical
care
[SCID]
[SCL-90]
Usual medical care
(waitlist for
program)
Supportive mutualhelp group
treatment
(3) Telephone care
management +
professionally-led
group
Professionallyadministered brief
dynamic
psychotherapy
calls
6 months/
variable (selfdirected)
participation
frequency
6 months/
variable (selfdirected)
participation
frequency
6 weekly 2.5hour sessions
None
reported
3
None
reported
3
6 months
7
6 weekly
workshops
followed by
ongoing bimonthly
groups
9 months
8
12 weekly
1.5-hour
sessions
12 months
9
64
McKay,
2002
Nagel,
1988
Onrust,
2010
Preyde,
2003
RobinsonWhelan,
2007
before
Internet
Nursing home/
1:1 in person
(half with
seniors and half
with
adolescents)
Home-based;
1:1 in person
with peer
supporter who
were widowed
themselves for
several years
Hospital NICU/
1:1 in person
with peers who
had previously
experienced
having a
preterm infant
Centers for
independent
living/ group
led by peer
women with
disabilities
Adults diagnosed
with type-2
diabetes
CES-D
(1) Educational
material + online
peer support forum
(2) Educational
material + online
peer support forum
+ professional selfmanagement coach
(1) Empathyfocused peer
counseling
(1) Educational
material only
10 months/
variable (selfdirected)
participation
frequency
None
reported
6
No treatment
5 weeks/ two
1-hour
meetings per
week
None
reported
6
Brief informational
brochure on
depression
symptoms
M = 8.3
home visits
6-9 months
9
(2) Professional
self-management
coach only
Elderly nursing
home residents
with mild to
moderate
depression
symptoms (Zung
score 40-61)
Netherlands
adults age ≥55
whose spouses
died during the
past year
Zung SDS
Adult women
immediately
postpartum with
very preterm
infants less than
10 days old
BDI short
form
“Parent buddy”
peer support
program
Standard medical
care
16 weeks/ M
= 9 contacts
None
reported
7
Adult rural-living
women with a
physical disability
and mild to
moderate
depression
BDI-II
Peer-led depression
self-management
program
Usual medical care
8 weekly 2.5hour sessions
3 months
8
CES-D
[CES-D]
(2) Educationfocused peer
counseling
Peer support home
visiting service
65
Roman,
1995
Regional
perinatal center/
1:1 in person
and phone with
peers who had a
preterm infant
≥1 year before
Silver,
1997
Urban medical
center/ 1:1 in
person and
phone with peer
women who
had raised
children with
health problems
HIV primary
care outpatient
clinic/ group in
person and 1:1
phone calls with
peer supporters
(HIV+ and on
HAART)
Outpatient
mental health
clinics/ 1:1
phone with
another
participant
Community
facility/ group
led by peer
(widowed)
facilitator
Simoni,
2007
Travis,
2010
Tudiver,
1992
symptoms (BDI-II
14-29)
Adult women
immediately
postpartum, no
prior NICU
experience, with
preterm infants
less than 6 days
old
Adult mothers of
5-8 y.o. children
with ongoing
health conditions
POMS:
depression
subscale
Veteran parent peer
support program
No treatment
12 months/
variable
contact (M =
34 hrs)
None
reported
6
PSI:
depression
subscale
Supportive peer
contact program
No treatment
12 months/ 6
face-to-face
meetings and
at least
biweekly
phone contact
None
reported
8
New York area
adult HIVpositive patients
on HAART
regimen
CES-D
Peer support
intervention
targeting
medication
adherence and
depression
symptomology
Standard medical
care
3 months/ 6
bimonthly
group
meetings, and
phone calls 13x/week
3 months
8
Michigan adults
with depression
symptoms (BDI-II
>12) and hx of at
least 2 antidep
med trials
Adult men
recently widowed
(3-12 months
prior)
BDI-II
Telephone-based
mutual peer
support program
None
reported
2
BDI short
form
Peer-facilitated
mutual help groups
12 weeks/ at
least 1
weekly call
(M = 10.3
completed
calls)
9 weekly 1.5hour sessions
8 months
(not
included in
analyses b/c
of
7
[GHQ]
Waiting list
(offered tx after 8
months)
66
Uccelli,
2004
Not specified/
group led by
peer (MS
patient
nominated by
neurologist)
Italian adults
diagnosed with
MS
BDI: MS
adaptation
Peer support group
intervention
8 weekly
sessions
insufficient
data
reported)
None
reported
4
67
Table 2: Characteristics of depression symptom measurement instruments
Instrument
Beck Depression
Inventory (BDI)
Beck Depression
Inventory-II (BDI-II)
Hamilton Rating Scale for
Depression (HRSD)
Center for
Epidemiological StudiesDepression (CES-D)
Edinburgh Postnatal
Depression Scale (EPDS)
Patient Health
Questionnaire-9 (PHQ-9)
Symptom Check List-90
(SCL-90)2
Zung Self-Rating
Depression Scale
Beck Depression
Inventory-Short Form
Profile of Mood States
(POMS)2
Psychiatric Symptom
Index (PSI)2
1
Author
Beck et al., 1961
Beck et al., 1996
Hamilton, 1960
Radloff, 1977
Cox et al., 1987
Kroenke et al., 2001
Derogatis et al., 1977
Zung, 1965
Beck et al., 1993
McNair et al., 1981
Ilfeld, 1976
N1
Description
21-item self-report; scores
range 0-63
21-item self-report; scores
range 0-63
Interviewer-administered;
scores range 0-52
20-item self-report; scores
range 0-60
2
Score threshold
for elevation
≥ 10
2
≥ 14
1
≥8
11
> 15
10-item self-report; scores
range 0-30
9-item self-report; scores
range 0-27
20-item self-report; scores
range 0-60
20-item self-report; scores
range 20-80
13-item self-report; scores
range 0-39
15-item self-report; scores
range 0-60
10-item self-report; scores
range 0-100 (standardized)
2
≥9
1
≥5
1
≥ 253
1
≥ 40
3
≥5
1
≥7
1
≥ 20
Number of studies included in meta-analysis for which this was considered the primary depression measure
Depression subscale score
3
Represents summed item scores; sometimes reported as mean score per item (as in Ludman et al., 2007)
2
68
Table 3: Results of included studies
Study
Andersson,
2005
Defining
condition of
sample
Depression
%
female
Mean
age
74
36.1
Bright,
1999
Depression
Cheung,
2000
Dennis,
2009
Depression
83
38
Depression
(postnatal)
100
Giese-Davis,
2006
Chronic
illness
(breast
cancer)
Chronic
illness
(HIV/AIDS)
Chronic
illness
(breast
cancer)
Depression
100
78%
ages
20-34
51.31
Gifford,
1998
Gotay,
2007
Heller,
1991
Hill,
2006
Chronic
illness
71
0
100
100
100
Group
Enrolled
N
Completed
Follow-up
Depression mean (SD)
Pre-treatment Post-treatment Follow-up
PAI
CG
60
57
49
36
35
36
20.9 (8.5)
20.5 (6.7)
19.5 (8.1)
12.2 (6.8)
PAI(1)
PAI(2)
CG(1)
CG(2)
22
21
27
28
14
13
18
22
18.01 (4.7)
17.86 (5.18)
17.11 (3.59)
17.67 (4.70)
6.07 (2.65)
6.85 (3.71)
8.17 (6.41)
8.50 (6.39)
PAI
83
65
21.54 (12.28)
23.57 (13.16)
PAI
CG
349
352
297
316
12.5 (2.8)
12.62 (2.76)
7.93 (4.68)
8.89 (5.24)
PAI
29
19
21.75 (12.51)
Not given
PAI
CG
34
37
25
33
17.3 (8.8)
19.6 (11.3)
18.7 (10.5)
20.0 (11.6)
PAI
CG
152
152
128
124
17.72 (10.71)
15.62 (10.12)
15.39 (9.69)
14.14 (9.81)
PAI(1)
PAI(2)
CG(1)
CG(2)
49
49
44
53
49
46
44
53
31.1 (8.4)
31.6 (6.8)
32.0 (9.2)
32.8 (8.3)
31.3 (10.1)
30.5 (7.6)
31.2 (9.3)
30.4 (7.1)
PAI
61
43
18.52 (11.64)
15.50 (11.90)
13.1 (7.6)
13.1 (9.1)
45.8
289
311
45.26
54
74
92%
older
7.00 (4.66)
7.61 (4.59)
69
Letourneau,
2011
Lieberman,
2005
Lieberman,
2006
Lorig,
2009
Ludman,
2007
(mixed, e.g.,
diabetes,
MS, heart
disease,
cancer)
Depression
(postnatal)
Chronic
illness
(breast
cancer)
Chronic
illness
(breast
cancer)
Chronic
illness (type2 diabetes)
Depression
than
42
100
100
100
66
71
Major
ity
26-35
46.2
18.13 (10.34)
16.91 (11.62)
PAI
CG
27
33
23
28
18.20 (4.76)
16.1 (4.67)
11.80 (4.68)
8.68 (5.44)
PAI
114
91
20.0 (.85)
15.3 (1.2)
PAI
74
52
19.90 (10.01)
14.60 (10.08)
PAI
CG
186
159
161
133
102
n/a
6.35 (5.85)
6.46 (5.60)
4.84 (4.33)
7.00 (4.49)
3.42 (5.03)
n/a
PAI(1)
CG(1)
CG(2)
CG(3)
26
26
26
26
22
23
23
26
21
20
23
25
1.63 (.68)
1.61 (.50)
1.66 (.54)
1.72 (.56)
1.22 (.54)
1.65 (.68)
1.21 (.58)
1.44 (.66)
1.24 (.95)
1.19 (.68)
1.19 (.65)
1.14 (.70)
PAI
CG
30
31
26
29
30
31
8.80 (4.66)
8.65 (4.69)
8.77 (5.72)
6.77 (5.82)
7.40 (6.59)
5.35 (4.88)
PAI(1)
PAI(2)
CG(1)
CG(2)
40
40
40
40
30
33
33
37
15.83 (12.79)
16.61 (11.67)
15.27 (12.25)
18.11 (11.79)
13.86 (12.62)
14.87 (9.69)
15.31 (9.15)
16.42 (12.98)
PAI(1)
PAI(2)
CG
20
20
20
20
20
20
45.60 (4.77)
46.05 (3.65)
49.60 (6.97)
39.80 (6.62)
42.40 (4.26)
48.50 (6.20)
50.2
Nagel,
1988
Depression
Not
available
74
Onrust,
Stressor
64
68.9
47
57
66.7
100
McKay,
2002
59
45.5
Stressor
(spousal
bereavemt)
Chronic
illness (type2 diabetes)
Marmar,
1988
CG
58
59
70
2010
Preyde,
2003
RobinsonWhelan,
2007
Roman,
1995
Silver,
1997
Simoni,
2007
Travis,
2010
Tudiver,
1992
Uccelli,
2004
(spousal
bereavemt)
Stressor
(preterm
infant)
Depression
+ Chronic
Illness
Stressor
(preterm
infant)
Stressor
(child’s
ongoing
health
problem)
Chronic
illness
(HIV)
Depression
Stressor
(spousal
bereavemt)
Chronic
illness
(multiple
sclerosis)
100
100
100
100
44
110
106
93
95
PAI
CG
32
28
24
25
PAI
CG
69
65
42
54
PAI
CG
27
31
PAI
CG
91
94
17.1 (9.4)
16.2 (9.1)
15.1 (8.1)
14.7 (8.7)
4.53 (3.81)
5.57 (3.28)
2.20 (4.00)
4.88 (3.77)
25.2 (11.3)
29.07 (11.30)
19.93 (11.3)
27.50 (11.30)
22
25
11.7 (11.4)
14.25 (10.35)
9.00 (11.25)
7.78 (8.25)
183
182
174
169
23.8 (17.5)
20.8 (16.6)
22.1 (17.5)
19.6 (16.9)
PAI
CG
71
65
54
51
19.9 (12.4)
19.6 (11.2)
17.6 (11.6)
16.4 (11.5)
PAI
54
32
26.0 (10.1)
22.1 (10.7)
PAI
CG
61
52
57
37
7.3
8.5
6.2
5.4
PAI
44
42
27.19 (7.83)
26.24 (6.80)
13.1 (7.3)
13.5 (7.9)
30
51.59
42
54
17.58 (11.30)
24.88 (11.30)
26
34.4
42.6
37
52.4
0
63
60
PAI
CG
>18,
no
mean
given
59
57
57
37
21.3 (14.5)
17.9 (11.2)
5.6
3.9
71
Table 4: Mean SMD estimates (PAI compared with active therapy conditions) adjusted
for four models of selection (publication) bias
FIXED EFFECTS
Parameter
Standard
estimate
error
Unadjusted ES
(SMD)
0.03141657
Moderate onetailed selection
Severe one-tailed
selection
Moderate twotailed selection
Severe two-tailed
selection
-0.01738422
RANDOM EFFECTS
Parameter
V (estimate of)
estimate
population effectsize variance
0.08347356
0.09470620
(SE =
(SE =
0.11564934)
0.06408005)
-0.01431520
0.09691550
-0.09302688
-0.83928611
0.97821510
0.02677201
0.07177643
0.09400445
0.02142850
0.05942731
0.09353144
0.07084147
72
Table 5: Mean SMD estimates (PAI compared with TAU/WL conditions) adjusted for
four models of selection (publication) bias
FIXED EFFECTS
Parameter
Standard error
estimate
Unadjusted ES
(SMD)
0.09130349
Moderate onetailed selection
Severe one-tailed
selection
Moderate twotailed selection
Severe two-tailed
selection
0.06115963
RANDOM EFFECTS
Parameter
V (estimate of)
estimate
population
effect-size
variance
0.1026520 (SE 0.14413175
= 0.1060054)
(SE =
0.07496142)
0.006533058
0.1516806
0.02258148
-1.018147072
1.5018998
0.07947753
0.095101156
0.1438070
0.06487532
0.086774433
0.1436267
0.04030200
73
APPENDIX A: CODING FORM
First Author:
Title:
Year of Publication:
1. Control group? YES NO
If yes:
a. Randomized? YES NO NOT SPECIFIED
b. Assessments blinded? YES NO NOT SPECIFIED
c. Number of control groups:
d. Type of CG1: OTHER THERAPY
TAU
MED
WAITLIST
i. If other therapy, specify type:
e. Type of CG2: OTHER THERAPY
TAU
MED
WAITLIST
i. If other therapy, specify type:
f. Researcher allegiance: PAI was presented as:
i. _____ An active treatment (i.e., a treatment of interest)
ii. _____ A control condition
2. Sample type:
OTHER
DEPRESSION
STRESSOR
CHRONIC ILLNESS
3. Intended duration of PAI treatment: ______ weeks and/or ______ contacts
4. Observed average duration of PAI treatment: ______ weeks and/or ____
contacts
5. Frequency of PAI treatment: WEEKLY MORE THAN WEEKLY
LESS THAN WEEKLY
VARIABLE/UNCLEAR
6. Format of PAI treatment:
GROUP
7. Mode of delivery of PAI treatment:
INTERNET
8. Support is:
ANOTHER
MUTUAL
ONE-ON-ONE
FACE-TO-FACE
PHONE
DELIVERED FROM ONE PERSON TO
74
a. If one person to another, any outcomes for supporters reported?
YES NO
9. Peer supporters were selected by: THEIR OWN RESPONSE TO AN AD
RECRUITED AS FORMER PATIENTS
OTHER
NOT SPECIFIED
10. Pre-intervention training of peer supporters:
SPECIFIED
SOME
NONE
NOT
11. Did peer supporters have ongoing contact with a professional supervisor over
the course of the intervention?
YES
NO NOT SPECIFIED
12. Was the PAI treatment co-administered by a health or mental health
professional, or entirely peer-administered?
PROFESSIONAL COADMINISTERED
ENTIRELY PEER-ADMINISTERED
13. Content of PAI treatment is described as primarily:
SUPPORTIVE
EDUCATIONAL
NOT SPECIFIED
14. Total N screened for study: _____
15. Total N enrolled in study (providing pretest data): _____
a. N enrolled in PAI: _____
b. N enrolled in CG 1:______; CG 2: ______
16. N completed study (providing posttest data): _____ (specify whether N or %)
a. N completed PAI: _____
b. N completed CG1:_____; N completed CG2: ______
17. Gender breakdown of total sample: _____% female, _____% male
a. Gender breakdown of PAI group: _____% female, _____% male
b. Gender breakdown of CG1: _____% female, _____% male
c. Gender breakdown of CG2: _____% female, _____% male
18. Mean age of total sample: _____ years
a. Mean age of PAI group: _____ years
b. Mean age of CG1: _____ years
75
c. Mean age of CG2: _____ years
19. Instrument(s) used to measure depression (check all that were used)*:
a. _____ HRSD (a.k.a., HAM-D)
b. _____ BDI or BDI-II
c. _____ CES-D
d. _____ Other: ______________________________________
*If more than one measure was used, write “primary” or “secondary” next to each one checked. HRSD is
always primary if it was used, followed in order of priority by BDI, CES-D, EPDS, HADS, and finally any
other measure.
20. PRETEST depression score of PAI group:
a. Primary measure name: ______ Mean: ______
SD:______
b. Secondary measure name: ______ Mean: ______
SD:______
21. POSTTEST depression score of PAI group:
a. Primary measure name: ______ Mean: ______
SD:______
b. Secondary measure name: ______ Mean: ______
SD:______
22. PRETEST depression score of CG1:
a. Primary measure name: ______ Mean: ______
SD:______
b. Secondary measure name: ______ Mean: ______
SD:______
23. POSTTEST depression score of CG1:
a. Primary measure name: ______ Mean: ______
SD:______
b. Secondary measure name: ______ Mean: ______
SD:______
24. PRETEST depression score of CG2:
a. Primary measure name: ______ Mean: ______
SD:______
b. Secondary measure name: ______ Mean: ______
SD:______
25. POSTTEST depression score of CG2:
a. Primary measure name: ______ Mean: ______
SD:______
b. Secondary measure name: ______ Mean: ______
SD:______
26. If reported, % considered “improved” at posttest:
PAI: ______
CG1: ______
CG2: ______
76
27. Are any follow-up data reported (including in a separate manuscript)?
YES NO
If yes:
a. Number of months post-treatment: ______
b. Was help-seeking behavior during the follow-up period reported?
YES NO
i. If Yes, % who sought help: PAI: ____ CG1: ____ CG2:
____
c. Was % remaining improved at follow-up reported? YES NO
i. If Yes, % remaining improved: PAI: ____ CG1: ____ CG2:
____
d. FOLLOW-UP depression score of PAI group:
i. Primary measure mean: ______
SD:______
ii. Secondary measure mean: ______
SD: ______
e. FOLLOW-UP depression score of CG1:
i. Primary measure mean: ______
SD:______
ii. Secondary measure mean: ______
SD: ______
f. FOLLOW-UP depression score of CG2:
i. Primary measure mean: ______
SD:______
ii. Secondary measure mean: ______
SD: ______
77
APPENDIX B: METHODOLOGICAL QUALITY SCORE (MQS) CODING SYSTEM
(adapted from Miller et al., 1995)
Methodological
feature
Group allocation
Quality control
Immediate
posttreatment
assessment rate
Additional
posttreatment followups
Dropouts
Blinding
Analyses
Multisite
Points available
2 = randomized comparison group
1 = nonrandomized comparison group
0 = no comparison group
1 = treatment standardized by manual,
specific training, etc.
0 = no standardization specified
2 = 85-100% of post assessments completed
1 = 70-84.9% of post assessments completed
0 = <70% of post assessments completed, or
not specified
1 = at least one additional post-treatment
follow-up completed
0 = no additional post-treatment follow-up,
or not specified
1 = treatment dropouts included in at least
some outcome data (e.g., intent-to-treat
analysis or compared on dependent variable)
0 = treatment dropouts not discussed or
accounted for
1 = posttreatment assessment done by
blinded interviewer
0 = posttreatment assessment nonblinded, or
unspecified
1 = acceptable statistical analyses
0 = no statistical analyses, inappropriate
analyses, or unspecified
1 = parallel replications at two or more sites
with separate research teams
0 = single site or comparison of sites
offering different treatments
SUM QUALITY SCORE:
Points
awarded
78
REFERENCES
Asterisked entries denote studies that were included in the meta-analysis.
*Andersson, G., Bergström, J., Holländere, F., Carlbring, P., Kaldo, V., & Ekselius, L.
(2005). Internet-based self-help for depression: Randomised controlled trial.
The British Journal of Psychiatry, 187, 456-461. doi:10.1192/bjp.187.5.456
Andersson, G., & Cuijpers, P. (2009). Internet-based and other computerized
psychological treatments for adult depression: A meta-analysis. Cognitive
Behaviour Therapy, 38, 196-205. doi:10.1080/16506070903318960
Batson, C.D., Sympson, S.C., Hindman, J.L., Decruz, P., Todd, R.M., Weeks, J.L.,
Jennings, G., & Burns, C.T. (1996). “I’ve been there, too”: Effect on empathy of
prior experience with a need. Personality and Social Psychology Bulletin, 22,
474-482. doi: 10.1177/0146167296225005
Beck, A.T., & Steer, R.A. (1993). Beck Depression Inventory Manual. San Antonio,
TX: Psychological Corporation.
Beck, A.T., Steer, R.A., & Brown, G.K. (1996). Manual for the Beck Depression
Inventory-II. San Antonio, TX: Psychological Corporation.
Beck, A.T., Ward, C.H., Mendelson, M., Mock, J., & Erbaugh, J. (1961). An inventory
for measuring depression. Archives of General Psychiatry, 4, 561-571.
doi:10.1001/archpsyc.1961.01710120031004
Berman, J.S., & Norton, N.C. (1985). Does professional training make a therapist more
effective? Psychological Bulletin, 98, 401-407. doi:10.1037/0033-2909.98.2.401
Bickman, L. (1999). Practice makes perfect and other myths about mental health
services. American Psychologist, 54, 965-978. doi:10.1037/h0088206
*Bright, J.I., Baker, K.D., & Neimeyer, R.A. (1999). Professional and paraprofessional
group treatments for depression: A comparison of cognitive-behavioral and
mutual support interventions. Journal of Consulting and Clinical Psychology, 67,
491-501. doi:10.1037/0022-006X.67.4.491
Burns, D.D., & Nolen-Hoeksema, S. (1992). Therapeutic empathy and recovery from
depression in cognitive-behavioral therapy: A structural equation model. Journal
of Consulting and Clinical Psychology, 60, 441-449. doi: 10.1037/0022006X.60.3.441
79
Byrd, M. (1984). Personal growth aspects of peer counselor training for older adults.
Educational Gerontology, 10, 369-385. doi:10.1080/0380127840100503
Chambless, D.L., & Hollon, S.D. (1998). Defining empirically supported therapies.
Journal of Consulting and Clinical Psychology, 66, 7-18. doi:10.1037/0022006X.66.1.7
Chen, S.X., & Mak, W.W.S. (2008). Seeking professional help: Etiology beliefs about
mental illness across cultures. Journal of Counseling Psychology, 55, 442-450.
doi: 10.1037/a0012898
*Cheung, S., & Sun, S.Y.K. (2000). Effects of self-efficacy and social support on the
mental health conditions of mutual-aid organization members. Social Behavior
and Personality, 28, 413-422. doi:10.2224/sbp.2000.28.5.413
Chinman, M.J., Rosenheck, R., Lam, J.A., & Davidson, L. (2000). Comparing consumer
and nonconsumer provided case management services for homeless persons with
serious mental illness. The Journal of Nervous and Mental Disease, 188, 446453. doi:10.1097/00005053-200007000-00009
Christensen, A., & Jacobson, N.S. (1994). Who (or what) can do psychotherapy: The
status and challenge of nonprofessional therapies. Psychological Science, 5, 814. doi:10.1111/j.1467-9280.1994.tb00606.x
Cohen, J. (1960). A coefficient of agreement for nominal scales. Educational and
Psychological Measurement, 20, 37-46. doi:10.1177/001316446002000104
Cox J.L., Holden J.M., & Sagovsky R. (1987) Detection of postnatal depression:
Development of the 10-item Edinburgh Postnatal Depression Scale. British
Journal of Psychiatry, 150, 782-786. doi:10.1192/bjp.150.6.782
De Vries, B., & Petty, B.J. (1992). Peer-counseling training: Analysis of personal growth
for older adults. Educational Gerontology, 18, 381-393.
doi:10.1080/0360127920180406
Den Boer, P.C.A.M., Wiersma, D., Russo, S., & van den Bosch, R.J. (2005).
Paraprofessionals for anxiety and depressive disorders. Cochrane Database of
Systematic Reviews, 2, CD004688. doi:10.1002/14651858.CD004688
*Dennis, C., Hodnett, E., Kenton, L., Weston, J., Zupancic, J., Stewart, D.E., & Kiss, A.
(2009). Effect of peer support on prevention of postnatal depression among high
risk women: Multisite randomized controlled trial. BMJ, 338, a3064.
doi:10.1136/bmj.a3064
80
Derogatis, L.R., & Cleary, P.A. (1977). Confirmation of the dimensional structure of the
SCL-90: A study in construct validation. Journal of Clinical Psychology, 33,
981-989. doi:10.1002/1097-4679(197710)33:4<981::AIDJCLP2270330412>3.0.CO;2-0
Diala, C.C., Muntaner, C., Walrath, C., Nickerson, K., LaVeist, T., & Leaf, P. (2001).
Racial/ethnic differences in attitudes toward seeking professional mental health
services. American Journal of Public Health, 91, 805-807.
doi:10.2105/AJPH.91.5.805
Doughty, C., & Tse, S. (2011). Can consumer-led mental health services be equally
effective? An integrative review of CLMH services in high-income countries.
Community Mental Health Journal, 47, 252-266. doi:10.1007/s10597-010-93215
Driessen, E., Cuijpers, P., de Maat, S.C.M., Abbass, A.A., de Jonghe, F., & Dekker,
J.J.M. (2010). The efficacy of short-term psychodynamic psychotherapy for
depression: A meta-analysis. Clinical Psychology Review, 30, 25-36.
doi:10.1016/j.cpr.2009.08.010
Durlak, J.A. (1979). Comparative effectiveness of paraprofessional and professional
helpers. Psychological Bulletin, 86, 80-92. doi:10.1037/0033-2909.86.1.80
Duval, S., & Tweedie, R. (2000). Trim and fill: A simple funnel plot-based method of
testing and adjusting for publication bias in meta-analysis. Biometrics, 56, 455463. doi:10.1111/j.0006-341X.2000.00455.x
Eklund, J., Andersson-Straburg, T., & Hansen, E.M. (2009). “I’ve also experienced loss
and fear”: Effects of prior similar experience on empathy. Scandinavian Journal
of Psychology, 50, 65-69. doi:10.1111/j.1467-9450.2008.00673.x
Field, A.P., & Gillett, R. (2010). How to do a meta-analysis. British Journal of
Mathematical & Statistical Psychology, 63, 665-694. Files accessed 12/1/12 at
http://www.statisticshell.com/meta_analysis/how_to_do_a_meta_analysis.html.
Garcia, Y.E., Metha, A., Perfect, M.C., & McWhirter, J.J. (1997). A senior peer
counseling program: Evaluation of training and benefits to counselors.
Educational Gerontology, 23, 329-344. doi:10.1080/0360127970230403
*Giese-Davis, J., Bliss-Isberg, C., Carson, K., Star, P., Donaghy, J., Cordova, M.J. … &
Spiegel, D. (2006). The effect of peer counseling on quality of life following
diagnosis of breast cancer: An observational study. Psycho-Oncology, 15, 10141022. doi:10.1002/pon.1037
81
*Gifford, A.L., Laurent, D.D., Gonzales, V.M., Chesney, M.A., & Lorig, K.R. (1998).
Pilot randomized trial of education to improve self-management skills of men
with symptomatic HIV/AIDS. Journal of Acquired Immune Deficiency
Syndromes and Human Retrovirology, 18, 136-144.
Goldstrom, I.D., Campbell, J., Rogers, J.A., Lambert, D.B., Blacklow, B., Henderson,
M.J., & Manderschied, R.W. (2006). National estimates for mental health mutual
support groups, self-help organizations, and consumer-operated services.
Administration and Policy in Mental Health and Mental Health Services
Research, 33, 92-103. doi: 10.1007/s10488-005-0019-x
* Gotay, C.C., Moinpour, C.M., Unger, J.M., Jiang, C.S., Coleman, D., Martino, S.,
Parker, B.J., Bearden, J.D., Dakhil, S., Gross, H.M., Lippman, S., & Albain, K.S.
(2007). Impact of a peer-delivered telephone intervention for women
experiencing a breast cancer recurrence. Journal of Clinical Oncology, 25, 20932099. doi:10.1200/JCO.2006.07.4674
Greenberg, L.S., Elliott, R., Watson, J.C., & Bohart, A.C. (2001). Empathy.
Psychotherapy, 38, 380-384. doi:10.1037/a0022187
Greenberg, P.E., Kessler, R.C., Birnbaum, H.G., Leong, S.A., Lowe, S.W., Berglund,
P.A., & Corey-Lisle, P.K. (2003). The economic burden of depression in the
United States: How did it change between 1990 and 2000? Journal of Clinical
Psychiatry, 64, 1465-1475. doi:10.4088/JCP.v64n1211
Hamilton, M. (1960). A rating scale for depression. Journal of Neurology, Neurosurgery
and Psychiatry. 23, 56-62. doi:10.1136/jnnp.23.1.56
Hans, E., & Hiller, W. (2013). Effectiveness of and dropout from outpatient cognitive
behavioral therapy for adult unipolar depression: A meta-analysis of
nonrandomized effectiveness studies. Journal of Consulting and Clinical
Psychology, 81, 75-88. doi:10.1037/a0031080
Hassenstab, J., Dziobek, I., Rogers, K., Wolf, O.T., & Convit, A. (2007). Knowing what
others know, feeling what others feel: A controlled study of empathy in
psychotherapists. The Journal of Nervous and Mental Disease, 195, 277-281.
doi:10.1097/01.nmd.0000253794.74540.2d
Hatcher, S.L., Favorite, T.K., Hardy, E.A., Goode, R.L., Deshetler, L.A., & Thomas,
R.M. (2005). An analogue study of therapist empathic process: Working with
difference. Psychotherapy: Theory, Research, Practice, Training, 42, 198-210.
doi:10.1037/0033-3204.42.2.198
82
Hattie, J.A., Sharpley, C.F., & Rogers, H.J. (1984). Comparative effectiveness of
professional and paraprofessional helpers. Psychological Bulletin, 95, 534-541.
doi:10.1037/0033-2909.95.3.534
Hedges, L.V. (1981). Distribution theory for Glass’s estimator of effect size and related
estimators. Journal of Educational Statistics, 6, 107-128.
doi:10.3102/10769986006002107
Hedges, L.V. (1982). Estimation of effect size from a series of independent experiments.
Psychological Bulletin, 92, 490-499. doi:10.1037/0033-2909.92.2.490
Hedges, L.V., & Olkin, I. (1985). Statistical methods for meta-analysis. Orlando, FL:
Academic Press.
*Heller, K., Thompson, M.G., Trueba, P.E., Hogg, J.R., & Vlachos-Weber, I. (1991).
Peer support telephone dyads for elderly women: Was this the wrong
intervention? American Journal of Community Psychology, 19, 53-74.
doi:10.1007/BF00942254
*Hill, W., Weinert, C., & Cudney, S. (2006). Influence of a computer intervention on the
psychological status of chronically ill rural women: Preliminary results. Nursing
Research, 55, 34-42. doi:10.1097/00006199-200601000-00005
Hodges, S.D., Kiel, K.J., Kramer, A.D.I., Veach, D., & Villanueva, B.R. (2010). Giving
birth to empathy: The effects of similar experience on empathic accuracy,
empathic concern, and perceived empathy. Personality and Social Psychology
Bulletin, 36, 398-409. doi:10.1177/0146167209350326
Hofmann, S.G., Sawyer, A.T., Witt, A.A., & Oh, D. (2010). The effect of mindfulnessbased therapy on anxiety and depression: A meta-analytic review. Journal of
Consulting and Clinical Psychology, 78, 169-183. doi:10.1037/a0018555
Hollon, S.D., Thase, M.E., & Markowitz, J.C. (2002). Treatment and prevention of
depression. Psychological Science in the Public Interest, 3, 39-77.
doi:10.1111/1529-1006.00008
Ilfeld, F.W. (1976). Further validation of a psychiatric symptom index in a normal
population. Psychological Reports, 39, 1215-1228.
doi:10.2466/pr0.1976.39.3f.1215
Jacobson, N.S., & Truax, P. (1981). Clinical significance: A statistical approach to
defining meaningful change in psychotherapy research. Journal of Consulting
and Clinical Psychology, 59, 12-19. doi:10.1037/0022-006X.59.1.12
83
Kessler, R.C., Berglund, P., Demler, O., Jin, R., & Walters, E.E. (2005). Lifetime
prevalence and age-of-onset distributions of DSM-IV disorders in the National
Comorbidity Study Replication. Archives of General Psychiatry, 62, 593-602.
doi:10.1001/archpsyc.62.6.593
Kessler, R.C., Borges, G., & Walters, E.E. (1999). Prevalence of and risk factors for
lifetime suicide attempts in the National Comorbidity Survey. Archives of
General Psychiatry, 56, 617-626. doi:10.1001/archpsyc.56.7.617
Kessler, R.C., Berglund, P., Demler, O., Jin, R., Koretz, D., Merikangas, K.R., Rush,
A.J., Walters, E.E., & Wang, P.S. (2003). The epidemiology of major depressive
disorder: Results from the National Comorbidity Study Replication (NCS-R).
JAMA: Journal of the American Medical Association, 289, 3095-3105.
doi:10.1001/jama.289.23.3095
Kroenke, K., Spitzer, R.L., & Williams, J.B. (2001). The PHQ-9: Validity of a brief
depression severity measure. Journal of General Internal Medicine, 16, 606-613.
doi:10.1046/j.1525-1497.2001.016009606.x
Lee, H. (1960). To Kill A Mockingbird. New York, NY: J.B. Lippincott Company.
*Letourneau, N., Stewart, M., Dennis, C., Hegadoren, K., Duffett-Leger, L., & Watson,
B. (2011). Effect of home-based peer support on maternal–infant interactions
among women with postpartum depression: A randomized, controlled trial.
International Jouranl of Mental Health Nursing, 20, 345-357.
doi:10.1111/j.1447-0349.2010.00736.x
*Lieberman, M.A., & Goldstein, B.A. (2005). Self-help on-line: An outcome evaluation
of breast cancer bulletin boards. Journal of Health Psychology, 10, 855-862.
doi:10.1177/1359105305057319
*Lieberman, M.A., & Goldstein, B.A. (2006). Not all negative emotions are equal: The
role of emotional expression in online support groups for women with breast
cancer. Psycho-Oncology, 15, 160-168. doi:10.1002/pon.932
Light, R.J., & Pillemer, D.B. (1984). Summing up: The science of reviewing research.
Cambridge, MA: Harvard University Press.
Lipsey, M.W., & Wilson, D.B. (2001). Practical Meta-Analysis. Thousand Oaks, CA:
Sage Publications, Inc.
*Lorig, K., Ritter, P.L., Villa, F.J., & Armas, J. (2009). Community-based peer-led
diabetes self-management: A randomized trial. The Diabetes Educator, 35, 641651. doi:10.1177/0145721709335006
84
*Ludman, E.J., Simon, G.E., Grothaus, L.C., Luce, C., Markley, D.K., & Schaefer, J.
(2007). A pilot study of telephone care management and structured disease selfmanagement groups for chronic depression. Psychiatric Services, 58, 10651072. doi:10.1176/appi.ps.58.8.1065
*Marmar, C.R., Horowitz, M.J., Weiss, D.S., Wilner, N.R., & Kaltreider, N.B. (1988). A
controlled trial of brief psychotherapy and mutual-help group treatment of
conjugal bereavement. American Journal of Psychiatry, 145, 203-209.
*McKay, H.G., Glasgow, R.E., Feil, E.G., Boles, S.M., Barrera, M. Jr. (2002). Internetbased diabetes self-management and support: Initial outcomes from the Diabetes
Network Project. Rehabilitation Psychology, 47, 31-48. doi:10.1037/00905550.47.1.31
McNair, D.M., Loor, M., & Droppleman, L.F. (1981). Profile of Mood States. San Diego,
CA: Educational and Industrial Testing Service.
Mead, N., Lester, H., Chew-Graham, C., Gask, L., & Bower, P. (2010). Effects of
befriending on depressive symptoms and distress: Systematic review and metaanalysis. The British Journal of Psychiatry, 196, 96-101.
doi:10.1192/bjp.bp.109.064089
Miller, W.R., Brown, J.M., Simpson, T.L., Handmaker, N.S., Bien, T.H., Luckie, L.F., et
al. (1995). What works? A methodological analysis of the alcohol treatment
outcome literature. In R.K. Hester and W.R. Miller (Eds.), Handbook of
Alcoholism Treatment Approaches: Effective Alternatives (2nd ed., pp. 12-44).
Boston: Allyn & Bacon.
Montgomery, E.C., Kunik, M.E., Wilson, N., Stanley, M.A., & Weiss, B. (2010). Can
paraprofessionals deliver cognitive-behavioral therapy to treat anxiety and
depressive symptoms? Bulletin of the Menninger Clinic, 74, 45-62.
doi:10.1521/bumc.2010.74.1.45
*Nagel, J., Cimbolic, P., & Newlin, M. (1988). Efficacy of elderly and adolescent
volunteer counselors in a nursing home setting. Journal of Counseling
Psychology, 35, 81-86. doi:10.1037/0022-0167.35.1.81
Olfson, M., Marcus, S.C., Druss, B., Elinson, L., Tanielian, T., & Pincus, H.A. (2002).
National trends in the outpatient treatment of depression. JAMA: Journal of the
American Medical Association, 287, 203-209. doi:10.1001/jama.287.2.203
85
*Onrust, S., Willemse, G., Van Den Bout, J., & Cuijpers, P. (2010). Effects of a visiting
service for older widowed individuals: A randomized clinical trial. Death
Studies, 34, 777-803. doi:10.1080/07481181003761252
Orwin, R.G. (1983). A fail-safe N for effect size in meta-analysis. Journal of
Educational Statistics, 8, 157-159. doi:10.3102/10769986008002157
Pfeiffer, P.N., Heisler, M., Piette, J.D., Rogers, M.A.M., & Valenstein, M. (2011).
Efficacy of peer support interventions for depression: A meta-analysis. General
Hospital Psychiatry, 33, 29-36. doi:10.1016/j.genhosppsych.2010.10.002
*Preyde, M., & Ardal, F. (2003). Effectiveness of a parent “buddy” program for mothers
of very preterm infants in a neonatal intensive care unit. CMAJ, 168, 969-973.
Radloff, L. S. (1977). The CES-D Scale: A self-report depression scale for research in the
general population. Applied Psychological Measurement, 1, 385-401.
doi:10.1177/014662167700100306
*Robinson-Whelan, S., Hughes, R.B., Taylor, H.B., Hall, J.W., & Rehm, L.P. (2007).
Depression self-management program for rural women with physical disabilities.
Rehabilitation Psychology, 52, 254-262. doi:10.1037/0090-5550.52.3.254
*Roman, L.A., Lindsay, J.K., Boger, R.P., DeWys, M., Beaumont, E.J., Jones, A.S., &
Haas, B., (1995). Parent-to-parent support initiated in the neonatal intensive care
unit. Research in Nursing & Health, 18, 385-394. doi:10.1002/nur.4770180504
Rosenthal, R. (1979). The “file drawer problem” and tolerance for null results.
Psychological Bulletin, 86, 638-641. doi:10.1037/0033-2909.86.3.638
Rosenthal, R., & Rubin, D.B. (1982). Comparing effect sizes of independent studies.
Psychological Bulletin, 92, 500-504. doi:10.1037/0033-2909.92.2.500
Sheu, H., & Sedlacek, W.E. (2004). An exploratory study of help-seeking attitudes and
coping strategies among college students by race and gender. Measurement and
Evaluation in Counseling and Development, 37, 130-143.
*Silver, E.J., Ireys, H.T., Bauman, L.J., & Stein, R.E.K. (1997). Psychological outcomes
of a support intervention in mothers of children with ongoing health conditions:
The parent-to-parent network. Journal of Community Psychology, 25, 249-264.
doi:10.1002/(SICI)1520-6629(199705)25:3<249::AID-JCOP3>3.0.CO;2-R
Simon, G.E., Grothaus, L., Durham, M.L., VonKorff, M., & Pabiniak, C. (1996). Impact
of visit copayments on outpatient mental health utilization by members of a
health maintenance organization. American Journal of Psychiatry, 153, 331-338.
86
*Simoni, J.M., Pantalone, D.W., Plummer, M.D., & Huang, B. (2007). A randomized
controlled trial of a peer support intervention targeting antiretroviral medication
adherence and depressive symptomatology in HIV-positive men and women.
Health Psychology, 26, 488-495. doi:10.1037/0278-6133.26.4.488
Spearman, C. (1904). The proof and measurement of association between two things.
The American Journal of Psychology, 15, 72-101. doi:10.2307/1412159
Steele, L., Dewa, C., & Lee, K. (2007). Socioeconomic status and self-reported barriers
to mental health service use. The Canadian Journal of Psychiatry, 52, 201-206.
Stein, D.M., & Lambert, M.J. (1984). On the relationship between therapist experience
and psychotherapy outcome. Clinical Psychology Review, 4, 127-142.
doi:10.1016/0272-7358(84)90025-4
Sutton, A.J. (2009). Publication bias. In The Handbook of Research Synthesis and MetaAnalysis (2nd Ed), pp. 435-452, H. Cooper, L.V. Hedges, & J.C. Valentine (Eds.).
New York, NY: Russell Sage Foundation.
Toro, P.A., Reischl, T.M., Zimmerman, M.A., Rappaport, J., Seidman, E., Luke, D.A., &
Roberts, L.J. (1988). Professionals in mutual help groups: Impact on social
climate and members' behavior. Journal of Consulting and Clinical Psychology,
56, 631-632. doi:10.1037/0022-006X.56.4.631
Townes, D.L., Chavez-Korell, S., & Cunningham, N.J. (2009). Reexamining the
relationships between racial identity, cultural mistrust, help-seeking attitudes, and
preference for a Black counselor. Journal of Counseling Psychology, 56, 330336. doi:10.1037/a0015449
*Travis, J., Roeder, K., Walters, H., Piette, J., Heisler, M., Ganoczy, D., Valenstein, M.,
& Pfeiffer, P. (2010). Telephone-based mutual peer support for depression: A
pilot study. Chronic Illness, 6, 183-191. doi:10.1177/1742395310369570
*Tudiver, F., Hilditch, J., Permaul, J.A., & McKendree, D.J. (1992). Does mutual help
facilitate newly bereaved widowers? Report of a randomized controlled trial.
Evaluation & the Health Professions, 15, 147-162.
doi:10.1177/016327879201500202
*Uccelli, M.M., Mohr, L.M., Battaglia, M.A., Zagami, P., & Mohr, D.C. (2004). Peer
support groups in multiple sclerosis: Current effectiveness and future directions.
Multiple Sclerosis, 10, 80-84. doi:10.1191/1352458504ms973oa
87
Vevea, J.L., & Hedges, L.V. (1995). A general linear model for estimating effect size in
the presence of publication bias. Psychometrika, 60, 419-435.
doi:10.1007/BF02294384
Vevea, J.L., & Woods, C.M. (2005). Publication bias in research synthesis: Sensitivity
analysis using a priori weight functions. Psychological Methods, 10, 428-443.
doi:10.1037/1082-989X.10.4.428
Westen, D., & Morrison, K. (2001). A multidimensional meta-analysis of treatments for
depression, panic, and generalized anxiety disorder: An empirical examination of
the status of empirically-supported therapies. Journal of Consulting and Clinical
Psychology, 69, 875-899. doi:10.1037/0022-006X.69.6.875
Wilson, D. B. (2010). Meta-analysis macros for SAS, SPSS, and Stata. Retrieved
September 8, 2012, from http://mason.gmu.edu/~dwilsonb/ma.html.
Zung, W.W. (1965). A self-rating depression scale. Archives of General Psychiatry, 12,
63-70. doi:10.1001/archpsyc.1965.01730060026004