effectiveness article - University of Colorado Denver

Author Affiliations
Jason R Williams
Casey Family Programs
Lisa Merkel-Holguin, Heather Allan
The Kempe Center for the Prevention and Treatment of Child Abuse and Neglect, University of
Colorado, School of Medicine, Department of Pediatrics
Erin J. Maher
Casey Family Programs
John Fluke, Dana Hollinshead
The Kempe Center for the Prevention and Treatment of Child Abuse and Neglect, University of
Colorado, School of Medicine, Department of Pediatrics
Keyword Suggestions (to be checked against MeSH)
implementation
family preservation
family support
child protective services
Bayesian Model Averaging
Family group conferencing
See specific formatting instructions. Documents will use APA, as well.
http://www.press.uchicago.edu/journals/jsswr/instruct.html?journal=jsswr

Abstracts should be written in present tense
Factors Associated with Staff Perceptions of the
Effectiveness of Family Group Conferences
Abstract
Family Group Conferencing (FGC) challenges the entrenched practice of child welfare agency
service providers making decisions for children and families. Instead, FGC position families as
capable planners and decision-makers given proper support and resources. Implementation
research in general highlights the importance of agency climate, worker characteristics, and
attitudes in predicting buy-in and ultimately the success of evidence-based programming. Prior
research has shown social work staff differ in their opinions of FGC, but little research has
connected these attitudes to staff and agency characteristics and, ultimately, to referrals and
outcomes. In the absence of literature describing agency and staff characteristics associated with
support of FGC in practice, we explore factors related to staff perceptions of effectiveness of
FGC using Bayesian Model Averaging with responses from N = 252 staff surveys. The results
imply that worker attitudes about the effectiveness of FGC are themselves a product of attitudes
towards families, type of work responsibility, and the perception of resources or services in the
external environment. Among those who carry a caseload, worker perceptions of FGC
effectiveness depend on perceptions of the availability and competence of local services to help
families in need. Among those who do not carry a caseload, endorsement of families’ abilities to
make plans to address their issues predicts endorsement of the effectiveness of FGC.
Introduction
Successful implementation of a program or intervention within an organization hinges in
large part on issues of ownership, efficacy, and discretion. Those charged with ground-level
implementation are more likely to do so with fidelity if they are personally invested in practices
that they believe will be effective. A program injected from outside will be treated like an
immune system treats a foreign body. Where there is room for interpretation and authority to
choose from among several possible options, these ground-level implementers become
essentially makers of policy, the results of which may be different than what was intended by
leadership (Lipsky 1980; Križ & Skivenes, 2014). In this paper, we address child welfare agency
staff attitudes and perceptions of family group conferences (FGCs) and the implications for
implementation. Specifically, we examine what predicts worker perceptions that FGCs are an
effective decision-making structure.
Family Group Conferencing in Child Welfare
Family group conferencing (FGC) is firmly rooted in New Zealand’s Children, Young
Persons, and their Families Act of 1989, which required participatory decision making between
child welfare systems and the family groups of children who are abused and/or neglected. This
reform acknowledged the marginalization and exclusion of family groups when their children
come to the attention of the state, honored indigenous and traditional ways of resolving concerns,
and recognized institutional racism within the public systems charged with the protection of
children and support of families. It emerged from a rights-based framework that all children are
entitled to having their family systems come together on their behalf to make decisions about
their well-being. The legislation deconstructed the prevailing, entrenched practice of child
welfare agency service providers making decisions for children and families. Instead, FGC
positions family groups to lead child welfare decision making, with the support of service
providers. It challenged the predominant construct of professionals as experts and required
systems and their professionals to view families more holistically and as solution builders and
leaders able to address challenges that confront their family systems (Ministerial Advisory
Committee, 1988).
FGC is among the most recognized models that fit under the family group decision
making (FGDM) framework. In 2008, five core components of FGDM were defined by experts,
with a sixth component related to follow-up added in 2013 (Kempe Center for the Prevention
and Treatment of Child Abuse and Neglect, 2013, p. 2). These components are:
1. An independent coordinator is responsible for convening the family group meeting with
agency personnel.
2. Child protection agency personnel recognize the family group as the key decision-making
partners and offer time and resources to convene the family group.
3. After initial presentations, family groups meet on their own, without statutory authorities
(e.g., CPS workers), to work through information they have been given and formulate
responses and plans.
4. When agency concerns are adequately addressed, preference is given to the family
group’s plan over any other possible plan.
5. Follow-up processes after the FGDM meeting occur until the intended outcomes are
achieved, to ensure that the plan continues to be relevant, current, and achievable. FGDM
is not a one-time event but an ongoing, active process.
6. Referring agencies support family groups by providing the necessary services and
resources to implement the plan.
Given that FGC is enshrined in legislation and policy in New Zealand, the influence of
service providers’ perceptions about FGC have not been a primary line of inquiry in that country.
That is not the case in other countries, like the United States, where FGC may be considered a
relatively new practice, and its implementation is often at the discretion of frontline social
workers to refer families to participate. The organizational bureaucracy, the procedural nature of
child protection work, and the values individual systems place on these types of decision-making
forums may further hamper or aid successful implementation (Brown, 2003).
Independent of the extent to which policy supports or directs the occurrence of family
group conferences, understanding the attitudes of the child welfare workforce (including
administrators, supervisors, and case workers) is a critical component to evaluating their use and,
ultimately, effectiveness in child welfare. In their influential review of implementation research,
Fixsen and colleagues (Fixsen, Naoom, Blase, Friedman, & Wallace, 2005) pointed out that
practitioners’ skills, attitudes, training, and coaching are keys to successful implementation of
evidence-based practices, though they noted that workforce issues have been understudied.
McCrae and colleagues (McCrae, Scannapieco, Leake, Potter, & Menefee, 2014) found agency
size, stress levels, worker tenure, and job position to be related to buy-in for implementation of a
child welfare practice model. There remains a dearth of literature examining child welfare
professionals’ attitudes and perspectives about FGC and other family meeting models. We
briefly review these studies here.
Some studies have suggested that the child welfare workforce possesses a generally
positive attitude about FGCs, particularly the underlying empowerment and strengths-based
philosophies of the model (Huntsman, 2006; Crow and Marsh, 1997; LeCroy and Milligan
2003). Even so, child welfare staff members have raised a number of concerns about FGCs,
including: skepticism of plans formulated at FGCs compared to other child welfare meetings
(Trotter, 1999), meeting length (Velen and Devine 2005), and excess family empowerment
(Huntsman, 2006). These perspectives may reflect ambiguity around the workforce’s attitudes
toward the core purpose of FGC, which is for the child welfare agency to facilitate and allow for
a process for family groups to make decisions. None of these studies connected these perceptions
to worker characteristics or to outputs or outcomes.
One study is particularly germane to this line of inquiry. In Sweden and the UK, Sundell,
Vinnerljung, and Ryburn (2001) studied the relationship between social worker attitudes toward
FGCs and referrals to the decision making process. In their survey of social work professionals
with limited experience with FGCs, a majority endorsed the need for private family time, the
capacity of extended families to make decisions, and the usefulness of FGC as a mechanism to
resolve child maltreatment concerns. Despite this, the rate of referrals to FGCs was generally low
in both countries. They also found that rates of referrals were higher when social workers: 1)
participated in developing FGC as a service in Sweden but not in the UK; and 2) had a more
positive attitude about FGCs. The age and previous work experience of the social worker did not
impact referral rates to FGCs.
As part of a broader examination of FGC implementation, this study focused on
understanding child welfare professionals’ perspectives about the effectiveness of FGC. These
perceptions are important to study since professionals’ buy-in impacts referrals (Sundell et al.,
2001) and other aspects of implementation. Further, quality implementation with a high degree
of family participation is an important ingredient in FGC practice and one which is hypothesized
to result in the achievement of desired outcomes. While this broader understanding of the role of
workforce attributes on FGC practice and outcomes is a longer-term goal, our purpose with this
exploratory analysis was to understand whether attributes and attitudes measured at the worker
level influence perceptions about the efficacy of FGC. We view this as a necessary precursor for
understanding the broader impact on implementation and, ultimately, outcomes.
Model Development Under Uncertainty
While child welfare staff members’ perceptions of the effectiveness of FGC are likely to
affect fidelity and success of implementation, few factors or combinations of factors have been
identified in the literature as associated with such perceptions. Thus, data collection and analysis
for this project began with few a priori hypotheses about individual factors and no hypothesis
about the best combination of factors to examine. This uncertainty about model composition or
functional form is common in research examining new practices and has implications for the
selection of an analytic approach.
Traditional inferential statistics is predicated on testing a single model against a null
model. In practice, however, even when authors present a single model, it is likely that multiple
models were actually considered. Several combinations of control variables may have been
considered through some formal or informal screening process (Raftery, 1995; Tobias & Li,
2004). Similarly, alternate ways of measuring constructs or capturing, for example, nonlinear
effects may be considered. The more such uncertainty enters into model building, the greater the
number of potential models to evaluate. Existing screening and selection methods that rely on pvalues or R2 or other goodness of fit statistics face numerous problems. These procedures do not
inherently account for the multiple comparisons necessary to generate the final model, and the
resulting “p-values based on a model selected from among a large set of possibilities no longer
have the same interpretation as they did when only two models were ever considered” (Raftery,
1995, p. 112). Standard errors or confidence intervals resulting from these procedures are
downwardly biased because they do not account for the uncertainty in model selection itself.
Given a relatively undeveloped literature from which to develop hypotheses on this topic
and the faults associated with traditional testing of multiple models, we employed Bayesian
Model Averaging (BMA) as our tool to weigh the evidence for each predictor and potential
model. Bayesian methods focus on which model fits the data best, and express all uncertainty—
including about the unknown parameters of interest—in terms of probability. We employed the
BMA methodology of Raftery and colleagues (Raftery, 1995; Hoeting, Madigan, Raftery, &
Volinksy, 1999) as implemented in the package BMA (Raftery, Hoeting, Volinsky, Painter, &
Yeung, 2013) for the statistical program R (R Core Team, 2014). This procedure allows for
competition even among highly correlated variables and facilitates the examination of different
operational definitions of the same construct.
BMA produces a model posterior probability, which is the likelihood that the candidate
model is the “correct” model that produced the data. It also produces inclusion probabilities that
capture the likelihood that each predictor is in the correct model—essentially, a measure of
predictor strength or the evidence supporting the inclusion or exclusion of that predictor
(Hoeting et al., 1999). The “Averaging” part of the name refers to the production of an expected
value of the coefficient weighted by the model posterior probability across all models
considered. A probability weighted standard deviation is calculated similarly, and is analogous to
the usual coefficient standard error except that it captures all uncertainty about the coefficient.
While users can report and act upon the expected values for all considered predictors, BMA is
often used to select a subset of predictors because the inclusion probabilities for some variables
will be quite low, providing moderate or strong evidence against there being an effect. Many
analysts select either the modal model (variables with inclusion probabilities above 0.50) or the
best model (highest posterior probability), which are often one and the same.
In this paper, we utilized BMA within an exploratory model building and testing effort,
combining the tool with more traditional inferential approaches as recommended by Gelman
(2011). We applied this exploratory approach to an on-going child welfare evaluation project,
identifying available variables potentially related to our outcome of interest—staff perceptions of
the effectiveness of FGCs. The data source and potential predictors considered are described
below.
Methods
Data Source
Data for these analyses were collected from child welfare agency staff in three
jurisdictions west of the Mississippi participating in a three-year evaluation of the use of family
meetings in child welfare (referred to as States 1, 2, and 3). These sites were chosen due to their
established family meeting practice; the three agencies had been implementing FGCs and other
family meetings for a minimum of six years and a maximum of 14 years. The survey was
administered as part of an evaluation training that informed staff of the evaluation goals, design,
and their roles in it, specific to their job. The on-line survey contained questions about staff
member demographic characteristics, position, and employment experience and scales pertaining
to case skills, family meeting knowledge and attitudes, organizational culture and climate,
service availability, and child safety vs. family preservation proclivity. In addition, as new staff
connected to the project joined the agency, they were trained on the evaluation and completed
the electronic survey at that time. Completing the survey took approximately 15 minutes.
Approval for this study was received from a western state’s Institutional Review Board.
Sample
The staff survey sampling frame consisted of all staff members who may have a role in
the family meeting evaluation or who may supervise direct service staff. Any staff member who
may have referred families to the study or participated in a family meeting was considered to
have a role in the evaluation. The bulk of survey data was collected in the Fall of 2012 during the
first wave of evaluation trainings at each site, with the remaining data collected via subsequent
webinar trainings for new staff through the Spring of 2014. Staff surveyed included caseworkers,
FGDM facilitators/coordinators, supervisors of both, and some upper level managers and
administrators. While there were 301 survey respondents overall, our analytical sample for the
BMA below was smaller (N = 252) due to missing responses across all variables considered.
Measures
Dependent Variable
The dependent variable was a composite of 10 items ranking FGC effectiveness for
addressing different family problems. The 10 family issues were drug abuse, alcohol abuse,
domestic/intimate partner violence, extreme poverty, extreme child behavior problems, mental
illness, developmental disability, extremely poor parenting skills, educational neglect, and
parent-child conflict. Respondents could select answers from a five-point scale that ranged from
not at all effective to completely effective. A not applicable option was also available to account
for staff who might not have much direct familiarity with FGC. A review of the distribution of
responses across all items revealed that most respondents (73 to 79%) endorsed moderately
effective or very effective. Therefore, we created a dichotomized composite dependent variable to
reflect ratings of at least very effective (i.e. rated a 4 or 5) on all items rated (i.e. excluding items
for which the respondent selected not applicable).
Predictors Examined for Inclusion
Table 1 describes the variables from the survey we tested in the modeling. For each item
or scale, we explored variation in response and, in some cases, covariation with the outcome in
making decisions about operationally defining the variable or creating composites to economize
on degrees of freedom. (Descriptive statistics appear in Table 2 below).
Table 1: Potential predictors tested.
Construct
Variables Composing Construct
Primary job
Position
Current work area FGDM
Caseload carried
Years in position
Years in child welfare
Experience with family meetings
Experience
Minority
Ethnicity
High job satisfaction
Job satisfaction
Family preservation proclivity
Worker attitude
toward families
Perceived changes
in workload due to
family meetings
Family ability to plan
Workload change due to family
meetings
Services composite
Local services for
families
Can find services
Confidence in services for needs
Organizational
climate and
culture
Supervisor competence
Definition
Three-category variable indicating
whether a worker is a supervisor,
program director or trainer; a
coordinator or facilitator; or a case
worker (reference category)
Dichotomous variable indicating
whether a staff person currently
works in a position with FGDM
responsibilities
Dichotomous variable indicating
whether the respondent reported a
caseload > 0
Years in current position
Years of experience in child welfare
Three-category variable
differentiating those who report a
lot, some, or little to no experience
with family meetings (reference
category, combining a little and
none)
Dichotomous variable indicating selfreport of being any ethnicity besides
white, non-Hispanic
Dichotomous variable capturing
those reporting they are very or
completely satisfied with their job
Dalgliesh (2010) scale produces a
single continuous variable reporting
preference for family preservation
vs. child safety
Dichotomous variable indicating that
the respondent agreed or strongly
agreed that families know how to
construct thorough plans for
resolving their issues
Three-category variable indicating a
reported decrease or increase in
workload or no change (reference
category)
Single scale assessing positive
feelings about services that
combines all service ratings
Dichotomous variable indicating the
respondent agreed or strongly
agreed
Mean confidence rating across 25
areas of need
Mean of supervisor skill ratings
(Butler Institute for Families, 2009;
New Mexico Children, Youth, &
Families Department [NMCYFD],
Leadership
Vision, professionalism, and
commitment
State child welfare
agency
State
2011; New York State Social Work
Education Consortium, 2005)
Mean shared/effective leadership
rating (Butler Institute for Families,
2009; NMCYFD, 2011)
Mean perception of coworkers’
shared vision, professionalism, and
commitment (Ellett, 2009)
Three-category indicator of working
in State 1 (reference category), State
2, or State 3
The staff survey included several questions assessing respondent perceptions of the local
service mixture, including ease of finding appropriate services, ease of working with service
providers, and cultural responsiveness of providers. With no clear way to combine what were
highly overlapping items, we tested and compared three different ways of assessing the quality
of the local service mixture. One combined the items capturing confidence in 25 specific local
services to meet individual family needs (e.g., early childhood services, legal services) into a
single scale by meaning the five-point scale item responses that ranged from not at all confident
to completely confident. The second measure was a single item capturing agreement that “I can
usually find services in my community that can help keep children safe in their home,”
dichotomized into agree or strongly agree versus not. The services composite variable combined
these two formulations plus the ratings of ease of working with service providers in the
community and the responsiveness of local services in meeting needs of culturally diverse
groups into a single scale (scored as a mean proportion among items completed) with scores
closer to 1 indicating more enthusiastic endorsement of local services.
Results
Descriptive Statistics
Descriptive statistics for the outcome variable and candidate predictors are presented in
Table 2. Missing data were not a substantial problem for this study. For individual items the
amount of missing data ranged from 1% to 4% of the total sample. Given the low rates of
missingness, data were presumed to be missing at random (MAR). Combining BMA and
multiple imputation is beyond the scope of this paper.
Table 2: Descriptive statistics for outcome and potential predictors
FGC at least very effective in all areas rated (0 – no, 1 –
yes)
Primary job:
Case worker (0 – no, 1 – yes)
Supervisor/ Program Director/trainer (0 – no, 1 - yes)
Coordinator/ facilitator (0 – no, 1 – yes)
Current work area FGDM (0 – no, 1 – yes)
Caseload carried (0 – no, 1 – yes)
Years in position
Years in child welfare
Experience with family meetings:
Little to none (0 – no, 1 – yes)
Some (0 – no, 1 – yes)
A lot (0 – no, 1 – yes)
Minority (0 – no, 1 – yes)
High job satisfaction (0 – no, 1 – yes)
Family preservation proclivity
Family ability to plan (0 – disagree, 1 – agree)
Workload change due to family meetings:
No change (0 – no, 1 – yes)
Decrease (0 – no, 1 – yes)
Increase (0 – no, 1 – yes)
Services composite
Can find services (0 – disagree, 1 – agree)
Confidence in services for needs
Supervisor competence
Leadership
Vision, professionalism, and commitment
Mean
0.313
SD
0.465
Min
0
Max
1
Missing
0
0.698
0.168
0.134
0.164
0.683
2.832
7.103
0.460
0.375
0.341
0.371
0.466
3.329
7.320
0
0
0
0
0
0
0
1
1
1
1
1
23
31
0
0
0
0
0
0
0
0.202
0.267
0.531
0.532
0.511
0.413
0.828
0.402
0.443
0.500
0.500
0.501
0.241
0.378
0
0
0
0
0
0
0
1
1
1
1
1
1
1
0
0
0
10
0
5
0
0.508
0.164
0.328
0.705
0.612
3.326
5.055
4.668
4.913
0.501
0.371
0.470
0.111
0.488
0.718
0.888
0.901
0.701
0
0
0
0.283
0
1.440
1.750
1.467
2.167
1
1
1
1
1
5
6
6
6
0
0
0
4
4
4
2
2
2
Mean
SD
Min
Max
Missing
0.460
0.272
0.487
0
0
0
1
1
1
0
0
0
State:
State 1 0.302
State 2 0.080
State 3 0.618
Note: SD is standard deviation, Min is minimum, and Max is maximum. N = 252.
Modeling Results
We used the BMA package function bic.glm (Raftery et al., 2013), which applies
BMA to generalized linear models, specifying a logistic regression model. The resulting
coefficient estimates reflect the estimated effect on the log odds of agreeing that FGC is very or
completely effective in all areas assessed. The four best models from the initial stage of our
analysis are presented in Table 3. The results reflect much uncertainty as to the correct model.
Raftery (1995) cites posterior odds of 20:1 as reflecting strong evidence, implying an inclusion
probability above 95% (with rounding) provides strong evidence of an effect, while a probability
below 5% implies strong evidence against an effect. While many of our variable inclusion
probabilities strongly rule out potential effects, a number are between 5% and 95%. The highest
inclusion probability—p(β≠0)—is for carrying a caseload, with a 61.7% chance of being in the
“correct” model, which represents positive but not strong evidence (Hoeting et al., 1999).
Table 3: BMA results, with 4 best models
p(β≠0)
EV
SD
Supervisor/PD/trainer
0.003
0.053
Coordinator/facilitator
Current work area FGDM
Caseload carried
Years in position
Years in child welfare
Experience with family
0.005
0.270
-0.516
0.000
0.000
0.079
0.470
0.474
0.006
0.003
Primary job:
model 1
model 2
model 3
model 4
-0.892
-0.840
-0.810
-0.915
0.5
29.8
61.7
1.2
1.1
0.0
p(β≠0)
EV
SD
39.3
1.0
1.7
1.0
0.000
0.000
0.263
0.000
0.009
0.001
0
0
0.378
0.033
0.107
0.041
42.2
17.8
0.224
-0.035
1.860
0.126
0.478
0.176
2.401
0.312
52.0
0.328
0.356
2.9
1.0
0.006
0.000
0.045
0.018
1.7
0.003
0.036
0.000
0.000
0
0
model 1
model 2
model 3
model 4
meetings:
Some
A lot
Minority
High job satisfaction
Family preservation proclivity
Family ability to plan
Workload change due to
family meetings:
Decrease
Increase
Services composite
Can find services
Confidence in services for
needs
Supervisor competence
Leadership
Vision, professionalism, and
commitment
State:
State 2
State 3
BIC
Posterior probability
0.733
0.606
20.5
4.646
0.692
4.533
0.649
0.0
-1121.10 -1121.07 -1120.32 -1119.78
8.5
8.4
5.8
4.4
The inclusion probabilities for the service ratings need to be interpreted in the context of
the high interrelationships among the variables. Examining the pattern of inclusion across all 72
averaged models, a measure of services was absent in only two highly unlikely models. While
not entirely mutually exclusive (they appear together in three highly unlikely models), the
inclusion probabilities for the services composite and the confidence in services across family
needs together indicated relatively strong evidence that one of them should be in the model. In
short, staff considerations of services appeared to affect perceptions that FGC is at least very
effective in addressing family needs.
The weak evidence for the inclusion of the other predictors tested plus the uncertainty
around which measure of services is best resulted in little to distinguish a best model. The
nominal best model had a posterior probability of only 8.5%. Furthermore, the difference in
Bayesian Information Criterion (BIC) values between the best and fourth best models was less
than 2, implying weak evidence in favor of the former (Raftery, 1995) and against the inclusion
of minority status (the only difference between the models).
While primary job was not related to ratings of FGC effectiveness, there was some
evidence that the similar variable of whether or not a worker carries a caseload was a likely
predictor. Carrying a caseload or not implies two very different groups of workers, and some of
the weakness in evidence for predictors might be due to different correlates of effectiveness
ratings for those who regularly work with children and families versus those who do not. To
explore whether the pattern of overall results might be different depending on whether workers
carried a caseload and whether this improved modeling results, we split the sample on the
caseload carrying dichotomy and ran separate BMA analyses on each. The results, not shown,
suggested that carrying a caseload may, in fact, be an important moderating variable in
predicting ratings of effectiveness.
Thus, we created four interaction terms to capture differential effects of four variables
that had very different inclusion probabilities for those with and without a caseload. Specifically,
Caseload×Supervisor and Caseload×Services captured the additional effect of higher supervisor
competence and service competence and availability ratings, respectively, among those carrying
a caseload. Caseload×Minority captured a higher or lower level of agreement that FGCs are
effective among the 35.2% of the sample who were both a member of a minority ethnic group
and carried a caseload. Caseload×Family captured a different intercept for the 55.7% who carried
a caseload and agreed that families are capable of making thorough plans for addressing their
issues. We added these terms to the prior potential predictors and submitted them to BMA.
Again, multiple overlapping potential predictors created much uncertainty in model
selection. As seen in Table 4, no predictor had an inclusion probability above 50%. The best
model had a posterior probability twice that of the second and third best models, but accounted
for only 7.9% of the total likelihood among the 80 averaged models, and the BIC difference
indicated weak preference over any of models 2 through 6 (not shown).
Table 4: BMA results with caseload interactions, with 4 best models
p(β≠0) EV
SD
model 1
Primary job
0.0
Current work area FGDM
13.2
0.113
Caseload carried
24.9
-0.316 0.844
Years in position
0.0
0.000
0
Years in child welfare
0.0
0.000
0
Experience with family meetings
0.0
Minority
39.3
0.326
0.482
High job satisfaction
0.0
0.000
0
Family preservation proclivity
0.0
0.000
0
Family ability to plan
17.1
0.574
1.503 4.546
Workload change due to family
meetings:
20.0
0.218
0.473
Decrease
Increase
model 2
model 3
-0.892
-0.840
model 4
0.326
0.733
0.721
4.646
4.992
-0.033 0.176
Services composite
42.4
1.993
2.520
Can find services
9.0
0.064
0.227
Confidence in services for needs
40.4
0.267
0.354
Supervisor competence
2.4
0.009
0.073
Leadership
0.0
0.000
0
Vision, professionalism, and
commitment
State:
0.0
0.000
0
0.8
0.692
p(β≠0) EV
SD
State 2
0.008
State 3
-0.003 0.056
model 1
model 2
model 3
model 4
0.103
Caseload×Supervisor
10.5
-0.019 0.073
Caseload×Minority
7.8
-0.075 0.301
Caseload×Services
25.8
0.686
Caseload×Family
35.2
-0.786 1.639 -5.013
2.078 5.428
-0.799
BIC
-1122.51 -1121.10 -1121.07 -1120.80
Posterior probability
7.9
3.9
3.9
3.4
The best model included an effect for agreeing that families know how to make thorough
plans to resolve with their issues, the interaction of this variable with caseload carrying status,
and an effect for service ratings only among those carrying a caseload. Put another way, we had
one effect for those who do not carry a caseload, and two apparent effects for those who do.
Together, these three predictors created a model that correctly classified 69.4% of the sample as
agreeing or disagreeing with the effectiveness of FGC. The resulting logistic regression model
(conducted on all cases with complete data for the four variables involved) had a likelihood ratio
test statistic of G2 = 30.674, p < 0.001.
To aid in interpretation of these results, we present the results of this final model
graphically. We simulated the predicted probabilities and confidence intervals accounting for
uncertainty in the model parameters using the R package simcf, utilizing 10,000 draws from
the multivariate normal space defined by the variance-covariance matrix of the resulting
parameters. We conducted this simulation separately from the BMA analysis. The resulting 95%
confidence intervals in Figure 1 reflect uncertainty in the parameters within the chosen model,
but not the uncertainty in choosing this model over others, and thus are biased downwards
(liberally). Figure 1 presents the predicted probability of endorsing effectiveness of FGC
(vertical axis) as a function of the variables in the best model selected above, with the horizontal
axis capturing responses on the services composite variable (better feelings about local services
being closer to 1).
Figure 1: Predicted probabilities of agreeing that FGC is effective in all areas
In summary, results of our model results suggested that among those who did not carry a
caseload, agreeing that families are able to make plans made a large difference in the probability
that the respondent would endorse FGC as effective in all areas. The relatively few respondents
who did not carry a caseload and were more pessimistic about families’ abilities had near 0
predicted probability of endorsing FGC as effective, while their colleagues who were positive
about family ability to construct plans had a 50% probability of endorsing FGC as effective,
compared to a base rate of 31%. In fact, the only group predicted to have a higher probability of
agreeing that FGC is effective—but not significantly so, as indicated by the confidence
intervals—was those who carried a caseload, disagreed about family ability to plan, and had the
most positive regard for local services to aid those families. However, for those with a caseload,
the confidence intervals indicated that agreeing about family ability to construct thorough plans
made little to no difference in the likelihood of agreeing that FGC is effective, and thus may not
be a factor for the case-carrying group.
Judging FGCs to be effective is, of course, related to multiple factors, some of which are
difficult to capture in a brief survey. Nonetheless, three general factors were the strongest
correlates of whether the staff members in this analysis agreed that FGC is effective: Their
reported beliefs about local service competence and availability, their reported beliefs about
families’ capability of constructing plans to resolve child abuse and neglect concerns, and
whether they carried a caseload. The effect of the latter appeared to operate through service
ratings: Those who worked directly with families held FGC in high regard if they held local
services in high regard. Among those who did not work with families every day, feelings about
FGC appeared to depend upon their beliefs about family ability to make plans. Lowest regard for
FGC appeared to be among those who did not carry a caseload and were pessimistic about the
ability of families to plan, and among those who carried a caseload but were pessimistic about
local services to help families.
Discussion
Social workers often serve as the ground-level gatekeepers to services, despite any
policies or mandates on the books. For this reason, gaining a better understanding about child
welfare professionals’ perspectives about FGC effectiveness may help child welfare agency
leaders (administrators, managers, and supervisors) establish protocols and policies that more
fairly disperse the service, coach their workforce on the purpose of the FGC, and remove barriers
to the effective implementation of FGCs.
From a rights-based perspective, the undergirding philosophy of FGCs is that all families
are entitled to the opportunity to be decision makers when children in their family group are
vulnerable. As such, agency staff’s perceptions of the family’s decision-making abilities and
capacities should not be a consideration when making a referral. In addition, given that plans that
emerge from FGCs tend to blend a combination of community, family, formal, and informal
resources (Marsh and Walsh, 1997; Burford and Pennell, 1998; Morris, 2007), the perception of
service competence and availability as an important predictor of FGC effectiveness may reflect
traditional thinking about the efficacy of child welfare interventions. Staff perceptions of formal
services as ineffective or unavailable, while possibly overlooking and undervaluing other family
and community resources that may be leveraged, present an opportunity for those guiding
implementation. Given these findings, it may be helpful to highlight the notion that caseworkers
are unlikely, at least initially, to fully understand the informal resources a family group can
access or how the family group will align a range of resources to meet needs during the FGC
process.
However, the rights-based framework is not the framework employed by the majority of
jurisdictions in the United States, nor elsewhere in the world, despite the growing value placed
on family engagement across child- and family-serving agencies. At this stage, family meeting
models, even those with empirical support, remain a new and sometimes foreign approach,
subject to the discretion of those charged with implementation, particularly those who make
referrals. As a result, a greater understanding of staff attitudes around FGC and how attitudes
might impact referrals and quality implementation is essential for understanding how FGC
affects outcomes for the children, youth, and families receiving them. Implementation research
clearly points to addressing staff characteristics and attitudes as key to successful implementation
of evidence-based practice (e.g., Fixsen et al., 2005; McCrae et al., 2014).
Additional research will examine whether and how staff attitudes influence referrals,
fidelity to the model, and outcomes. This analysis is a first step in understanding what does and
does not influence worker perceptions of the effectiveness of FGCs. Still, if FGCs truly are a
beneficial service and the goal is to maximize their effectiveness by increasing capacity to offer
them to the maximum number of families, then removing barriers to referral should be a goal.
Understanding what worker characteristics facilitate or impede FGC referrals, as understood
through staff perceptions of the practice, is significant to this goal. If judgments of effectiveness
do indeed indicate whether those with a caseload would refer families to FGCs, this research
indicates that those least likely to refer are those with the lowest opinion of the availability and
ability of local services to help those families. Whether these low opinions of services arise from
unavailable services, providers who have proven difficult to work with, or a mix of local
providers with poor results, addressing the service offerings available may be one ingredient to
increasing referrals and improving effectiveness. In addition, child welfare agencies
implementing FGC may benefit from re-tooling all staff on the core philosophies undergirding
FGC as they challenge the entrenched decision making structures.
This paper also demonstrates the use of BMA in an area of research that is limited in its
ability to test hypotheses based on theory or prior findings. BMA is emerging as a statistical
approach in many fields, with limited application in child welfare research to date. It provides a
different lens for analyzing relationships and the resulting models illuminate some relationships
that we may not have uncovered otherwise. The significance of carrying a caseload in
perceptions of effectiveness and the finding that assessments of service quality and availability
appear to affect perceptions of effectiveness provide important touchstones for both
implementers and researchers investigating other aspects of worker attitudes, referrals, fidelity,
or actual child welfare outcomes.
Limitations
While these results shed light on correlates of attitudes towards family group
conferencing, several factors may limit the generalizability of these results. As with other
surveys of perceptions, social desirability and other subjectivity issues may have created
measurement error, attenuating correlations. Lack of variability from ceiling effects and possible
social desirability led us to make certain choices regarding dichotomization of results. These
choices were part of an initial culling of degrees of freedom, for which other analysts might have
made different decisions. Similarly, the choice to dichotomize the outcome was informed, in
part, by a recognition that ordinal regressions require larger samples than binary regressions.
Having more than 252 surveys might have allowed testing an ordered outcome and resulted in
finding a different set of covariates. Item nonresponse reduced the sample size somewhat, and
we have operated here under an assumption of missing at random. One factor that does appear to
be related to item missingness is simple fatigue: The FGC ratings are less than halfway into a
survey that is 8 ½ pages (if printed), and all items with more than one missing value were asked
later in the survey. Finally, the three child welfare jurisdictions represented in our sample may
not be representative of other child welfare agencies, though we did not find that site had an
influence on perceptions of effectiveness in our models.
Despite these limitations, the study examines an understudied but important area of
implementation in jurisdictions that have been operating FGCs for some time. The results
illustrate that worker attitudes about FGCs and their effectiveness are themselves a product of
attitudes towards families, type of work responsibility, and the perception of resources or
services in the external environment. Future investigations that seek to understand the perceived
impact of practices from the worker perspective should consider including these concepts.
Worker characteristics and attitudes, while understudied, likely play a significant role in
determining what services families receive and the quality of those services, and we expect, their
outcomes.
References
Adolph, C. (2013). simcf: Counterfactuals and confidence intervals for estimated models. R
package version 0.2.10. http://faculty.washington.edu/cadolph/index.php?page=60
Burford, G., & Pennell, J. (1998). Family group decision making: After the conference —
progress in resolving violence and promoting well-being: Outcome report (Vol I & II). St.
John's, NF: Memorial University of Newfoundland, School of Social Work. Accessed 28
March 2014 from http://faculty.chass.ncsu.edu/pennell/fgdm/OutcomeReport/index.html
Brown, L. (2007). The Adoption and Implementation of a Service Innovation in a Social Work
Setting – a Case Study of Family Group Conferencing in the UK. Social Policy and Society,
6(3), 321-332.
Brown, L. (2003). Mainstream or margin? The current use of family group conferences in child
welfare practice in the UK. Child and Family Social Work, 8, 331-340.
Butler Institute for Families. (2009). The Western Regional Recruitment & Retention Project:
Final report. Denver: Author.
Crow, G. & Marsh, P. (1997). Family group conferences, partnership and child welfare: A
research report on four pilot projects in England and Wales. Sheffield: University of
Sheffield.
Dalgliesh, L. (2010). Balance of work focus in child welfare. Unpublished manuscript.
DePanfilis, D. (2006). Child neglect: A guide for prevention, assessment, and intervention.
Washington DC: US Department of Health and Human Services Office on Child Abuse and
Neglect.
Doolan, M. (2007). Duty calls: The response of law, policy and practice to participation rights in
child welfare systems. Protecting Children, 22(1), 10-18.
Eigen Kracht. (no date). Eigen Kracht Centrale: What we do (web page). Retrieved August 15,
2014 from http://www.eigen-kracht.nl/en/inhoud/what-we-do
Ellett, A. (2009). Intentions to remain employed in child welfare: The role of human caring, selfefficacy beliefs, and professional organizational culture. Children and Youth Services
Review, 31(1), 78-88. [doi: 10.1016/j.childyouth.2008.07.002].
Gelman, A. (2011). Induction and deduction in Bayesian data analysis. Rationality, Markets and
Morals, 2, 67–78.
Fixsen, D. L., Naoom, S. F., Blase, K. A., Friedman, R. M. & Wallace, F. (2005).
Implementation Research: A Synthesis of the Literature. Tampa, FL: University of South
Florida, Louis de la Parte Florida Mental Health Institute, The National Implementation
Research Network (FMHI Publication #231).
Hoeting, J.A., Madigan, D., Raftery, A.E., & Volinksy, C.T. (1999). Bayesian Model Averaging:
A tutorial. Statistical Science, 14, 382-417.
Huntsman, L. (2006). Family group conferencing in a child welfare context. New South Wales,
Australia: Centre for Parenting & Research.
Kempe Center for the Prevention and Treatment of Child Abuse and Neglect. (2013). Family
group decision making in child welfare: Purpose, values and processes. Version 2. Retrieved
28 March 2014 from
http://www.ucdenver.edu/academics/colleges/medicalschool/departments/pediatrics/subs/can
/FGDM/Documents/FGDM%20Web%20Pages/About%20FGDM/FGDM%20Purpose%20V
alues%20and%20Processes.pdf
Križ, K., & Skivenes, M. (2014). Street-level policy aims of child welfare workers in England,
Norway and the United States: An exploratory study. Children and Youth Services Review,
40, 71-78.
LeCroy and Milligan Associates, Inc. (2003). Family group decision making: Annual evaluation
report. Tucson, AZ: LeCroy & Milligan Associates, Inc.
Lipsky, M. (1980). Street-level bureaucracy: Dilemmas of the individual in public services. New
York: Sage.
Marsh, P., & Walsh, D. (2007, January). Outcomes of family group conferences: More than just
the plan? Retrieved December 9, 2008, from
http://www.kent.gov.uk/NR/rdonlyres/9C8F2763-5B80-4F00-B9A2602DE5AF1DAE/11750/fgcresearchreport.pdf
McCrae, J. S., Scannapieco, M., Leake, R., Potter, C. C., & Menefee, D. (2014). Who's on
board? Child welfare worker reports of buy-in and readiness for organizational change.
Children & Youth Services Review, 37, 28-35. doi:10.1016/j.childyouth.2013.12.001
Ministerial Advisory Committee on a Maori Perspective for the Department of Social Welfare.
(1988). Puao-te-ata-tu (day break). Wellington, New Zealand: Department of Social
Welfare.
Morris, K. (2007). An evaluation of outcomes: Camden family group conference service. LB
Camden.
New Mexico Children, Youth, & Families Department. (2011). New Mexico CYFD – Protective
Services organizational health assessment: Report for protective services staff. Santa Fe:
Author.
New York State Social Work Education Consortium. (2005). Workforce Retention Survey.
Albany: State University of New York, Albany.
R Core Team (2014). R: A language and environment for statistical computing. Vienna, Austria:
R Foundation for Statistical Computing, http://www.R-project.org/.
Raftery, A.E. (1995). Bayesian Model Averaging in social research. Sociological Methodology,
25, 111-163.
Raftery, A., Hoeting, J., Volinsky, C., Painter, I., & Yeung, K.Y. (2013). BMA: Bayesian Model
Averaging. R package version 3.16.2.3. http://CRAN.R-project.org/package=BMA
Sundell, K., Vinnerljung, B., & Ryburn, M. (2001). Social workers’ attitudes towards family
group conferences in Sweden and the UK. Child and Family Social Work, 6, 327-336.
Tobias, J.L., & Li, M. (2004). Returns to schooling and Bayesian Model Averaging: A union of
two literatures. Journal of Economic Surveys, 18, 153-180.
Trotter, C., Sheehan, R., Liddell, M., Strong, D. & Laragy, C. (1999). Evaluation of the
statewide implementation of family group conferencing. Department of Human Services,
Government of Victoria, Melbourne.
Velen, M., & Devine, L. (2005). Use of FGDM with children in care the longest: It’s about time.
Protecting Children, 19 (4), 25-34.