Sample Redesign for the FDIC`s Asset Valuation Review

Papers presented at the ICES-III, June 18-21, 2007, Montreal, Quebec, Canada
Sample Redesign for the FDIC’s Asset Valuation Review
David W. Chapman
Federal Deposit Insurance Corporation
Abstract
When an FDIC-insured financial institution is in danger
of failing, the FDIC has to assess the value of the
institution’s loan portfolio in a short period of time, as
part of an Asset Valuation Review (AVR). Based on the
AVR, price tags are given to various loan pools (types),
which are used to sell the bank’s loans to other
institutions, if the bank fails. Because of the large number
of loans in most portfolios, sampling must be used to
estimate the value of the loan pools. The basic design is a
stratified random sample, with strata defined by size (loan
book value). The definition of strata (including a certainty
stratum) and the derivation of the total and stratum
sample sizes have to be automated because of the limited
time available for the AVR. The current design is being
revised to improve sampling efficiency. The challenges
and features of the revised design will be discussed.
Keywords: Optimum Allocation, Stratified Sampling,
Certainty Selections
1.
Introduction
The Federal Deposit Insurance Corporation (FDIC) has
the responsibility of insuring deposits in FDIC-chartered
banks and savings associations (hereafter referred
collectively to as banks) for amounts up to statutory
limits, which is $100,000 for most standard checking and
savings accounts. In this capacity, the FDIC tracks the
financial soundness of banks so that it can be prepared to
take appropriate action when a bank is in danger of
failing. In such instances, if steps cannot be taken to
prevent a bank failure, the FDIC, as the receiver of the
banks assets, will attempt to sell the banks assets,
including the loan portfolio, to other institutions, as
quickly as possible.
To sell off the loan portfolio of a failing bank, an
important step is to assess the value of the loans, by loan
pool (type), and to put price tags on each loan pool. This
is done quickly by applying a customized computer
program to select a probability sample of loans from each
loan pool, and assessing the value of each sample loan.
From the sample valuations, an unbiased (or nearly
unbiased) estimate of the value of each loan pool (a price
tag) is derived. The sample is designed in such a way that
the value of each loan pool is estimated with a specific
level of precision (e.g., to within ± 10% of the true pool
value with 95% confidence). The process of selecting a
sample of loans, reviewing and assessing the value of
each sample laon, and pricing loan pools, is referred to as
the Asset Valuation Review (AVR).
The AVR process has to be done very quickly, often in a
few days. For over ten years the FDIC has been using a
computer program referred to as RAVEN (Risk Analysis
and Value EstimatioN) to design and select a probability
sample of a bank’s loan portfolio to be valued for the
AVR. At this time the FDIC is in the process of
converting RAVEN, which uses FoxPro software, to a
program that uses more current software. As part of this
process, the statistical methodology underlying RAVEN
is being reviewed with the intention of improving
sampling efficiency, which could save FDIC resources for
conducting AVRs. This paper describes the process of
reviewing and revising the AVR sampling methodology.
The next section provides a summary of the current
sampling methodology incorporated into the RAVEN
software. Section 3 presents recommended changes in the
AVR sampling methodology. Section 4 summarizes the
evaluation of some of the recommended changes in the
AVR sampling methodology. The last section presents
the final recommendations for revisions of the sampling
methodology for the AVR, and includes suggestions for
additional research.
2.
Current Sample Methodology Used in RAVEN
As mentioned above, RAVEN is a software package that
the FDIC uses to select a probability sample of the loans
held by a bank for the purpose of conducting an Asset
Valuation Review (AVR). RAVEN is also used to
provide estimates of the value of a bank’s loan pools,
based on sample loan valuations; but the focus of this
discussion is on the sampling methods applied in
RAVEN. The current sample design for RAVEN is a
stratified random sample. The strata are defined by loan
type (referred to as pools), and loan size, as measured by
book value (current unpaid loan balance). The same loan
pool definitions are used at all banks, though a given bank
may not contain loans in all the loan pools defined.
Prior to defining strata and deriving sample sizes, a cutoff
book value is derived for the purpose of removing the
830
Papers presented at the ICES-III, June 18-21, 2007, Montreal, Quebec, Canada
smaller loans from the probability sampling process. In
particular, the cutoff is determined by sorting the loans
from smallest to largest and identifying the largest loan
such that the sum of the book values for all the loans
smaller than it do not exceed 10% of the book value of the
entire loan portfolio. The book value of the loan
identified by this process is the cutoff. All loans with
book values less than the cutoff are excluded from the
sampling process, and are valued by applying an average
recovery rate to the book value of each of these small
loans in the pool. This average recovery rate is computed
as the sum of the estimated recoveries of each of the
sample loans divided by the sum of the book values of the
sample loans.
This method of valuing loans in the bottom 10%, based
on the recovery rates of sample loans, introduces some
bias in the estimates of pool values. But the FDIC has
done some research to determine that this method is a
worthwhile bias/cost tradeoff—that the resources saved
by excluding these small loans is worth the introduction
of some bias in estimating pool values. There is an
exception to the rule for excluding loans in the bottom
10% of book values of the bank loans: If all of the loans
in a pool (or nearly all as determined by the loan
reviewer) have book values less than the cutoff value, the
entire loan pool is included in the probability sampling
process.
After removing the loans with book values less than the
cutoff, each loan pool that contains two or more
remaining loans is subdivided into two subpools, large
and small. This split is made using a procedure that
minimizes the total sample for the pool, based on meeting
a target level of precision for each of the two subpools for
estimating total book value.
To summarize this
procedure, let N represent the total number of loans in the
pool (aside from those with book values below the 10%
cutoff), and N1 and N2 (where N=N1+N2) represent the
number of loans in the large and small subpools. The
program considers 101 possible splits into large and small
subpools. For the first split, all of the loans are placed
into the small subpool (i.e., N1=0 and N2=N). The
remaining 100 splits are defined by successively
increasing N1 by 1% of N (rounded to the nearest integer).
For each of the 101 possible splits into the large and small
subpools, the required sample size is derived to estimate
the total book value (as a proxy for recovery value) for
each subpool for a specified level of precision. The level
of precision includes a specification for the relative
precision for a specified confidence level (like ± 10% at
the 95% confidence level). The calculation of the
minimum sample size needed to meet the precision target
for estimating the total book value for a subpool is based
on standard textbook formulas for deriving sample sizes,
given the subpool mean book value and variance that are
calculated from the subpool book values. The program
allows for three alternate levels of precision:
(1) High: Estimate the total book value of the subpool to
within ± 10% with 95% confidence.
(2) Medium: Estimate the total book value of the
subpool to within ± 15% with 90% confidence.
(3) Low: Estimate the total book value of the subpool to
within ± 20% with 80% confidence.
For each of the 101 splits, the required sample sizes for
the two subpools (n1 and n2) are calculated and added
together (n=n1+n2). The split for which n is smallest is
used to define the subpools. In the case of ties, the split
corresponding to the smallest value of N1 is used.
3.
Recommended Changes for the Sampling
Methods Used for the AVR
There are a number of areas for which improvements in
the sampling methodology can be made. First, since there
is a high correlation between book value and recovery
value for most loan pools, and since the distribution of
book values is highly skewed for most pools, it would
improve sampling efficiency to select the very largest
loans with certainty. There is no allowance for certainty
selections in the current RAVEN sampling methodology.
Regarding noncertainty selections, larger loans should be
selected with higher probabilities than smaller loans. This
is accomplished to some degree with the iterative method
of defining the large and small sampling strata used in
RAVEN, but the procedure is not specifically targeted to
minimizing the sample size to meet a precision target at
the pool level; instead it is focused on meeting a precision
target for each stratum (subpool). The two approaches are
not the same. For optimum (Neyman) allocation, the
stratum sample sizes are proportional to the product of the
number of loans in a stratum and the standard deviation of
the book values of the loans in the stratum. For the
methodology used in RAVEN the stratum sample sizes
are proportional variance of the loan book values in the
stratum (aside from the effect of the finite population
corrections). The total sample size and sample allocation
to strata should be focused on estimates at the pool level.
The iterative procedure used in RAVEN to define strata
will not, in general, produce optimum stratum definitions
because of the focus on meeting precision targets for the
two subpools, rather than the entire pool. Cochran (1977,
pp. 127-131) discusses a number of approximate methods
for defining optimum stratum boundaries. One approach,
due to Dalenius and Gurney (1951) is to equalize, across
all strata, the product of the proportion of loans, Wh, in a
stratum and the standard deviation of the book values in
831
Papers presented at the ICES-III, June 18-21, 2007, Montreal, Quebec, Canada
the stratum, Sh. (This approach would equalize the
stratum sample sizes.) Another effective approach, due to
Ekman (1959) is to equalize the product, across strata, of
Wh and the stratum interval width, (yh - yh-1) across strata.
These are both effective methods, but require an iterative
process. Another approach described by Cochran (1977)
for defining stratum boundaries, that is used quite often in
applications, is the “Cum Sq. Root of f Rule,” due to
Dalenious and Hodges (1959). However, this approach is
defined primarily for grouped data.
size strata and stratified random sampling is
recommended. The specific methods investigated for
defining strata are discussed in the next section.
Finally, for including various levels of precision to choose
from in the sampling software (low, medium, and high), it
may be confusing to vary both the confidence level and
the amount of tolerable across the precision options. An
approach that fixes one of these precision dimensions
(i.e., the confidence level) would be easier to understand.
(4) Total Sample Size. Because of the focus in RAVEN
of meeting precision targets at the subpool level, the
resulting pool sample size will be larger than
necessary to meet the precision target for estimating
the value of the loan pool. It would be much better to
derive the required sample size based on the use of
optimum allocation to strata for estimating the total
value of the loan pool. This can be done using a
formula from Cochran (1977, p.105) for the sample
size as a function of the target variance of the
estimator (which is derived from the desired level of
precision), the stratum variances, and the number of
units in each stratum.
Based on the preceding discussion, following are several
recommended changes to the AVR sampling
methodology that should improve the efficiency of the
procedure.
(5) Precision Levels. The following three levels of
precision are recommended for choices for AVR
sampling, with the confidence level being fixed at
95%:
(1) Certainty Selections. Select the largest loans with
certainty. Since probability-proportional-to-size
(PPS) sampling is not being recommended for the
redesign for AVR sampling (as discussed later in this
section), there is no obvious way to define certainty
selections. Two criteria were tested, based on
coverage of a given percent of total book value of a
loan pool (10% and 15%). The investigation of these
two options is discussed in the next section.
High: Estimate the total book value to within ± 5%
(with 95% confidence)
(2) Number of Strata. The RAVEN software defines
two strata for each loan pool. Up to a certain point,
precision can be improved when more strata are
added. However, many of the loan pools are small,
some containing fewer than 25 loans, after removing
the loans in the bottom 10% of total book value.
Also, since ratio estimation is used to estimate the
total value of a loan pool, with book value as the
covariate, the gains from additional size stratification
may be minimal. Therefore, it is recommended that
either two or three strata be defined within each loan
pool for non-certainty selections.
(3) Stratum Definitions. A common method used to
oversample larger units (larger loans in this
applications) is probability-proportional-to-size (PPS)
sampling. However, since ratio estimation is being
used to estimate loan pool values, most of the gains
from PPS sampling can be achieved by defining size
strata (in terms of book values), using random
sampling within each size stratum, and allocating the
sample to strata based on optimum allocation
methods. Therefore, this simpler approach of using
Medium: Estimate the total book value to within
± 10% (with 95% confidence)
Low: Estimate the total book value to within ± 20%
(with 95% confidence)
4.
Investigation of the Recommended Changes in the
Sampling Methods Used for the AVR
For the first three recommended changes listed in the
previous section, various approaches were applied to
sampling bank loan portfolios for three AVRs that have
been done. The sample sizes and other features of the
revised AVR sample design were compared for various
alternatives and to those for the AVR design based on
RAVEN. In all cases, the precision target used was to
estimate the total book value (as a proxy for recovery
value) of a loan pool to within ± 10% with 95%
confidence (the medium precision level of the three listed
in the previous section.) The results from these
comparisons are discussed below.
(1) Certainty Selections. As mentioned in the previous
section, since PPS sampling is not being used, there
is no straightforward way to identify loans that are
large enough to be selected with certainty. Another
approach is for the researcher to simply observe the
size distribution of the populations units, and identify
those that appear to stand out from the rest. For AVR
sampling, it would not be prudent to ask the asset
832
Papers presented at the ICES-III, June 18-21, 2007, Montreal, Quebec, Canada
review specialists to make such decisions, especially
with the severe shortage of time.
The goal is to define the two or three noncertainty
strata in such a way that the product, WhSh, is
reasonably constant across the three strata. Two
straightforward methods, that would not involve
making iterations, were tested on the AVR data. One
was to define strata my equalizing, to the extent
possible, the book value coverage in each stratum.
The other was to equalize the coverage of the square
root of book values in each stratum.
Therefore, it is recommended that the loans to be
selected with certainty are those that make up a
specific percent of the total book value of the
portfolio. Both a 10% criterion and a 15% criterion
were applied. It turned out for many loan pools that
the 10% criterion did not identify as certainty
selections one or two rather large loans that perhaps
should have been included. Therefore, the 15%
criterion is recommended.
Because of the skewness of the book value
distribution, there was concern that equalizing book
value coverage across the three strata would put too
few loans in the large stratum. Therefore, the
approach of equalizing the coverage of the square
roots of book values was investigated first. However,
equalizing the coverage of the square root of book
values placed too many assets in the large stratum,
and for a majority of pools, led to a considerably
higher sample allocation to the large stratum.
In one instance with a large loan portfolio, the 15%
criterion identified a large number of certainties. In
that case, many of the loans in the certainty group did
not particular stand out as a large loan.
Consequently, it is recommended that the number of
certainty selections for a loan pool be limited to five.
(2) Number of Strata. Revised sample designs were
prepared for most all loan pools in the available AVR
loan portfolios, defining both two and three
noncertainty strata. In most cases, the use of three
strata provided better sample efficiency (i.e., a lower
total sample size to meet the precision target) than
did the use of two strata. However, because of the
requirement of a minimum of two units per stratum,
defining three strata for small loan pools was not
beneficial. Therefore, it is recommended that only
two noncertainty strata be defined for loan pools that
contain 20 loans or less. Furthermore, for pools with
five or fewer loans, all loans should be selected with
certainty.
For all other pools, defining three
noncertainty strata is recommended.
(3) Stratum Definitoins. As stated in the previous
section, the research into constructing strata focused
on defining either two or three noncertainty strata.
The objective was to define strata for which the
optimum sample sizes for the three strata would be
nearly equal (i.e., that the product of the proportion
of loans in a stratum, Wh, and the stratum standard
deviation of book values, Sh, would be nearly the
same across the three strata.
This could be done using an iterative procedure that
derives the product, WhSh, for all possible splits into
the designated number of strata (two or three) for the
noncertainty loans in a pool. However, the gains
from defining the strata in this way may not be worth
the additional complexity associated with the
iterative procedure. Although this assessment has not
been researched, it is recommended that a more
straightforward procedure be used to define strata.
The approach of equalizing the book coverage across
the three strata worked better for defining strata for
most loan pools, and is therefore the recommended
method for defining strata. However, for both
approaches, the middle stratum often had the lowest
sample allocation because loans were often more
variable at the low end of the distribution than in the
middle.
5.
Summary of the Recommended Redesign for AVR
Sampling and Recommendations for Future
Research
Following is a summary of the features of the
recommended revised sample design for the AVR.
For each loan pool, the
(1) Certainty Selections.
largest loans that cover the top 15% of the book value
will be selected with certainty, with a limit of five
certainty selections.
(2) Number of Noncertainty Strata. For each pool
containing five or fewer loans (once the loans in the
bottom 10% of book values are removed), all of these
will be selected with certainty. For each loan pool
containing six to 20 loans, two noncertainty strata
will be defined. For pools containing more than 20
loans, three noncertainty strata will be defined.
(3) Strata Definitions. For each loan pool, noncertainty
strata will be defined using the criterion that the two
or three strata will constitute approximately equal
percentages of the book values of the noncertainty
loans in the pool. To determine stratum boundaries,
833
Papers presented at the ICES-III, June 18-21, 2007, Montreal, Quebec, Canada
the noncertainty loans in a pool will be sorted from
highest to lowest, and cumulative percents of dollar
book value coverage will be computed for each loan
on the list. Stratum breaks will be made that equalize
the coverage of book values in the strata.
(4) Total Sample Size and Allocation to Strata.
For each loan pool, the total noncertainty sample
size will be derived as the minimum sample size
needed to achieve the precision target for
estimating the total book value (as a proxy for
recovery value) of the loan pool. The precision
target will be specified as either high, medium,
or low, as defined in item (5) in Section 3. The
sample size for each noncertainty stratum will be
computed using standard Neyman allocation.
There are some areas for future research, as follows:
•
Improvements of the method of defining
certainties should be investigated.
•
A number of alternate methods of defining strata
could be considered, including the use of an
iterative procedure to equalize the product of Wh
and Sh across strata.
•
The possibility of basing the total sample size
and sample allocation to strata on the ratio
estimate of recovery rates should be investigated.
There may be enough past AVR data to make
this approach feasible.
•
The level of bias introduced by leaving out the
bottom 10% of book values, and associated costs
of including these small loans in the probability
sampling process, should be reviewed.
References
Cochran, W.G. (1977). Sampling Techniques, 3rd ed.
John Wiley, New York, NY.
Dalenius, T. and Gurney, M. (1951). “The Problem of
Optimum Stratification.” II.Skand. Akt., 34, 133-148.
Dalenius, T. and Hodges, J. L. (1959). “Minimum
Variance Stratification.” JASA, 54, pp. 88-101.
Ekman, G. (1959). “An Approximation Useful in
Univariate Stratification.” Annals of Mathematical
Statistics, 30, pp. 219-229.
834