Concurrent Validity and Intertester Reliability of

Concurrent Validity and Intertester Reliability of
Universal and Fluid-based Goniometers for Active
Elbow Range of Motion
MICHELE PETHERICK,
WENDY RHEAULT,
SHARON KIMBLE,
CONNIE LECHNER,
and VIRGINIA SENEAR
The purposes of this study were 1) to establish the concurrent validity of the
universal goniometer and the fluid-based goniometer and 2) to determine the
intertester reliability of these two instruments. A correlational study was performed in which two testers used the universal goniometer and the fluid-based
goniometer in measuring elbow range of motion in 30 healthy subjects. The fluidbased goniometer had high intertester reliability (R = .92), and the standard
goniometer had poor reliability (R = .53). The Pearson product-moment correlation
between the two instruments was .83. A significant difference was shown between the standard goniometer and the fluid-based goniometer by the t test (t =
4.4, df = 28, p < .05). The results support the use of the fluid-based goniometer
between testers on elbow range of motion; however, the two instruments cannot
be used interchangeably.
Key Words: Elbow joint; Physical therapy; Tests and measurements, range of motion;
Upper extremity, elbow.
The goniometer long has been recognized as an essential
tool for clinical physical therapists. Goniometers are used by
physical therapists to assess treatment effectiveness and dis­
ease progression and to document this information to thirdparty payers.1"7 Unfortunately, the homogeneity of this im­
portant assessment tool and the technique associated with its
use have evolved slowly. This problem was addressed by
Moore, who attributed errors to a lack of standardization in
nomenclature, numerical expression, patient position, and
instrumentation.6,8 Hellebrandt et al suggested that rigid con­
trol of these variables would result in greater reliability for
measurement of joint range of motion.3
In 1949, Moore advocated the use of an instrument that is
currently considered the universal (standard) goniometer
(UG).8 This goniometer is a protractor with an extended
M. Petherick, BS, is a physical therapist affiliated with the Manhattan
Psychiatric Center-Ward's Island, 600 E 125th St, New York, NY 10035.
W. Rheault, MA, is Chairman and Associate Professor, Department of
Physical Therapy, University of Health Sciences-The Chicago Medical School,
3333 Green Bay Rd, Bldg 51, North Chicago, IL 60064 (USA).
S. Kimble, BS, is Physical Therapist, Shalom Home Inc, 1554 Midway Pkwy,
St. Paul, MN 55108.
C. Lechner, BS, is Staff Physical Therapist, Paulsen Rehabilitation Center,
Willowbrook, IL 60439.
V. Senear, BS, is Staff Physical Therapist, Virginia Mason Hospital, Depart­
ment of Physical Medicine and Rehabilitation, 925 Seneca St, Seattle, WA
98101.
Ms. Petherick, Ms. Kimble, Ms. Lechner, and Ms. Senear were senior
students in the physical therapy program, University of Health Sciences-The
Chicago Medical School, at the time this study was conducted.
Address all correspondence to Mrs. Rheault.
This article was submitted February 13, 1987; was with the authors for
revision 17 weeks; and was accepted September 8, 1987. Potential Conflict of
Interest: 4.
966
stationary arm and a fulcrum-mounted moveable arm. Tape
measures and trigonometry, in addition to pendulum, bubble,
and "over-the-joint" goniometers, have also been used to
measure joint ROM.9-11 These other measuring tools, how­
ever, have not gained wide clinical acceptance. A fluid-based
type of goniometer (FG), which works on the principle of a
carpenter's level, was recently introduced. The fluid in this
type of instrument is contained in a circular chamber and is
displayed by straight-plane movement in gravity. Although
manufacturers of the FG have provided no literature on the
instrument, Clarke and associates have described the use of a
type of FG for the glenohumeral joint,12,13 and other authors
have used it to measure cervical ROM.10
For a measurement instrument to be useful, its reliability
and validity must be established. Goniometric measurement
has been found to have greater intratester reliability than
estimation by observation.4 High intratester reliability has
been found by several authors in both clinical and research
settings when using the UG.1,7 Boone et al have found that
for a given motion no significant intratester variation exists
in three repetitions of the same motion.1 Rothstein and asso­
ciates found that intratester reliability for elbow and knee
joint ROM measurements was very high (r = .91-.99).7 The
results of that study support the finding of Boone et al1 that
multiple measurements do not increase intratester reliability.
These findings, however, disagree with those of Low, who
contends that the average of multiple measurements leads to
greater accuracy and reliability.4
In general, past research has found intertester reliability to
be less than intratester reliability.1,9,11 Hamilton and Lachenbruch measured finger joint ROM and found significant
PHYSICAL THERAPY
RESEARCH
variance in mean measurements taken by several therapists
over a period of time.11 Rothstein et al studied intertester
reliability of elbow and knee joint ROM measurements and
found Pearson product-moment correlations of .88 to .97 for
knee and elbow flexion and elbow extension but correlations
of only .63 to .70 for knee extension.7 Fitzgerald et al found
that intertester reliability was .88, .76, and .91 for spinal
extension, right lateral spinal flexion, and left lateral spinal
flexion, respectively.14 Boone et al found less intertester vari­
ation in three upper extremity motions (r = .86) than for
lower extremity motions (r = .58).1 They found that the
average standard deviation for upper extremity motions was
2 degrees, but for elbow flexion was 3.7 degrees. They con­
cluded that a change of greater than 5 degrees in the upper
extremities and greater than 6 degrees in the lower extremities
is necessary to determine improvement in ROM when more
than one tester measures the same motion.1
High validity of the measuring instrument is required for
the tool to provide meaningful results. Burdett and associates,
in a study measuring lumbar spine movements in healthy
subjects, found low validity correlations between four meas­
urement instruments and roentgenographic measurements.15
Gogia et al, however, found high Pearson product-moment
correlation coefficients (r = .97-.98) and intraclass correla­
tion coefficients (ICCs) (ICC = .98-.99) between goniometric
measurements and roentgenograms taken of the knee.16 They
concluded that the goniometer was a valid instrument for
measuring ROM of the knee joint. Enwemeka also reported
no significant difference between goniometric and roentgen­
ographic measurements of the knee, except in the first 15
degrees of ROM.17
Because little literature is available on the FG, we felt that
a study exploring the validity and reliability of this instrument
was necessary. The purposes of this study were 1) to establish
the concurrent validity of the UG and the FG and 2) to
determine intertester reliability of these two instruments in
measuring elbow joint ROM. We hypothesized 1) that no
significant difference would exist between the UG and the FG
in measuring elbow flexion ROM and 2) that the intertester
reliability of these two instruments would be high.
METHOD
Subjects
Subjects were 10 male and 20 female volunteers with no
previous history of musculoskeletal or neurological problems
of the right upper extremity (Tab. 1). The mean age of the
subjects was 24 years (s = 4.2 years). All subjects were
informed of the testing procedure, and a signed statement of
consent was obtained from each subject.
Age
Height (m)
Weight (kg)
Right arm girth (cm)a
Right arm length (cm)b
s
24.00
1.68
63.62
26.40
74.26
Two goniometers were used for comparison of their validity
and intertester reliability, a UG and a FG.* The UG was a
full-circle plastic goniometer with 25.4-cm moveable arms,
marked in 1-degree increments.
The FG had afluid-filledchamber that responded to posi­
tion changes in space. It had a 360-degree scale divided into
1-degree increments and was read from the bottom of the
fluid meniscus. The FG weighed about 11 g, had a 0.7-cm
base, and was 10 cm in height. Each goniometer was calibrated
before each testing session to known angles of 0, 90, and 180
degrees as determined from a standard protractor.
Procedure
Subjects were placed supine in the anatomical position on
a treatment table. The subject's right arm was positioned with
a 5.08-cm thick towel under the distal humerus and with the
forearm off the edge of the plinth. This arm position allowed
full elbow extension ROM. The right shoulder was stabilized
with a 2.3-kg weight over the deltopectoral groove to prevent
shoulder protraction. The subject's right arm was exposed for
easy identification of bony landmarks.
The UG was aligned according to the procedure outlined
by Norkin and White.18 The stationary arm of the goniometer
paralleled the lateral midline of the humerus, pointing at the
acromion process. The goniometer's moveable arm was
aligned with the lateral midline of the radius and the styloid
process. The lateral epicondyle was used to center the fulcrum
of the goniometer.
The FG was placed on the dorsal aspect of the subject's
forearm, 6 cm distal to the olecranon process. The scale of
the liquid level was set at 0 degrees when the limb was at the
starting position of full extension. The scale was read when
the subject's elbow was in full flexion.
Two testers (S.K. and C.L.) each took three measurements
on every subject with both instruments. This testing procedure
was followed because the literature disagrees on whether
multiple measurements lead to greater intratester reliability.
The order of measurement was randomized to eliminate
sampling bias. The researchers practiced the measurement
procedure before the experiment to ensure testing reliability.
Subjects activelyflexedand extended their right elbow five
times before thefirstmeasurement to ensure constant ROM
throughout the testing session. The goniometer was aligned
with the arm in full extension. The subjects then actively
flexed their right elbow through the full ROM, avoiding
forceful contraction. The full arc of motion was recorded by
separate researchers (M.P. and V.S.) to blind the testers to
previous measurements and thus prevent tester bias. All data
were collected in the same laboratory at the university.
Data Analysis
TABLE 1
Physical Characteristics of Subjects (N = 30)
Characteristic
Instrumentation
4.20
.08
11.25
3.60
4.45
Two statistical methods were used to assess the concurrent
validity of the goniometers. We used a Pearson productmoment correlation to determine the covariance of measure­
ments taken with both goniometers. Because the correlation
coefficient can be high even if large differences exist between
covarying paired measurements, the paired-data t test was
also used.19 This method measured the agreement between
a
Girth measured 10 cm proximal to olecranon process.
Arm length measured from acromion process to distal end of
third digit.
b
Volume 68 / Number 6, June 1988
Chattanooga Corp, 101 Memorial Dr, PO Box 4287, Chattanooga, TN
37405.
967
testers to establish the concurrent validity of the goniometers.
To compute these statistics, we used the mean of all six
measurements per subject taken with each instrument.
Separating the data by instrument and tester, we calculated
the mean of each subject's measurements. These values were
used to compute the ICC (formula 2,3), which was used to
determine the intertester reliability of each instrument.519 The
subjects acted as their own control, thus increasing the sensitivity of the experiment.19 The use of ICCs allowed the determination of the source of variability between and within
testers. All statistical tests were two-tailed at an alpha level
of .05.
RESULTS
The Pearson product-moment correlation between the two
goniometers equaled .83, which is significant at an alpha level
of .05 (Tab. 2). The paired-data t test revealed a significant
difference between the two instruments (t = 4.4, p < .05)
(Tab. 2). Intertester reliability was determined by the ICC as
.53 for the UG and .92 for the FG (Tab. 3).
DISCUSSION
The results of this study, although conducted on a small
group of subjects, showed high correlation between the FG
and the UG. Because the instruments were measuring the
same joint angle, a high correlation would be expected. The
instruments were not shown to be concurrently valid by the t
test.
The intertester reliability was found to be higher with the
FG than with the UG. Our results do not support those of
Rothstein and associates, who found high intertester reliability
(r = .95 for elbow flexion and extension) with the UG.7
Twelve testers took measurements in their study. Because
standard deviations and degrees of freedom were not published with their study results, we could not determine how
their statistics were calculated. The subjects in their study
included patients with elbow disorders, which produced wider
variability in the data. Statistically, this variability, if large,
will produce higher correlation values. In our study, subjects
with normal elbow mobility were measured, producing a
a
b
s
SEM
Universal
145.8
6.3
1.2
Fluid-based
149.4
7.9
1.5
ra
t
.83
4.4b
CONCLUSION
This study compared the UG and the FG and found that
concurrent validity did not exist between these two instruments. It also addressed intertester reliability for the two
instruments and found that the FG had greater intertester
reliability than the UG. Although the UG appears to be a
mechanically sound measurement instrument, even the use
of a standardized testing procedure allowed excessive variability between therapists. Although more research is needed,
we foresee potential usefulness of the FG for improving
goniometric measurement in physical therapy.
REFERENCES
1. Boone DC, Azen SP, Lin C-M, et al: Reliability of goniometric measurements. Phys Ther 58:1355-1360, 1978
2. Loessin Grohmann JE: Comparison of two methods of goniometry. Phys
Ther 63:922-925, 1983
3. Hellebrandt FA, Duvall EN, Moore ML: The measurement of joint motion:
Part 3. Reliability of goniometry. Phys Ther Rev 29:302-307, 1949
4. Low JL: The reliability of joint measurement. Physiotherapy 62:277-229,
1976
5. Miller PJ: Assessment of joint motion. In Rothstein JM (ed): Clinics in
Physical Therapy: Measurement in Physical Therapy. New York, NY,
Churchill Livingstone Inc. 1985, vol 7, pp 103-136
6. Moore ML: The measurement of joint motion: Part I. Introductory review
of the literature. Phys Ther Rev 29:195-205, 1949
7. Rothstein JM, Miller PJ, Roettger RF: Goniometric reliability in a clinical
setting: Elbow and knee measurements. Phys Ther 63:1611-1615, 1983
TABLE 2
Elbow Joint Range-of-Motion Measurements Taken with
Universal and Fluid-based Goniometers
Goniometer
restricted statistical range, which leads to attenuated correlation values.
We found the FG to be more reliable than the UG, perhaps
because of its simple technique of use. The UG, in contrast
to the FG, requires careful and sustained alignment to bony
landmarks during movement. Other advantages of the FG
include its light weight, small size, and speed of application.
Its ease of application allows the therapist to obtain ROM
measurements with one hand, leaving the other hand free to
passively move the patient's extremity.
The FG, however, does have several limitations. It must be
used perpendicular to the gravitational field, which necessitates frequent patient position changes. Its size precludes use
on smaller joints such as the interphalangeal joints. Another
disadvantage is that the FG measures only total excursion as
opposed to movement from neutral. This factor may account
for the significant difference we found between the two measurement instruments with the t test. Because the FG measures
only total movement, it cannot specify the part of the ROM
where movement occurs. Lastly, the cost of the FG is about
three times that of the UG.
Future research involving the FG is necessary to fully
understand its clinical applicability. This research could be in
the areas of 1) protocol establishment for standardization, 2)
determining reliability for other joints, and 3) clinical trials.
Further study of both the FG and the UG is required to
determine their validity.
Pearson product-moment correlation.
p < .05 using the paired-data f test.
TABLE 3
Intertester Reliability of Universal and Fluid-based Goniometers
Universal Goniometer
Fluid-based Goniometer
Source
df
Between subjects
29
SS
2292.9
MS
R
79.0
df
SS
MS
29
2651.7
125.9
30
155.0
5.2
.92
.53
Within subjects
968
30
711.1
23.7
R
PHYSICAL THERAPY
RESEARCH
8. Moore ML: The measurement of joint motion: Part 2. The technic of
goniometry. Phys Ther Rev 29:256-264, 1949
9. Hsieh C-Y, Walker JM, Gillis K: Straight-leg-raising test: Comparison of
three instruments. Phys Ther 63:1429-1433, 1983
10. Buck CA, Dameron FB, Dow JM, et al: Study of normal range of motion in
the neck utilizing a bubble goniometer. Arch Phys Med Rehabil 40:390392, 1959
11. Hamilton GF, Lachenbruch PA: Reliability of goniometers in assessing
finger joint angle. Phys Ther 49:465-469, 1969
12. Clarke GR, Willis LA, Fish WW, et al: Preliminary studies in measuring
range of motion in normal and painful stiff shoulders. Rheumatol Rehabil
14:39-46,1975
13. Clarke GR: Measurement in shoulder problems. Rheumatol Rehabil
15:191-193,1976
Volume 68 /Number 6, June 1988
14. Fitzgerald GK, Wynveen KJ, Rheault W, et al: Objective assessment with
establishment of normal values for lumbar spinal range of motion. Phys
Ther 63:1776-1781, 1983
15. Burdett RG, Brown KE, Fall MP: Reliability and validity of four instruments
for measuring lumbar spine and pelvic positions. Phys Ther 66:677-684,
1986
16. Gogia PP, Braatz JH, Rose SJ, et al: Reliability and validity of goniometric
measurements at the knee. Phys Ther 67:192-195, 1987
17. Enwemeka CS: Radiographic verification of knee goniometry. Scand J
Rehabil Med 18:47-50, 1986
18. Norkin CC, White DJ: Measurement of Joint Motion: A Guide to Goniometry. Philadelphia, PA, F A Davis Co, 1985, p 38
19. Currier DP: Elements of Research in Physical Therapy, ed 2. Baltimore,
MD, Williams & Wilkins, 1984, pp 163-164,171, 230
969