Identifying Sleep Apnea from Self

Sleep
11(5):430-436, Raven Press, Ltd., New York
© 1988 Association of Professional Sleep Societies
Identifying Sleep Apnea from Self-Reports
*tLinda E. Kapuniai,*David J. Andrew, *:j:David H. Crowell, and
*§James W. Pearce
*Sleep Disorders Center of the Pacific,
Straub Clinic and Hospital, and the t School of Public
Health, the :f:Department of Psychology and Pacific Biomedical Research Center, and the
§lohn A. Burns School of Medicine, University of Hawaii, Honolulu, Hawaii, U.S.A.
.~
Summary: An apnea score (AS) was developed as a potential screening tool for
sleep apnea. This was based on self-report questionnaire responses of 76 sleep
disorder center patients and 20 sleep survey volunteers. Twenty volunteers
and 23 patients (group I) comprised the initial AS development group. Their
questionnaire responses were compared to polysomnographic apnea indexes
(AI) and apnea plus hypopnea indexes (AHI). Stepwise multivariate discriminant analysis was used to test whether or not selected group I questionnaire
responses could be used to correctly classify respondents into apnea (AI or
AHI >5) or nonapnea (AI, AHI ~5) groups. Self-reports of "stops breathing
during sleep," "loud snoring," and history of adenoidectomy best discriminated normal (AI ~5) from apnea (AI >5) cases. The AS derived from group
I responses to these three variables was then computed for group II (n = 53).
Mter examination of the AS results, the AS was modified to include just
"stops breathing" and "loud snoring" and the AI criterion was raised to 10 per
hour. This revised AS correctly identified 100% of the cases with moderatesevere sleep apnea (AI or AHI >40) and 70-76% of all sleep apnea cases with
AI or AHI >5. Predictive accuracy was 88% for AI > 10. The two questions
that comprise the AS should be incorporated into risk appraisal instruments or
interviews to screen for sleep apnea. Key Words: Sleep apnea-Self-report
score-Prevalence-Sensitivity-Specificity.
There is a growing body of literature suggesting that the prevalence of sleep apnea
may be greatly underestimated for adults in industrialized nations 0-10). In 1981,
Carskadon and Dement (7) found some degree of sleep apnea in 37.5% of 24 asymptomatic elderly subjects ". . . selected for good health and absence of sleep
complaints." Lavie (5) has reported sleep apnea in 0.89% of presumably normal men in
the Israeli work force and has suggested that this be considered only a minimum
estimate of probable cases. In an ambulatory screening study (10), sleep apnea was
detected in 20% of an unselected sample of male outpatients, and in 16% of male power
plant employees, a rate higher than previously expected.
Data from sleep disorder centers have demonstrated that the prevalence of sleep
Accepted for publication March 1988.
Address correspondence and reprint requests to Dr. Linda E. Kapuniai at Sleep Disorders Center of the
Pacific, Straub Clinic and Hospital, Inc., 888 S. King Street, Honolulu, HI 96813, U.S.A.
430
I
SELF-REPORTS AND SLEEP APNEA
431
apnea appears to increase with age, a fact with significant public health implications
given the rapidly increasing proportion of elderly in countries such as the United States.
Although it appears to occur most frequently in men after 40 years of age, the prevalence in women after menopause is thought to approximate that found in men. There is
no method, apart from polysomnography, however, for determining the prevalence of
sleep apnea in the general population.
Given the potentially adverse effects of sleep apnea on cardiopulmonary functioning
and daytime performance, possible negative drug interactions, and the high cost of
polysomnography, an inexpensive, reliable, and valid self-report sleep apnea instrument would be a valuable screening tool. Such an instrument could yield prevalence
estimates from various sectors of the adult population to aid in planning preventive
health care and in identifying cases for subsequent diagnostic studies.
The purpose of this preliminary study was to assess the predictive accuracy of
self-reported factors known to be present in adults with polysomnographically defined
sleep apnea. In this study, combinations of self-reported characteristics were evaluated
in terms of their association with the polysomnographic apnea index (AI) and apnea
plus hypopnea index (AHI). Those characteristics most significantly associated with AI
and AHI were formed into an apnea score (AS). The AS then was tested in a second
group of subjects for sensitivity, specificity, and predictive accuracy.
METHODS
Design
In this exploratory study, responses to selected items on the Hawaii Sleep Questionnaire (HSQ) (11) from 20 adult survey respondents (11) and 23 sleep disorder center
(SDC) patients (group I) were compared with Als and AHls derived from nocturnal
polysomnography (NPSG) conducted after HSQ completion. An AS was developed
from the results of these comparisons and applied to a second group (group II, n = 53)
of adult SDC patients who also had completed the HSQ and had AI and AHI results.
Group II, therefore, served as a test of the AS as a predictor of sleep apnea.
Subjects
Twenty survey cases plus 23 SDC patients comprised the original study sample,
group I (n = 43); 53 additional SDC patients formed the application sample, group II.
The SDC cases were consecutive patients who had completed a HSQ followed by an
all-night PSG. These patients were selected without regard to complaint or subsequent
diagnosis. Group I included 25 men and 18 women; group II had 42 men and 11 women.
Ages ranged from 36 to 69 years for group I (mean 50, SD 13.6) and 17-78 years (mean
46; SD 14.2) for group II. Educational level was 14 (±3.9) years of school completed in
group I and 14 (±2.8) years in group II. Ethnic group distributions for groups I and II,
respectively, were: white (44%, 38%), Oriental (28%, 30%), Hawaiian-part-Hawaiian
(21%, 19%), and other (7%, 13%). Groups I and II comparisons revealed no significant
differences in age, education, income, ethnicity, or body mass index (BMI) (12). Group
II had a significantly higher percentage of men (79%) than group I (58%), X2 (df = 1) =
4.06 (p = 0.04).
Data Acquisition
Self-report
The HSQ is a general sleep history questionnaire that includes questions on characteristics known to be present in sleep apnea patients. Apnea-associated questions inSleep, Vol. 11, No.5, 1988
432
L. E. KAPUNIAI ET AL.
clude frequency graded in terms of never, rarely, sometimes, often, or always of (a)
stopping breathing during sleep, (b) loud snoring, and (c) waking from sleep gasping or
short of breath. It also includes questions on sex, age, height, weight, and sleep history.
All respondents were asked to indicate if they had had a tonsillectomy or adenoidectomy.
Poiysomnography
A single NPSG was obtained for all group I and II cases. Apneas and hypopneas of
10 s or greater duration were identified by observation of air flow measured by right and
left nasal and oral thermocouples and by rib cage and abdominal inductive plethysmography. (Some of the earlier studies used a single abdominal respi-cuff or thoracic and
abdominal strain gauges.) The same technician scored all the polysomnograms.
Data analysis
In order to obtain a clinically useful AS, data analysis was carried out in four phases.
Phase I: Mter the NPSG, group I cases were divided into apnea and nonapnea subgroups; Phase II: group I AS development; Phase III: group II AS application; and
Phase IV: post hoc evaluation. In Phase II, step-wise discriminant analyses (13) were
conducted to ascertain the combination of selected HSQ responses that best discriminated group I apnea or nonapnea subgroups established in Phase I. The most discriminating subset of variables was examined to identify those variables best suited to
computing an AS. In Phase III, the resultant AS was computed for an independent
group II sample. The degree to which the AS correctly identified sleep apnea and
nonapnea cases was summarized in terms of specificity, sensitivity, and predictive
accuracy. Phase IV consisted of post hoc analyses of the group II AS and NPSG
indexes. Details of these data analysis phases are as follows.
Phase I
Prior to the analysis of the HSQ responses, AI (14) and AHI (14) were calculated for
each individual using a computer-generated NPSG report program. Apnea-nonapnea
comparison groups were formed for both the AI and AHI. An apnea case was defined
by an AI >5 or AHI >5. Although five apneas per hour may not be considered indicative of clinically significant sleep apnea, it is a common cutoff criterion reported in
apnea-nonapnea studies (6,15-17).
Phase II
As there were only 13 group I cases with AI >5, the number of self-report variables
that could be included as discriminating variables in the step-wise discriminant analysis
was limited. The HSQ items selected were chosen on the basis of clinical judgment.
These variables, along with the values entered in the discriminant analysis, were loud
snoring and stops breathing during sleep (never, rarely = 1, sometimes, often, always
= 2); wakes gasping or short of breath (never = 1, rarely = 2, sometimes = 3, often
= 4, always = 5), adenoidectomy (1 = yes, 2 = no), and BMI.
It is important to note that excessive daytime sleepiness and gender were not among
those selected. A preliminary correlational analysis of self-report responses with AI
and AHI values revealed that excessive daytime sleepiness was not significantly related
to AI and AHI; and therefore, it was not included in the set selected for this study. Even
though male gender was expected to be highly associated with sleep apnea, it was felt
Sleep, Vol. 11, No.5, 1988
SELF-REPORTS AND SLEEP APNEA
433
that inclusion of gender in a screening instrument would prejudice against detection of
sleep apnea in women.
Two separate step-wise discriminant analyses were conducted: first, comparing
cases with AI ~5 (nonapnea) and >5 (apnea); second, comparing cases with AHI ~5
(nonapnea plus hypopnea) and >5 (apnea plus hypopnea) cases. The combinations of
discriminating variables consistently yielding the highest associations with AI and AHI
would be used to formulate the AS. Mter the AS set of variables was selected, a
"positive for apnea" response (e.g., stops breathing sometimes, often, or always) had
to be present on each variable selected in order to designate a respondent as a probable
sleep apnea case.
Phase III
The AS developed from group I data was determined for each group II case. Then,
all group II cases were classified by AS into apnea or nonapnea subgroups. AS sensitivity and specificity were calculated. In addition, predictive accuracy was computed
for presence and for absence of sleep apnea. Predictive accuracy for sleep apnea was
calculated as the number of cases with AI (or AHI) >5 divided by the number of cases
with AS positive for sleep apnea. The accuracy of predicting nonapnea cases was
calculated as the number of cases with AI (or AHI) ~5 divided by the number of cases
with AS indicative of no sleep apnea.
RESULTS
AS development
The order of the step-wise selection of significantly discriminating variables was as
follows: (a) for AI, stops breathing, adenoidectomy, BMI, and wakes gasping or short
of breath, X2 (df = 4) = 13.90 (p = 0.008) and (b) for AHI, stops breathing, loud
snoring, and adenoidectomy, X2 (df = 4) = 12.20 (p = 0.007). These results defined the
criteria for selecting the variables for the AS: (a) Two of the three variables from the
AHI analysis-stops breathing and adenoidectomy-were chosen because they also
appeared in the AI subset. (b) Even though loud snoring only appeared in the AHI
subset, it was chosen because of its strorig clinical correlation with sleep apnea. (c) BMI
and wakes gasping or short of breath were discarded from the AS computation for
several reasons. First, both appeared only in the AI subset of discriminating variables,
and second, BMI requires additional computational steps that might deter use in a
clinical setting. Finally, (d) waking gasping or short of breath was eliminated because
it may be a symptom of other disorders, such as gastroesophageal reflux or asthma. In
summary, responses to these three variables, stops breathing, loud snoring, and adenoidectomy, constituted the AS.
With the variables identified, the actual AS computation was as follows. A value of
1 was assigned to each of the following: (a) a response of stops breathing sometimes,
often, or always, (b) a response of loud snoring sometimes, often, or always, and (c) a
report of no history of adenoidectomy. To be considered a probable sleep apnea case,
responses had to yield an AS of 3, that is, meet all three criteria. Any other combination
of responses (a score of less than 3) would place a respondent in the nonapnea group.
AS score application in group II
As a test of the AS utility as a clinical screening tool, the AS was determined for each
group II subject. AS specificity in relation to the AI was 67%, and for AHI, it was 69%;
Sleep, Vol, 11, No, 5,1988
434
L. E. KAPUNIAI ET AL.
sensitivity was 61% for AI and 59% for AHI (Table lA). Predictive accuracy of AS was
69% for the nonapnea cases (AI ~5 criteiion) and 58% for sleep apnea cases (.LA... I >5).
For AHI, 67% of the sleep apnea cases were identified correctly, as were 62% of the
nonapnea cases.
In the Phase IV post hoc evaluation, other combinations of group II AS and polysomnographic criteria were examined to determine whether or not improved screening
outcomes could be revealed. First, adenoidectomy was reported by only 26% of group
II cases. In light of this, the AS was recomputed without adenoidectomy, that is, only
stops breathing and loud snoring were included. In this revised AS, a score of 2 indicated a probable apnea case. When adenoidectomy was deleted, sensitivity increased
from 61% to 76% for AI >5, (Table lB), but specificity remained about the same: 65%
instead of 67% (a difference of one case). For AHI, specificity decreased from 69% to
65% and sensitivity increased to 70%.
Using this revised AS, the next modification was to raise the polysomnographic
criterion from 5 to 10/h, which is more in line with current clinical judgment for clinically significant amounts of sleep apnea. This change resulted in substantial improvements in sensitivity for both AI (83%) and AHI (78%) (Table lC). For those group II
cases with AI >20, 83% were identified by the AS as probable sleep apnea cases; for
AHI > 20, the figure was 81 %. When a criterion of 40 apneas or apneas plus hypopneas
per hour was imposed, 100% of these moderate-severe sleep apnea cases were in the
AS = 2 group. The criterion of lO/h appeared to yield the best all-around results.
DISCUSSION
The purpose of this study was to evaluate how selected self-report responses in the
form of an AS could be used to screen for sleep apnea. The AS, based on responses of
stops breathing and loud snoring sometimes, often, or always on the Hawaii Sleep
Questionnaire, was 100% effective in identifying sleep apnea cases with AI or AHI
>40. Adults with the revised AS of <2 could be expected to have little or no significant
sleep apnea (AI or AHI ~10) 80-88% of the time.
The accuracy of a screening instrument can be evaluated in two ways. First, negative
(nonapnea) cases should be excluded from further assessment, and second, positive
TABLE 1. Group II (n = 53) sensitivity, specificity, and predictive accuracy for (A) AI and
AHI ",;;;5 or >5 based on AS derived from self-reports of loud snoring, breathing cessation
during sleep and adenoidectomy, (B) loud snoring and breathing cessation only, (C) AI and
AHI > 10 for criteria in (B)
Apnea score
Predictive accuracy
Sensitivity
Specificity
Apnea
No apnea
AI
AHI
61%
59%
67%
69%
58%
67%
69%
62%
AI
AHI
76%
70%
65%
65%
61%
68%
76%
68%
AI
AHI
83%
78%
63%
67%
54%
64%
88%
80%
A
B
C
Sleep, Vol.ll, No.5, 1988
SELF-REPORTS AND SLEEP APNEA
435
cases should be flagged for further diagnostic evaluation. Based on the above results,
the AS was most effective when a sleep apnea criterion of at least 10/h was used. At this
level, 83% (AI) and 78% (AHI) of actual sleep apnea cases would have been flagged for
further evaluation. However, the AS also resulted in a substantial number of false
positives. Conversely, the AS yielded few false negatives; that is, the revised AS
accurately identified cases with AI ~ 10 88% of the time and AHI ~ 10 80% of the time.
Few cases with AI or AHI >20 and none with indexes >40 were placed in the nonapnea
category. These results were based on a simple yes-no additive formula from responses
to these two questions: When you are sleeping how often do you do the following or
someone has told you you do the following: (a) stop breathing, (b) snore loudly.
The AS was not as discriminating at lower levels of apnea frequency, but was very
accurate at higher levels. An AS of 2 could alert the clinician to. consider the potential
presence of sleep apnea and possible need for polysomnography. These cases also
could be followed for weight gain, excessive daytime sleepiness, or other indications
that sleep apnea might be present. Polysomnographic evidence of clinically significant
sleep apnea would be required to make a definite diagnosis.
With regard to the results obtained in this study, there were several factors that might
have contributed to the rather low AS specificity: (a) Levels of apnea signified by
indexes less than 10 may be too close to normal amounts of sleep apnea to be detected
in a cursory screen; (b) reliance on a single NPSG for the AI and AHI; and (c) lack of
direct observation.
1. AS specificity was not sufficient to warrant use of the AS in general population
surveys because of the large number of false-positives. These individuals might not be
available for further evaluation, and therefore, accurate estimates of sleep apnea prevalence could not be obtained. In a clinical setting, however, the AS could playa role in
preventive education. Cases with positive AS (including false-positives) could receive
information about snoring and sleep apnea as they relate to factors such as weight gain,
hypertension, or otolaryngologic abnormalities. At the same time, additional questioning could be pursued relative to presence of excessive sleepiness or sleeping partner
reports of apneic episodes. As apnea indexes of > 5 continue to serve as standards for
sleep apnea (6,15-17), it is important to persevere and develop a score that would be
reliable at various levels of sleep apnea.
2. Reliance on a single NPSG to define a sleep apnea case has been challenged
(15,16). In one study of NPSGs on 2 consecutive nights (15), AI varied over nights to
the extent that 15% of their sample would have been misdiagnosed using only first night
indexes >5. This degree of error could have inflated the false-positive rate in this study.
Subtracting 15% from the AI specificity of 37% and AHI rate of 33% (Table lC) would
reduce the error to 22% and 18%, respectively, which is still on the high side but
substantially better than chance.
3. The absence of direct reports from sleeping partners may have reduced the probability of accurate reporting of snoring and breathing cessation. Although it may be
preferable to include a sleeping partner's evaluation, this requirement would place
other restrictions on who could be screened. Taken together, however, these three
factors may account for some of the predictive error in this study population.
With the current availability of effective treatments for obstructive sleep apnea, such
as continuous positive airway pressure and uvulopalatopharyngoplasty, unnecessary
morbidity and mortality could be avoided through more education and earlier diagnoses. Currently, most health risk appraisals include one or two questions on sleep
Sleep, Vol. /1, No.5, 1988
436
L. E. KAPUNIAI ET AL.
difficulties, but these questions are not an adequate screen for a condition that seems
to be present in large numbers of adults. The resuits from this study iend quantitative
support to clinically observed signs associated with sleep apnea. Inclusion of the AS in
clinical histories and risk appraisals would be an important and simple step toward
preventive health education and early diagnosis of sleep apnea.
Acknowledgment: This research was supported in part by the Leahi Trust and the George F.
Straub Trust. Access to clinic visitors for the survey was made possible by Straub Clinic physicians, the Kaiser Permanente Medical Group, and Kaiser Foundation Hospital of Honolulu.
Computer and other facilities were made available by the School of Medicine and the International Center for Health Promotion and Disease Prevention, School of Public Health, University
of Hawaii.
REFERENCES
1. Block A, Boysen P, Wynne J, Hunt L. Sleep apnea, hypopnea and oxygen desaturation in normal
subjects. N Engl 1 Med 1979;300:513-7.
2. Coleman RM, Roffwarg H, Kennedy SJ, et al. Sleep-wake disorders based on a polysomnographic
diagnosis, a national cooperative study. lAMA 1982;247:997-1003.
3. Guilleminault C, Van de Hoed J, Mitler M. Clinical overview of the sleep apnea syndromes. In: Guilleminault C, Dement WC, eds. Sleep apnea syndromes. New York: Alan R. Liss, 1981:1-12.
4. Krieger J, Turlot J, Mangin P, Kurtz D. Breathing during sleep in normal young and elderly subjects:
hypopneas, apneas, and correlated factors. Sleep 1983;6:108-120.
5. Lavie P. Incidence of sleep apnea in a presumably healthy working population. A significant relationship
with excessive daytime sleepiness. Sleep 1983;6:312-318.
6. Ancoli-Israel S, Kripke DF, Mason WJ, Gabriel S, Kaplan O. Sleep apnea and PMS in a randomly
selected elderly population: final prevalence results. Sleep Res 1986;15:101.
7. Carskadon MA, Dement WC. Respiration during sleep in the aged human. 1 GerontoI1981;36:420-423.
8. Coleman RM, Miles LE, Guilleminault C, Zarcone VP, Van den Hoed J, Dement WC. Sleep-wake
disorders in the elderly: a polysomnographic analysis. lAm Geriatr Soc 1981 ;29:289-296.
9. Guilleminault C, Dement WC. Sleep apnea syndromes and related sleep disorders. In: Williams RL,
Karacan I, eds. Sleep disorders diagnosis and treatment. New York: Wiley, 1978:9-78.
10. Peter JH, Fuchs E, Kohler U, et al. Studies in the prevalence of sleep apnea activity (SAA): evaluation
of ambulatory screening results. Eur 1 Respir Dis 1986;146:451-458.
11. Kapuniai LE. Psychosocial factors associated with physician utilization for sleep problems. Unpublished
doctoral dissertation. Honolulu: University of Hawaii, 1984.
12. Keys A, FidanzaF, Karvonen MJ, Kimura N, Taylor HL. Indices of relative weight and obesity. 1 Chron
Dis 1972;25:322-343.
13. SPSS Inc. SPSSX user's guide (ed 2). Chicago, IL: McGraw-Hill, 1986.
14. Bornstein SK. Respiratory monitoring during sleep: polysomnography. In: Guilleminault C, ed. Sleeping
and waking disorders: indications and techniques. Menlo Park, CA: Addison-Wesley, 1982:183-212.
15. Kramer M, Silva C. Night to night variability of apnea. Sleep Res 1986;15:138.
16. Dickel MJ, Sassin J, Mosko S. Sleep disorders in an aged population: preliminary findings of a longitudinal study. Sleep Res 1986;15: 116.
17. Hoch CC, Reynolds CF, Stack JA, et al. Sleep apnea and psychopathology in geriatric patients. Sleep
Res 1986;15: 130.
Sleep, Vol. JI, No.5, 1988