Training Slow Speaking Rate to Treat Motor Speech Disorders

Training Slow Speaking Rate to
Treat Motor Speech Disorders
Authors: Laura Clinton
Greg Turner, Ph.D., CCC-SLP
Robert de Jonge, Ph.D., CCC-A
INTRODUCTION
Motor speech disorders (dysarthria and apraxia of speech)
result from neurological impairment in the areas of the brain
responsible for motor planning, motor programming,
neuromuscular control, and execution of speech; and they
account for 41% of all acquired communication disorders
(Yorkston, Beukelman, Strand, & Bell, 1999; Duffy, 2005).
Dysarthria can cause a reduction in speaking rate, abnormal
prosodic features, and significant intelligibility deficits
resulting in a reduced ability to communicate successfully
(Ansel & Kent, 1992; Kent, 1992; Duffy, 2005).
Apraxia of speech is commonly characterized by groping,
inconsistent articulation errors, especially in longer utterances,
and increased errors in repetition based tasks; therefore, the
apraxic speaker exhibits problems of speech initiation, speech
sequencing, speech selection, reduced rate of speech,
disturbances in articulation and rhythm, abnormal prosody,
and decreased intelligibility (Dworkin, 1991; Chapey, 1994;
McNeil, 1997; Yorkston, et al., 1999; Wambaugh & Nessler,
2004).
Treatment of motor speech disorders may consist of
behavioral, medical, and assistive treatment techniques.
(Duffy, 2005).
One behavioral technique to improve communication is
slowing speaking rate.
Duffy (2005) states, “Rate may be the most powerful single,
behaviorally modifiable variable for improving intelligibility”
(p. 479).
Purpose
Slowing speaking rate has been found to be an effective
therapy technique for both dysarthria and apraxia of speech
(Simmons, 1978; Yorkston and Beukelman, 1981; Yorkston et
al., 1999). One of the outcomes of slowing speaking rate is an
improvement in speech intelligibility (Southwood, 1987;
Wambaugh and Martinez, 2000); however, few studies have
documented the effectiveness of methods for the control of
speaking rate in individuals with motor speech disorders. The
purpose of this study is to evaluate a training program aimed
at slowing speaking rate for read sentences in an individual
with both dysarthria and apraxia of speech through the use of
visual, auditory, and verbal feedback.
METHODS
Participant
The participant in this investigation was a forty-nine-year-old
male who has developmental speech and language deficits (see
Table 1). He reported that though he had the communication
disorder since childhood, he received no speech remediation
services during the years of his formal education.
The participant was diagnosed with mild-moderate apraxia of
speech and dysarthria. The participant had not received
speech therapy for two years prior to this study.
Table 1: Assessment Results
Measure
Result
Apraxia Battery for Adults-2nd ed.
mild & moderate
(majority of ratings)
Ross Information Processing Assessment-2nd ed.
severe rating
immediate memory
recall of general information
spatial orientation
marked rating
recent memory
temporal orientation (remote memory)
moderate rating
temporal orientation (recent memory)
organization
problem solving and abstract reasoning
mild rating
auditory processing and retention
orientation to environment
Frenchay Dysarthric Assessment
most impaired areas
jaw, tongue, intelligibility
Western Aphasia Battery
Aphasia Quotient (100 possible)
86.6
language deficits
writing, reading, word fluency
Phonetic Intelligibility Test
word intelligibility score
69%
Experimental Design
The experimental design used in this study consisted of a
single-subject ABAB research design. The first A phase is the
baseline phase and was continued until speaking rate stability
was achieved. Stability was defined as average sentence
duration of the 18 stimulus sentences falling within +/-5% of
each other over three consecutive baseline sessions. The first
B phase consists of the treatment phase where the participant
was trained to reduce speaking rate. The second A phase
represents the withdrawal phase. During this phase the
participant did not receive treatment. Reinstatement of
treatment represents the second B phase and was completed in
the same way as the first treatment phase. Sentence duration
and speech intelligibility was acquired on a weekly basis
during the last three phases and every other day during the
first phase.
Choosing Target Speaking Rate
The speaking rate that maximized the participant’s
intelligibility was used as the target rate for this study. To find
this rate, the participant produced the 18 stimulus sentences
(each seven syllables in length) at his habitual rate and at 90%,
80%, 70%, 60%, and 50% of the habitual rate.
The participant was presented with a prerecorded auditory model of
each of the 18 sentences at a targeted sentence duration associated
with the targeted speaking rate (e.g., 90%). The intensity
waveform of each model was presented both auditorily and visually
to the participant through the use of the Real-Time Pitch program
within the Sona-Speech II, model 3650 program (KayPentax,
2006). In addition, the participant was trained to use the visual
feedback of his own sentence productions from the Real-Time
Pitch program to match the model production.
The investigator put two vertical lines on both the top and
bottom panels of the Real-Time Pitch screen indicating the
target duration range (i.e. mean plus and minus five percent
plus reaction time). The participant was asked to produce the
different target rates by slowing articulation time and not
increasing pause time. The participant produced each sentence
until one production fell within the target duration range. This
resulted in a total of 108 sentence productions (18 sentences X
6 speaking rates).
Three listeners listened to a digitized recording of each of the
18 sentences for a given speaking rate and orthographically
transcribed each sentence. The speaking rate judged as
having the highest intelligibility rating was 60% of the
participant’s habitual speaking rate.
Treatment
The treatment phase of this study focused strictly on slowing
rate of the nine training sentences to a target durational range
of 4.39-4.85 seconds. The target sentence duration range
represents +/- 5% of the average duration of the 60% target
speaking rate. This range was chosen based on a study by
Hyland and Weismer (1988).
The participant’s target duration range was changed from +/5% to +/- 7.5% of the participant’s habitual speaking rate
during week six of the first treatment phase. This was done
based on the observation of the experimenter that the
participant was consistently short of the smaller target range
and was unable to make progress.
For each sentence produced by the participant the
investigator first displayed and played one of her
prerecorded sentences in the upper panel of the computer
screen and noted how the end of the time history of her
sentence ended between the vertical lines indicating target
duration range. The participant was directed to the lower
screen containing the same set of vertical lines. When the
participant was prepared, he was asked, “Are you ready”
by the instructor, he replied “Yes” and the instructor then
said, “Begin” and began recording at that time while the
participant then produced the same sentence. The sentence
was recorded and displayed as an intensity time history
trace in the lower panel of the computer screen. This was
repeated three times for each sentence regardless of
accuracy.
The participant received verbal, auditory, and visual feedback
after his production. Verbal feedback consisted of knowledge
of performance and knowledge of results.
Treatment sessions consisted of distributed practice and
random practice of the training sentences within and across
treatment sessions, with the focus on slowing rate by
increasing articulation time and not by inserting pauses.
Treatment continued until the participant reached the target
speaking rate of 60% of his habitual rate with 80% accuracy
over three consecutive treatment sessions. After reaching the
criterion, the withdrawal phase began.
The Second B Phase was conducted in the same way as the
First B Phase. The last three phases of the study continued for
seven weeks.
One time each week, throughout the treatment phase,
withdrawal of treatment phases, and reinstatement of treatment
phase, both the trained and untrained sentences were recorded
in a single walled sound booth. One time a week, on random
days, the participant was tape recorded using the Tascam DAT
tape recorder and Crown CM311A head mounted microphone
while spontaneously producing the 9 trained sentences and the
9 untrained sentences to evaluate generalization.
Each tape recorded sentence was digitized at 44.1 kHz via the
TF32 program (Milenkovic, 2002) and displayed on the screen
in the form of a sound pressure waveform. Cursors were
placed at the beginning and end of the sentence for
measurement of sentence duration.
In order to collect weekly intelligibility measures, the digitized
sentences were normalized to -3 dB and presented to three
listeners through a customized software program developed in
Revolution 2.8.1 (www.runrev.com). The PC-based program
used Quick Time to play .wav files through the computer’s
sound card. The listeners had the opportunity to listen to each
sentence twice (Yorkston & Beukelman, 1981). Each listener
listened to the sentences through loudspeakers at a self-chosen
most comfortable listening level. The listeners
orthographically transcribed the sentences on a keyboard. The
investigator calculated the percent correct (total words
correctly identified) for each of the three listener’s
transcription for the trained and untrained sentences and
calculated an average percent correct across the three listeners.
The intelligibility data will not be presented in this poster.
RESULTS
Accuracy Results Within the Treatment Program
Accuracy data was collected during the treatment phases (Refer
to Figure 1).
The first treatment phase was terminated after the participant
achieved 80% accuracy of the target duration/speaking rate (60%
of his habitual) for three consecutive sessions. It took seven
weeks to meet exit criterion during the first treatment phase.
The withdrawal phase and second treatment phase also
continued for seven weeks in order to equalize the effects of
extraneous variables (McReynolds and Kearns, 1983).
During the second treatment phase the participant achieved
criterion by the 4th session and maintained an average of 93%
accuracy after criterion was achieved.
Duration Results of Trained Sentences
Results indicate the participant reached a stable average
sentence duration/speaking rate of 2844 ms during baseline
measurements (final 3 data collection points) of the trained
sentences (Refer to Figure 2).
Sentence duration was variable during the first treatment
phase for trained sentences. It ranged from 2698-3967 ms
The participant was not successful in slowing sentence
production to his goal of 60% of his habitual speaking rate.
The participant, however, was able to slow speaking rate to a
maximum of 70% of his habitual speaking rate.
Results indicate the participant was able to slow his habitual
speaking rate (increase sentence duration) in 6 out of seven
sessions during the first treatment phase.
Withdrawal data indicates the participant produced a stable
sentence duration ranging from 2962-3117 ms compared to
range of 2698-3967 ms for the first treatment phase,
indicating an increase in speaking rate.
The data collected during the second treatment phase was
also variable. The client was able to alter speaking rate;
however, he was not successful in achieving his goal of 60%
of habitual speaking rate. Average sentence duration in the
second treatment phase ranged from 2967-3975 ms
indicating an increase in sentence duration relative to the
withdrawal phase. Results indicate the participant was able
to slow his speaking rate relative to his average habitual
speaking rate for the baseline phase in all 7 of the data
collection sessions during the second treatment phase.
Duration Results of Untrained Sentences
Duration results of the untrained sentences were similar to the
results of the trained sentences (Refer to Figure 3).
The participant’s average habitual speaking rate for the untrained
sentences was determined to be 2620 ms during baseline
measurements.
The data collected during the first treatment phase indicated that
the participant was able to slow his speech, but unable to reach a
slowed rate of 60% of his habitual speaking rate. Sentence
duration in the first treatment phase was variable and ranged from
2754-3829 ms. The participant was only able to reach a maximum
slowed speaking of 70% of his habitual speaking rate relative to
average habitual speaking rate for the baseline phase. The
participant slowed his speaking rate (increased sentence duration)
in all 7 data collection sessions in the first treatment phase.
Withdrawal data indicates the participant produced a stable
sentence duration range of 2908-3136 ms compared to range
of 2754-3829 ms for the first treatment phase, indicating an
increase in speaking rate.
The data collected during the second treatment phase was also
variable. The client was able to alter speaking rate; however,
he was not successful in achieving his goal of 60% of habitual
speaking rate. Average sentence duration in the second
treatment phase ranged from 2698-3843 ms, indicating an
increase in sentence duration relative to the withdrawal phase.
The participant was able to slow his speaking rate relative to
average habitual speaking rate for the baseline phase, and
increase sentence duration in all 7 of the data collection
sessions in the second treatment phase.
CONCLUSIONS
The participant was able to alter his speaking rate and produce
the trained sentences at 60% of his habitual speaking rate with
80% accuracy during therapy, but was unable to transfer the
new speaking rate to data collection outside of the therapy
room. That is, the participant’s performance during data
collection did not reflect performance during treatment.
However, on average the participant did slow his habitual
speaking rate for both treatment phases relative to the baseline
and withdrawal phase.
Speakers with AOS do not consistently adjust speaking rate
for sentences when instructed to speak at given rates (Robin,
Bean and Folkins, 1989). This was true for the present
speaker and was the reason for expanding the target range.
Perhaps the client would have produced a slower speaking rate
if he was told specifically during data collection to reduce his
speech to exactly what he had practiced during the treatment
session. This was attempted after the treatment program had
concluded and verbal instruction by itself was found to be
unsuccessful.
Another possibility for lack of generalization of the new
speaking rate outside of he therapy room may be due to the
participant’s deficits in the area of memory as indicated in the
results of the Ross Information Processing Assessment (See
Table 1).
The participant’s ability to generalize was observed to be
dependent on the time of data collection within a session (i.e.,
beginning or end of session). If data was collected at the
beginning of the therapy session, speaking rate was faster than
if it was collected at the end of the therapy session. These
findings support the influence of the participant’s memory
deficits on his ability to generalize.
A final reason for the lack of generalization may be due to the
fact that the generalization task involved removal of all cues
(i.e., visual, auditory and verbal). Articulatory methods for
treatment of AOS focus on gradual removal of cueing
(Yorkston et al. 1999). A more programmatic decrease in
cueing may have lead to better generalization of rate.
Future Research
Replication of this study should be completed with different
participants exhibiting motor speech disorders.
The evaluation of the effectiveness of different feedback
modes (i.e., visual, verbal, and auditory feedback) on
acquisition and generalization of a slowed speaking rate
should be documented.
Investigation of the interaction between different modes of
feedback in the training of a slowed speaking rate along with
participant characteristics should be addressed.
Selected References:
Ansel, B. M., & Kent, R. D. (1992). Acoustic-Phonetic Contrasts and
intelligibility in the dysarthria associated with mixed cerebral palsy.
Journal of Speech and Hearing Research, 35, 296-308.
Chapey, R., Ed. (1994). Language intervention strategies in adult aphasia
(3rd ed.). Brooklyn, NY: Williams & Wilkins.
Dworkin, J. P. (1991). Motor speech disorders: a treatment guide. St.
Louis, MO: Mosby.
Duffy, J. R. (1995). Motor speech disorders: substrates, differential
diagnosis, and management. St. Louis, MO: Elsevier.
Kent, R. D. (1992). Intelligibility in speech disorders: Theory,
measurement, and management. Philadelphia, PA: John
Benjamins Publishing.
McNeil, M. R. (1997) Clinical management of sensorimotor
speech disorders. New York, NY: Theime.
Simmons, N. N. (1978). Finger counting as an intersystemic
reorganizer in apraxia of speech. Clinical Aphasiology
Conference Proceedings,174-179.
Southwood, H. (1987). The use of prolonged speech in the
treatment of apaxia of speech. Clinical Aphasiology, 15, 277-287.
Wambaugh, J. L. & Martinez, A. L. (2000). Effects of rate and rhythm
control treatment on consonant production accuracy in apraxia of speech.
Aphasiology, 14,851-871.
Wambaugh, J. L. & Nessler, C. (2004). Modification of sound
production treatment for apraxia of speech: acquisition and
generalization effects. Aphasiology, 18,407-427.
Yorkston, K. M., & Beukelman, D. R. (1981b). Ataxic dysarthria:
Treatment sequences based on intelligibility and prosodic considerations.
Journal of Speech and Hearing Disorders, 46, 398-404.
Yorkston, K. M., Beukelman, D. R., Strand, E. A., & Bell, K. R. (1999).
Management of motor speech disorders in children and adults (2nd ed).
Austin, TX: ProEd.