Touch-Triggered Withdrawal Reflexes for Safer Robots

Touch-Triggered Withdrawal Reflexes for Safer Robots
Torbjørn S. Dahl, Erick A. R. Swere and Andrew Palmer
torbjorn.dahl|Erick.swere|[email protected]
Cognitive Robotics Research Centre
University of Wales, Newport
Abstract
Covering robots with a skin-like sensor enables a range of skills that will
improve human-robot interaction (HRI) both in terms of safety and
communication. In this chapter we review the current state of robot safety
and current understanding of human reflexes, in particular the human
nociceptive withdrawal reflex (NWR), i.e., the reflex that causes withdrawal
from noxious, including painful, stimuli. We present and analyse the results
of an experiment capturing human NWR motion data and also present a
model developed to use the results to implement a similar reflex on the Nao
humanoid robot. Finally we present the idea of protective reflexes as robot
middleware where we define middleware as a set of generic skills
independent of application area.
1. Withdrawal Reflexes and Robot Safety
The he work presented here is part of a wider research context.
The
ROBOSKIN project is funded by the European Commission’s Framework
Programme 7 (FP7) and its activities are centred on new sensor technologies
that can cover large areas of a robot’s body with a touch sensitive material,
i.e., robot skin. The activities of the ROBOSKIN project span a range of
levels from sensor hardware to application specific skills. The project
studies hardware for sensing not only the force and location of touch, but
also the direction of the force and other non-touch phenomena such as heat
and proximity.
The ROBOSKIN project is also developing a level of
problem domain and robot platform independent middleware such as
protective reflexes and touch recognition and production skills. Finally, the
ROBOSKIN project studies the application of robot skin technology in
several domains, including programming by demonstration (Calinon et al.,
2007) and HRI for therapeutic play (Dautenhahn et al., 2009). This chapter
focuses in particular on the protective reflexes activities of the ROBOSKIN
project.
Touch-triggered withdrawal reflexes are a technology that can contribute to
safer human, robot interaction (HRI). Safety is an area that already has a
long track record in HRI research. In this Section, we review existing work
on robot safety and place touch-triggered withdrawal reflexes within this
larger context.
1.1 The Principle of Separation
Traditionally, in industrial applications, safe HRI is achieved by isolating
the robot from the human (Corke, 1999) (ANSI, 1999) (Gaskill and Went,
1996), in effect prohibiting interaction. As robots move out of isolated
industrial environments and into interactive environments, this approach is
no longer tenable (Corke, 1999). Despite the fact that several manufacturers
of service robots have released their technology onto the commercial
market, we do not yet have worldwide guidelines or regulations concerning
the safety aspects of such robots (Kabe et al., 2009).
Recently, an
industrial assistant robot system designed for close human-robot-interaction
was launching into the market (Oberer et al., 2006). The new international
standard for robot systems ISO 10218-1 (ISO, 2006) covers this situation
and provides regulations for HRI. In this case, the humans in the active
workspace of a robot can be protected by the internal safety control, while
the area surrounding both the robot cells and the operator should be guarded
against entry of unauthorized personnel.
Industrial assistant robots are
typically both heavy and fast requiring comprehensive safety control
systems usually involving high costs. Service robots are typically smaller
than industrial robots and thus have lower kinetic energy. They also have
higher levels of mobility and autonomy than assistant robots in industry.
They are not restricted to static mass production scenarios, but are employed
in a wide range of roles in industrial and home applications. The successful
commercialization of service robots requires a low pricing strategy that
cannot support the comprehensive safety control systems employed by
industry.
1.2 Sensitisation
Consider the example of a compliant robot arm colliding with an object. In
this case, the compliance provides some measure of safety within a range,
but when the range of compliance is exhausted, the force of the impact will
again increase and along with the probability of damage or injury.
Compliance provides a window of time within which it is necessary to sense
the impact and stop or even reverse the underlying motion in order to avoid
the potential damage. Haptic sensors, e.g., motor stress or strain gauges are
commonly applied for this purpose. An early warning system would only
be effective, however, if coupled with correspondingly quick and beneficial
reflex motions. This is achieved in humans and animals through dedicated
neural pathways called reflex arcs, and evolved reflex motions. In Section 2
we review what is known about these biological mechanisms. In Section 3
we discuss the capture of human reflex motions and present an experiment
designed for this purpose. In Section 4 we present initial results from these
experiments.
In Section 5 we discuss technologies and previous work
related to protective robot reflexes and in Section 6 we discuss the potential
roles of generic protective reflexes in a wider robot control architecture and
also mechanisms for the integration of generic reflexes and domain specific
robot behaviours.
2. Human Nociceptive Withdrawal Reflexes
A reflex is an almost instantaneous movement that the body performs in
response to a stimulus without having to think about it (Zehr and Stein,
1999). For example, a human being does not decide to extend the leg when
the tendon below the knee is hit. Instead it happens automatically. Healthy
human beings posses many types of reflexes and many of these reflexes are
present from birth. Reflexes allow an animal to quickly adjust its behaviour
to sudden environmental changes so as to protect itself. Analyses of the
nervous system show that reflexes are essentially direct connections
between the sensory neurons, recognising the sensory data, and the motor
neurons responsible for the response.
2.1 The Anatomy of a Reflex
The simplest form of animal behaviour is the reflexive behaviour in which
some fast, stereotyped responses are triggered by a particular class of
environmental stimuli (Beer et al., 1990) (Ghez, 1990). A reflexive response
occurs, for example, when one steps on a sharp object or when one touches
a hot surface. Reflexes allow an animal to quickly adjust its behaviour to
sudden environmental changes so as to protect itself. Reflex actions are
realised by the reflex arc (Stűttgen, 1999). The reflex arc is the simplest and
most primitive nerve pathway in the human body (Kandel et al., 1995). A
single connection can contain as little as a single synapse. All reflex arcs
have a minimum of five functional elements:
1. The receptor; this host the dendrite of the sensory neuron and reacts to a
stimulus.
2. The sensory neuron; this conducts the afferent impulses to the central
nervous system (CNS).
3. The integration centre; this consists of one or more synapses in the CNS.
4. The motor neuron; this conducts the efferent impulses from the
integration centre to an effector.
5. The effector; this is muscle fibres or glands, respond to the efferent
impulses by contraction or secretion a product, respectively.
Reflexes protect the body from things that can harm it, for example, if the
human hand is about to touch a hot object, as soon as the body sense the
heat from that hot object, a reflex will cause the human to immediately
remove the hand before the warning message of the danger from that hot
object gets to the human brain. Reflexes exist in different complexity stages,
native, learned or trained. The automatic, stereotyped reaction to
disturbances is a basis to survive in a mainly consistent environment. A
biological reflex action can be divided into three fundamental phases:
1. The reflex is triggered by an adequate stimulus that is recognized by a
receptor cell.
2. The generated neural activation is passed from the sensory nerves to the
respective processing centre in the spinal cord.
3. In the spinal cord the signal is transmitted through excitatory or
inhibitory synapses to the motor nerves, which are connected to the
descending nerves of higher control structures.
2.2 Types of Reflexes
Reflexes are classified as either autonomic or somatic. Autonomic reflexes
are not subject to conscious control. These reflexes activate smooth, i.e.,
involuntary, muscles, including the heart. These reflexes also activate the
glands of the body and regulate body functions such as digestion and blood
pressure. Somatic reflexes include all reflexes that stimulate skeletal, i.e.,
voluntary, muscles.
These reflexes can be modulated by conscious
processes.
Autonomic Reflexes Autonomic or visceral reflexes, refer to reflexes such
as those that effect gland secretion or cause the smooth muscle of the
cardiac system to contract. The main purpose of the visceral reflexes is to
ensure that the involuntary process of the body are in full operating
condition and can be tailored to react to a given situation. Heart rate,
respiratory rate, blood flow, and digestion are examples of the types of
visceral reflexes that require constant monitoring by the internal body
systems. Reflexes like coughing, swallowing, sneezing or vomiting are also
considered to be autonomic. However, these reflexes require at least some
participation of the skeletal muscles. The autonomic reflexes relate mainly
to parts of the human physiology that are typically not replicated on
humanoid robots such as the heart and the lungs. In general, they relate to
basic operation, keeping energy flowing through the body. As such these
reflexes are not currently relevant for implementation on robots, but one can
imagine that similar reflexes could be implemented on robots to, e.g., reduce
unnecessary activity when battery levels are low or to facilitate recharging.
The Somatic Reflexes: The somatic, or voluntary, reflexes enable humans
to react to environmental changes. They involve 31 pairs of spinal nerves,
i.e., nerves running through the spinal cord, and 12 pairs of cranial nerves,
i.e. nerves emerging directly from the brain without going through the spine.
This system controls movements of skeletal muscles. Somatic reflexes
include the flexor, stretch, crossed extensor, superficial cord, corneal, and
gag reflexes. The first four reflexes are controlled by spinal nerves while
the corneal, i.e. related to the transparent part of the eye, and the gag
reflexes are controlled by cranial nerves.
Spinal Reflexes:
Reflexes mediated by spinal nerves include stretch,
flexor, crossed extensor and superficial cord reflexes: The flexor, or
withdrawal, reflex is reviewed in detail in Section 2.3. The remaining
reflexes are reviewed briefly here.
Stretch reflexes are important postural reflexes that act to maintain posture,
balance, and locomotion. Stretch reflexes are produced by tapping a tendon,
which stretches the attached muscle. This stimulates muscle spindles, i.e.
specialized sensory receptors in the muscle, and causes reflex contraction of
the stretched muscle also resisting further stretching. Stretch reflexes are
segregated into different types including the patellar reflex, the Achilles
reflex and the biceps reflex. The patellar or knee-jerk reflex occurs when
the tendon just below the kneecap is struck. The reflex is processed at spinal
cord and does not involve the brain.
This reflex is known to be a
monosynaptic reflex, i.e., only one synapse needs to be crossed to complete
the circuit which triggers the reflex. The Achilles reflex or ankle-jerk reflex
is produced when the Achilles tendon at the back of the ankle is firmly
tapped. This reflex is also a monosynaptic. The biceps-jerk reflex is a
polysynaptic reflex, i.e., its arc contains more than one synapse. The biceps
reflex causes sudden contraction of a biceps and brings the forearm up
sharply when the tendon of the biceps muscle is firmly tapped. The origin
of the stretch reflexes is balance and posture maintenance. As such they
could provide a way of improving balance and posture on humanoid robots
and their potential benefits should be explored further.
The crossed extensor reflex is activated when numerous spinal cord
segments are activated via impulses coming from the receptor organs, and
the release of multiple motor neurons is required, this is known as an intersegmental reflex arc. This means that multiple effector organs are initiated
and multiple receptor organs sent signals. The required sensory impulses
from a receptor organ cross the spinal cord in order to stimulate an effector
organ on the opposite side of the body. This is known as a contra-lateral
action. This reflex, like the stretch reflexes, help maintain balance and
posture.
The superficial cord reflexes, or abdominal and plantar reflexes, are initiated
by the stimulation of receptors in the skin and mucous membranes. The
superficial cord reflexes depend both on brain participation and on the cordlevel reflex arc. The abdominal reflex contracts the muscles in the abdomen
on cutaneous stimulation, i.e., stimulation of the skin. The purpose of this
reflex is to protect the abdomen.
The plantar reflex is elicited by
stimulating the cutaneous receptors in the sole of the foot. In adults,
stimulation of these receptors causes the toes to flex and move closer
together. This is effectively a withdrawal reflex working to protect the foot,
but unlike the nociceptive withdrawal reflexes described below, the plantar
reflex is activated without noxious, e.g., painful, stimuli. The superficial
chord reflexes in general protect particularly vulnerable areas of the human
body.
This principle of lowering the reflex activation threshold for
particularly vulnerable areas could potentially be transferred to robots. The
principle has already been employed in developmental robotics, where the
palmar reflex, i.e., the reflex that closes a baby’s hand when something
touches its palm, has already been used as inspiration for the
implementation of an anthropomorphic finger with multi-point tactile
sensation (Banks, 2001).
Cranial reflexes: Two reflexes mediated by cranial nerves are the corneal
reflex and the gag reflex. The corneal reflex, or blink reflex, is mediated
through the trigeminal nerve. This reflex is activated by touching the cornea
and causes the eye to move. A similar reflex in robots could be beneficial to
protect cameras, however it would require sensitisation of the camera area
and a related protective mechanism, i.e., a robotic eyelid. The gag reflex is
activated by touching the back of the throat. As previously noted, the
absence of air and digestive channels in robots makes this reflex irrelevant
for robots at the moment.
2.3 The Nociceptive Withdrawal Reflex
A withdrawal reflex, also known as a flexor reflex, is a polysynaptic reflex
arc (Sherrington, 1910). Sensory and motor neurons are part of this reflex,
as are association neurons. Pain is the most obvious stimuli that initiates the
flexor reflex, as painful stimuli sets the whole reflex into action. The
receptor responds and sends the information via sensory neurons. It reaches
the spinal cord, and there, the association neurons take over the situation.
Motor neurons receive the information and immediately the muscles
involved contract to produce a withdrawing snap action. Additionally and
simultaneously, the antagonistic muscles relax, otherwise they may inhibit
the body's ability to snap the muscles into withdraw. It is not uncommon for
the body to initiate additional reflexes along with the flexor reflex. The
nociceptive flexion reflex (NFR) was first described in detail at the
beginning of the 20th Century by Sherrington (Sherrington, 1910) as a
general excitatory response of flexor muscles and inhibitory response of
extensor muscles to stimulation of the cutaneous afferents and deep nerves.
Nociception is the sense of noxious, potentially tissue damaging, stimuli
including pain. Sherrington also define reflex receptive field (RRF) as the
assemblage of receptive points that might evoke a particular reflex
movement when suitably stimulated. A receptive field is the region of the
sensor where an adequate stimulus elicits a response (Churchland and
Sejnowski, 1996). The first study of human NFR was carried out in 1960s
and found indications of a more complex organization of the human
withdrawal reflex (Hagbarth, 1960) (Kugelberg et al., 1960). The study,
carried out by Grimby (1963), observed ankle dorsiflexion as a result of
stimulation of the medial, distal sole of the foot, ankle extension after
stimulation of the plantar surface of the heel, inversion following
stimulation of the medial side of the foot sole, and eversion after stimulation
of the lateral side of the foot sole. These findings reveal a complex response
tending to withdraw the stimulated area from the noxious source.
In
humans, a reflex withdrawal reaction can be elicited by transcutaneous
electrical
stimulation
of
a
sensory
peripheral
nerve
and
the
electromyographic response recorded from the flexor and extensor muscles.
This nociceptive withdrawal reflex (NWR) is a polysynaptic spinal
nociceptive reflex and represents the mechanism for withdrawing an
extremity from injury (Sherrington, 1910).
The NWR is reproducible,
stimulus-dependent and is closely correlated with the intensity of subjective
pain perception (Willer, 1977) (Willer, 1984) (Chan and Dallaire, 1989).
Therefore the NWR and its modulation have been widely used in
experimental (deBroucker et al., 1989) (Arendt-Nielsen et al., 2000)
(Andersen, 2007) and pharmacologic studies (Petersen-Felix et al., 1998)
(Piguet et al., 1998) (Escher et al., 2007) as a non-invasive
neurophysiologic tool to objectively assess spinal nociceptive processing.
The repetition of reflex stimulations applied at the same spot of a subject
can result in a gradual decrease in the NWR amplitude. This spinal
phenomenon is termed habituation. Habituation is intensity and frequency
dependent, occurring more frequently at low intensities and at high
stimulation frequencies (Shahani & Young, 1971) (Dimitrijević et al.,
1972). Different studies have found that the background activities can also
affect the response of withdrawal reflexes. In stationary conditions, a
modulation of the nociceptive reflex in tibialis anterior, a muscle in the
lower leg, depends on which limb is supporting the body while standing
(Rossi & Decchi, 1994). Reflexes were seen to be increasing when standing
on the contralateral leg, and were also seen decreasing when supporting on
the ipsilateral leg compared with standing on both legs. The study of
comparing tibialis anterior responses during running and walking was
carried out by Duysens et al. (1991) and found that the cutaneously evoked
reflex sizes, using non-nociceptive intensities, are generally large during
running when the background activity is also largest.
Another study
(Duysens et al., 1993) also observed larger reflexes were during running
than during standing. Finally, research on infant humans and monkeys has
shown that the ability to inhibit reflexes increases with age, allowing
increased cortical control as a person matures (Sherrington, 1910).
2.4 Reflex Receptive Fields
Research on the NWR has shown that a reflex has a related reflex receptive
field (RRF) and that each RRF controls only a single or small group of
muscles (Andersen, 2007). The effectiveness of an RRF decays as the
stimuli moves from the field’s centre towards its edge. Further research on
RRFs has shown that such fields are also clearly observable in humans
(Andersen et al., 1999). Two RRFs on the sole of the human foot are
illustrated in Figure 1.
Figure 1: The reflex receptive fields (RRFs) on the sole of the foot for the
Tibialis anterior (TA) and Peroneus longus (PL) muscles in the lower leg
The two RRFs and their related muscles are illustrated in Figure 3 control
two muscles in the leg, the tibialis anterior and the peroneus longus. The
locations of these two muscles in the human leg are illustrated in Figure 1.
3. Capturing the Human Nociceptive Withdrawal Reflex
Modelling human reflexes is a two-part process. First the reflexes must be
produced, i.e., provoked. Second, they must be captured. In the work
presented here, the first point produced unexpected difficulties, as described
below.
3.1 Provoking the NWR
A range of methods have been used to elicit withdrawal reflexes in humans.
These include the application of pin-pricks or heat (Willer, 1984), reaching
with a transparent barrier (Sherrington, 1910), direct electronic stimulation
of the sensory nerve (Arendt-Nielsen et al., 1990) and electronic stimulation
of the sole of the foot (Willer, 1985). To capture the reflexes, a goniometer
or angle-meter, is commonly used (Andersen et al., 2004), measuring the
reflex for a single joint. Our work will use the Gypsy-6 mechanical fullbody motion capture suit from Animazoo Ltd. to capture protective reflex
responses. The Gypsy-6 suit is shown in Figure 5.
Figure 2: The Gypsy-6 mechanial motion capture from Animazoo Ltd. used
in our withdrawal reflex experiments
From the literature there were strong indications that the NWR habituates
rapidly.
We therefore arrange a schedule where our volunteers were
subjected to only three stimuli and where the order of the stimuli would
change.
This setup allowed us to test statistically for any habituation
effects.
3.2 The Withdrawal Reflex Capture Experiments
In order to get a good quality model of a withdrawal reflex and avoid any
possible anomalies in a single person, base it on reflexes from a range of
subjects in. To ensure consistent reflexes we formalised an experimental
setup where five different points on the subjects arm were stimulated during
different but similar reaching actions. The formalisation included the initial
position of the subject’s arm, the position a target to be reached for relative
to the subject, and an approach angle and stimulation point for a stimulation
tool consisting of a rod with a tip capable of delivering a mild (<200V)
shock to the subject. The procedure also involves dampening the skin on
the subject’s arm with a sponge. During early trials, a significant number of
subjects did not produce any significant responses to the stimuli presented.
These subjects reported that the stimuli were so weak that they were
difficult to detect. We discovered that different skin moisture levels had a
critical effect on the reflexes motions produced.
A procedure for each trial was also formalised which made it impossible for
the subject to predict the contact point by blindfolding the subjects and
covering their ears.
To enable the subjects to reach the target while
blindfolded, they were first allowed three attempts at reaching without the
blindfold. The blindfold was then introduced and the subjects were asked to
attempt to reach the target between two or three times, decided at random,
before the stimulus was introduced.
4
Results
In order to describe the reflex motion, it was necessary to isolate it from the
interrupted reaching motion. These isolated reflex motions could then be
aggregated and compared to give provide a good understanding of typical
reflex motions.
4.1 Identifying the Reflex Motion
Shoulder Rotation
40
Z_roll
X_pitch
Y_yaw
-40
-60
19.99
19.57
19.16
18.74
18.32
17.91
17.49
17.07
-20
16.66
0
16.24
Degree
20
Time (s)
Figure 3 presents a plot of the rotations of each of the three degrees of
freedom (DOFs) in a subject’s shoulder joint over the time of the final
reaching movement when stimulation took place.
Shoulder Rotation
40
Z_roll
19.99
19.57
X_pitch
Y_yaw
-40
-60
19.16
18.74
18.32
17.91
17.49
17.07
-20
16.66
0
16.24
Degree
20
Time (s)
Figure 3: The rotations of the 3 DOFs in a subject's shoulder joint during a
trial when stimulation took place. The rapidly changing values between
18.32 and 19.16 seconds reflect the relatively high speed of the reflex
motion compared to the reaching motion.
The withdrawal reflex can be easily identified from the plots in
Figure 3 and Error! Reference source not found.Figure 3 as the reflex
motion is much quicker and produces a more rapid change in the rotation
values than the interrupted reaching motion. It is also clear that the start and
end time of the reflex are consistent over the different DOFs. This allowed
us to extract the duration of the reflex motion and the corresponding
changes in rotation and translation values.
We performed the experiment described above on a group of 10 volunteers,
all males aged between 18 and 25 years of age. This gave us six samples for
each of the five stimulation points. The raw data describing the rotation of
each joint for the first stimulation point, the upper arm, bicep are given in
Table 1. As an exception, we got seven data points for this trial. The first
column gives a unique number of the subject columns two to six give the
rotation angle for each DOF of each joint, three for the shoulder and two for
the elbow. The last column gives the duration of the reflex motion.
Subject
Joint Rotation (degrees)
θ0x
θ0y
θ0z
θ1x
θ1y
Duration
(seconds)
1
0.92 0.23 0.16 39.80 33.99 0.19
2
0.03 0.53 0.58 35.25 34.37 0.40
3
0.06 0.44 0.02 13.94 15.64 0.19
4
1.07 0.08 0.53 4.83
0.31
0.67
5
0.31 0.05 0.01 8.50
7.74
0.34
6
0.03 0.07 0.04 3.71
1.43
0.91
7
0.03 0.07 0.01 14.23 2.53
0.27
x
0.35 0.21 0.19 17.18 13.72 0.42
s
0.45 0.20 0.25 14.53 14.90 0.27
Table 1: The joint rotations for the shoulder and elbow joints for each
subject for stimulation point 1, the bicep, with mean and standard deviation.
Data equivalent to that presented in Table 1 was also produced for the other
four stimulation points.
There was large variance in the data and
subjectively we have classified the responses into three classes: ‘noresponse’, ‘very small response’ and ‘normal response’. For modelling
purposes we plan to use only the data from the responses classified as
‘normal response’.
5
A Robotic Withdrawal Reflex
The concepts of robotic skin and robotic reflexes have both been explored
previously by multiple research groups. In this Section we review that work
and explain how our research differ from that work.
5.1 Robot Skin
Our work will provide study protective reflexes. These can be triggered by
both thermal receptors and mechanoreceptors. However, the first generation
robot skin prototype will only have pressure sensory capabilities.
Historically a wide range of sensory technologies has been studied in the
context of robot tactile sensing including microelectromechanical systems
(Gouaillier & Blazevic, 2006), force-sensitive resistors (FSRs) (Liu et al.,
1995) and a quantum tunnelling compound (Sandini et al., 2006). Dahiya et
al. (2010) have published a comprehensive review of human sensory
capabilities and potential technologies for robot skin. It is an explicit
objective for the ROBOSKIN project to develop robot skin technology
further, including an integration of multiple sensory modalities into a single
sensor. The initial work, however, will be focused on a new sensor
technology that provides purely tactile feedback from large areas of the
body. A picture of the initial prototype skin sensor that will be used for this
work work is given in Figure 4.
Each of the golden disks on the
interconnected triangular base plates is an individual touch sensor. Cannatta
et al. (2008) have presented this technology in detail.
Figure 4: A robot skin prototype fitted on a human hand, picture provided
by the Italian Institute of Technology
5.2 Related Work
Many contemporary robots are designed to imitate the functional skeletal
motions of man or animal in either locomotion or manipulation (Bekey and
Tomovic, 1986). The structures of the robots are anthropomorphic and the
joints of manipulators are commonly termed shoulder, elbow and wrist even
though these joints possess fewer degrees of freedom than the
corresponding human joints.
However, while the joints have some
similarity to human or animal anatomy, such is not the case with robot
control systems. Rather than using biological prototypes for robot control,
current systems are based largely on the application of either classical or
modern control theory to the robots. Yet, functional motions such as
locomotion or manipulation are under the control of complex hierarchical
systems which receive a variety of sensory inputs, perform signal processing
and pattern recognition, infer the desired actions on the basis of the
available knowledge, plan the motion and execute it. The implementation
of robot reflexes has been devised by analogy with the way in which
biological systems process sensory information in producing simple motions
of the extremities which called artificial reflex control (ARC). The ARC
principle was first implemented in the control of multifunctional grasping
devices by Tomovic and Boni (1962). A shape-adaptive artificial hand was
designed and constructed. The sensory input was provided by pressure
sensitive pads on the fingers. Unequal pressure on the pads caused the
fingers to begin closing until the distribution was equalized, thus adapting to
the shape of the object being grasped. A later version of the "Belgrade hand"
was reported in 1975 (Tomovic & Stojiljkovic, 1975). The principle of nonnumerical control has also been applied to the design of assistive devices for
both upper and lower extremity disability (Tomovic and Rabishong, 1975)
(McGhee et al., 1978). The view that locomotion is basically an
asynchronous process, triggered by sensory information and leading to
specific joint states, was proposed by Tomovic and McGhee (1966) and
applied to the construction of a quadruped walking machine by Frank
(1968). Bekey et al. (1977) used a finite state model for the mapping of
muscle states to various pathological gaits in human subjects. The
electromyogram from each of eight muscles in the leg was quantized into
two levels and the resulting binary vector used as an input to a pattern
recognition algorithm.
Brooks et al. (1999) took inspiration from the human vestibule-ocular reflex
(VOR) and opto-kinetic nystigmus (OKN) in the development of the stereo
vision head of the Cog humanoid robot. VOR stabilises the eyes during
rapid head movements by counter rotating the eyes as a response to head
velocity as sensed by the otolith organs in the inner ear. A similar reflex
was implemented on the robot, driving the eyes as a response to rotation
measurements from a gyro.
OKN compensates for slow, smooth head
motions as a response to the optical flow sensed by the retina. A similar
mechanism was implemented using the optical flow on the background
image from Cog’s cameras.
5.3 The Fixed Mean Response Model
To implementing a withdrawal reflex on the Nao humanoid robot we will
sensitise the areas on the robot corresponding to the stimulus points and use
these sensitised areas to trigger the corresponding reflex as represented by
the model used to represent the human capture data. The simplest model we
have developed is based on calculating the mean joint angle difference in
the shoulder (3 DOF), elbow (2DOF) and the mean duration of the
withdrawal reflex.
As a result of individual differences in the underlying reaching motion
produced by the human subjects, the captured human reflexes were
produced from a range of similar, but slightly different poses. Reflexes
based on the fixed mean response model can be expressed in any arm
configuration, but being a simple average of human reflexes produced in
different poses, they are not likely to be realistic as they will not reflect any
differences in motion correlated to the differences in the pose at the time of
stimulation.
5.4 A Gaussian Mixture Model
The fixed response model of reflexes does not take into account differences
in start and end position and related differences in the reflex motion. To
capture these features of the reflex motions, we will also produce a Gaussian
mixture model (GMM) of the reflexes, using the spatially and temporally
normalised human reflex movements.
Compared to the fixed mean
response model, a GMM will provide a more sophisticated reflex model
able to express differences in movement correlated to differences in pose at
the time of stimulation.
5.5 A Hierarchical Self-Organising Map Model
In addition to the Fixed Mean model and the Gaussian mixture model we
will develop a connectionist model based on the Compressed Sparse Code
Hierarchical Self-Organising Map (CoSCo-HSOM) algorithm (Pierris &
Dahl, 2010). The CoSCo-HSOM model has been shown to be more robust
in terms of handling motions of varying duration, not requiring an initial
standardisation. By reusing sparse encodings of the principal components
of observed motions in a hierarchy of increasingly abstract sequences, the
CoSCo-HSOM algorithm achieves highly compressed representations of
motion data, though not as compressed as GMM.
6
Reflexes as Robot Middleware
Modularisation and reuse are two key principles of good software
engineering. We are dedicated to follow these principles and produce selfcontained and reusable functionality to avoid repetition of effort and thus
increase the rate at which robotic research progresses. In this Section we
discuss these concepts in a humanoid robot behaviour context.
6.1 Hardware Independent Generic Skills
In order to support modularisation of humanoid robot behaviours we use the
term middleware. In an operating system context this is software that sits
between the operating system and the user applications. In the humanoid
robotics context we define middleware as the software that exists between
the hardware specific software and the application specific software. An
example of hardware specific software is the software that calculates the
distance to an object based on vergence between the robots eyes, something
that is again dependent of the distance between the robots eyes (Ruesch,
2008). Correspondingly, we can consider the ability to estimate the distance
to an object through vision as generic middleware function.
A typical
example of application specific behaviour is the ability to dynamically
change interactions style, based on tactile interaction patterns, in order to
facilitate communication with an autistic child (Dautenhahn et al., 2009).
Protective reflexes can form a software layer that can potentially be ported
between humanoid platforms, i.e., it can be relatively hardware independent
and provide a generally useful, i.e., domain independent, capability. In
order to realise these ambitions, it is necessary to develop usable interfaces
both downwards to the specific platforms and upwards to as of yet
unspecified applications.
6.2 Humanoid Robot Hardware Abstraction
The iCub project has already done a lot of work on making the iCub
software reusable. Using YARP (yet another robotics platform) (Metta et.
al., 2006) they have separated the hardware from higher level application by
presenting a well defined interface to the hardware in terms on YARP
modules for the head, arms, legs, etc. This work will abstract the iCub
interface further to create a standard humanoid interface. In particular we
aim to provide this interface for the iCub, Nao and Kaspar platforms. Each
of these platforms share a basic physiology but they also have unique
features.
Figure 5: The iCub, Nao and Kaspar humanoid robots, all targets for the
same set of generic withdrawal reflexes
6.3 Integrating Generic and Domain Specific Skills
On the other side of the middleware is the interface it presents to domain
specific applications. Currently, a wide range of behaviour representations
are in use. Many systems use ad-hoc representations built on top of popular
low-level platform standards such as Player/Stage (Vaughan et al., 2003) or
YARP (Metta et al., 2006). There are also efforts at standardising higher
level behaviour representations using frameworks such as behaviour-based
AI (Arkin, 1997) (Proetzsch et al., 2010). On top of these frameworks,
there are a wide range of representations used to learn robot behaviours.
These include artificial neural networks (Yamashita & Tani, 2008),
Gaussian mixture models (Calinon & Billard, 2009) and reinforcement
learning approaches (Hester et al., 2010). While these technologies provide
representations for individual behaviours, they do not provide mechanisms
for integrating skills on different levels.
Our approach will embed the
generic skills in relevant hardware modules, hiding their presence from
developers who, as a result, will be able to use the underlying hardware as
normal with the additional assumption that the robot will keep itself safe by
expressing protective reflexes whenever necessary. For this purpose we will
develop a range of alternative YARP modules, mirroring the existing YARP
representation of the humanoid iCub robot using abstract modules such as
‘arm’, ‘leg’ and ‘head’.
The new modules will, to the developer look
exactly like the existing ones, but they will require the presence of tactile
sensors and they will, in given circumstances, override the commands from
the domain specific applications and instead express a set of built-in
protective reflexes.
7
Conclusions
Our next step is to use the results from our experiments to implement
withdrawal reflexes on the Nao and then to generalise this implementation
so that it can be ported to different platforms and be used beneficially in a
wide range of problem domains. At the same time, we will start capturing
reflex motion for a new scenario where the subject stands still and receives a
stimuli in the hip/lower torso area to see how reflex motions interact with
other crucial capabilities such as balance maintenance that are likely in their
turn to rely on a separate set of reflexes. Finally, we also plan to approach
the reflex modelling problem from an optimisation angle, automatically
adapting robot reflex movements to optimise a set of metrics such as the
minimisation of stimuli strength and duration.
Acknowledgements
This research was partially funded by the European Commission’s Seventh
Framework Programme FP7/2007-2013) under grant agreement number
231500.
References
American National Standards Institute (1999). American national standard
for industrial robots and robot systems safety requirements.
Andersen, O. K., Sonnenborg, F. A. & Arendt-Nielsen, L. (1999). Modular
organisation of human leg withdrawal reflexes elicited by electrical
stimulation of the foot sole. Muscle & Nerve, 22(11):1520-1530.
Andersen, O. K., Finnerup, N. B., Spaich, E. G., Jensen, T. S. & ArendtNielsen, L. (2004). Expansion of nociceptive withdrawal reflex receptive
fields in spinal cord injured humans’, Clinical Neuropsychology,
115(12):2798–2810.
Andersen, O. (2007). Studies of the organization of the human nociceptive
withdrawal reflex. Acta Physiologica, 189(654):1-35.
Arendt-Nielsen, L., Anker-Moller, E., Bjerring, P. & Spangsberg, N. (1990).
Onset phase of spinal bupivacaine analgesia assessed quantitatively by laser
stimulation. British Journal of Anaesthesia, 65(5):639.
Arendt-Nielsen, L., Sonnenborg, F. & Andersen, O. (2000). Facilitation of
the withdrawal reflex by repeated transcutaneous electrical stimulation: An
experimental study on central integration in humans. European Journal of
Applied Physiology, 81(3):165-173.
Arkin, R. C. (1998). Behavior-Based Robotics, MIT Press.
Banks, J. (2001).
Design and Control of an Anthropomorphic Robotic
Finger with Multi-point Tactile Sensation. MIT AI Lab Technical Report,
AITR-2001-005.
Beer, R., Chiel, H. & Sterling, L. (1990). A biological perspective on
autonomous agent design. Robotics and Autonomous Systems, 6(1-2):169186.
Bekey, G. Chang, C., Perry, J. & Hoffer, M. (1977). Pattern recognition of
multiple EMG signals applied to the description of human gait.
Proceedings of the IEEE, 65(5):674-681, 1977.
Bekey, G. & Tomovic, R. (1986). Robot control by reflex actions. In the
Proceedings of the 1986 IEEE International Conference on Robotics and
Automation (ICRA), 3:240-247.
Calinon, S., Guenter, F. & Billard, A. (2007). On learning, representing and
generalizing a task in a humanoid robot. IEEE transactions on systems,
man and cybernetics, Part B. Special issue on robot learning by
observation, demonstration and imitation, 37(2):286–298.
Cannata, G., Maggiali, M., Metta, G. & Sandini, G. (2008). An embedded
artificial skin for humanoid robots.
In the Proceedings of IEEE
International Conference on Multisensor Fusion and Integration for
Intelligent Systems, pp434–438, Seoul, Korea, August 20-22.
Chan, C. & Dallaire, M. (1989).
Subjective pain sensation is linearly
correlated with the flexion reflex in man. Brain research, 479(1):145-150.
Churchland, P. & Sejnowski, T (1996). Receptive fields: Size and centersurround organization. The Computational Brain. MIT Press.
Corke, P. (1999).
Safety of advanced robots in human environments.
Discussion paper for the International Advanced Robotics Programme
(IARP).
Dautenhahn, K., Nehaniv, C., Walters, M., Robins, B., Kose-Bagci, H.,
Mirza, A. & Blow, M. (2009). Kaspar - A minimally expressive humanoid
robot for human-robot interaction research.
Applied Bionics and
Biomechanics, 6:369-397.
deBroucker, T., Willer, J. & Bergeret, S. (1989). The nociceptive flexion
reflex in humans: A specific and objective correlate of experimental pain.
Issues in pain measurement, Raven Press Ltd., New York, pp337-364.
Dimitrijevic, H., Faganel, J., Gregoric, M., Nathan, P & Trontelj, J. (1972).
Habituation: effects of regular and stochastic stimulation.
Journal of
Neurology, Neurosurgery & Psychiatry, 35(2):234-242.
Duysens, J., Tax, A., Trippel, M. & Dietz, V. (1993). Increased amplitude
of cutaneous reflexes during human running as compared to standing. Brain
Research, 613(2):230-238.
Duysens, J., Tax, A., Doelen, B., Trippel, M., & Dietz, V. (1991). Selective
activation of human soleus or gastrocnemius in reflex responses during
walking and running. Experimental Brain Research, 87(1):193-204.
Escher, M., Daali, Y., Chabert, J., Hopfgartner, G., Dayer, P. & Desmeules,
J. (2007).
Pharmacokinetic and pharmacodynamic properties of
buprenorphine after a single intravenous administration in healthy
volunteers: a randomized, double-blind, placebo-controlled, crossover
study. Clinical therapeutics, 29(8):1620-1631.
Frank, A. (1968).
Automatic control systems for legged locomotion
machines.
Gaskill, S. & Went, S. (1996). Safety issues in modern applications of
robots. Reliability Engineering and System Safety, 52:301-307.
Ghez, C. (1990). The control of movement. Principles of neural science.
ISO
(2006)
International
Standard
10218-1:2006.
International
Organization for Standardization.
Gouaillier, D. & Blazevic, P. (2006).
A mechatronic platform, the
Aldebaran robotics humanoid robot. Proceedings of the 32nd IEEE Annual
Conference on Industrial Electronics (IECON 2006), pp4049-4053.
Grimby, L. (1963).
Normal plantar response: integration of flexor and
extensor reflex components. British Medical Journal, 26(1):39-50.
Hagbarth, K. (1960) Spinal withdrawal reflexes in the human lower limbs.
British Medical Journal, 23(3):222-227.
Kabe, T., Tanaka, K., Ikeda, H., & Sugimoto, N. (2009). Consideration on
safety for emerging technology - Case studies of seven service robots.
Safety Science.
Kandel, E., Schwartz, J. & Jessell, T. (1995). Essentials of neural science
and behavior. McGraw-Hill/Appleton & Lange.
Oberer, S., Malosio, M. & Schraft, R. (2006). Investigation of robot human
impact. VDI BERICHTE, 1956:87.
Kugelberg, E., Eklund, K. & Grimby, L. (1960). An electromyographic
study of the nociceptive reflexes of the lower limb: Mechanism of the
plantar responses. Brain, 83(3):394-410.
Liu, H., Meusel, P. & Hirzinger, G. (1995). A tactile sensing system for the
DLR three-finger robot hand.
In the Proceedings of the International
Symposium on Measurement and Control in Robotics, pp91-96, Smolenice
Castle, Slovakia.
McGhee, R., Tomovic, R., Yang, P. & MacLean, I. (1978).
An
experimental study of a sensorcontrolled external knee locking system.
IEEE Transactions on Biomedical Engineering, pp195-199.
Metta, G., Fitzpatrick, P. & Natale, L. (2006). YARP: Yet Another Robot
Platform. International Journal of Advanced Robotics Systems, special issue
on Software Development and Integration in Robotics, 3(1):43-48.
Petersen-Felix, S., Luginbuhl, M., Schnider, T., Curatolo, M., ArendtNielsen, L. & Zbinden, A. (1998). Comparison of the analgesic potency of
xenon and nitrous oxide in humans evaluated by experimental pain. British
Journal of Anaesthesia, 81(5):742-747.
Pierris, G. & Dahl, T. S. (2010). Compressed Sparse Code Hierarchical
SOM on Learning and Reproducing Gestures in Humanoid Robots.
Accepted to the International Symposium in Robot and Human Interactive
Communication (RO-MAN'10), Viareggio, Italy.
Piguet, V., Desmeules, J. & Dayer, P. (1998)
Lack of acetaminophen
ceiling effect on R-III nociceptive flexion reflex.
European journal of
clinical pharmacology, 53(5):321-324.
Proetzsch, M., Luksch, T. & Berns, K. (2010). Development of complex
robotic systems using the behavior-based control architecture iB2C.
Robotics and Autonomous Systems, 58:46-67.
Rossi, A. & Decchi, B. (1994). Flexibility of lower limb reflex responses to
painful cutaneous stimulation in standing humans: evidence of loaddependent modulation. Journal of Physiology, 481(2):521-532.
Ruesch, J., Lopes, M., Bernardino, A., Hornstein, J., Santos-Victor, J. &
Pfeifer R. (2008).
Multimodal Saliency-Based Bottom-Up Attention: A
Framework for the Humanoid Robot iCub. In the Proceedings of the IEEE
International Conference on Robotics and Automation (ICRA 2008), pp962967, Pasadena, California.
Sandini, G., Metta, G. & Vernon, D. (2006). The iCub cognitive humanoid
robot: An open-system research platform for enactive cognition. 50 Years
of Artificial Intelligence, pp358-369.
Shahani, B. & Young, R. (1971). Human flexor reflexes. British Medical
Journal, 34(5):616-627.
Sherrington, C. (1910). Flexion-reflex of the limb, crossed extension-reflex,
and reflex stepping and standing. The Journal of Physiology, 40(1-2):28121.
Stűttgen, G. (1999).
Reflex therapy in dermatology.
Clinics in
dermatology, 17(1):77-91.
Tomovic, R. & Boni, G. (1962).
An adaptive artificial hand.
IRE
Transactions on Automatic Control, 7(3):3-10.
Tomovic, R. & McGhee, R. (1966). A finite state approach to the synthesis
of bioengineering control systems. IEEE Transactions on Human Factors
in Electronics, pp65-69.
Tomovic R. & Rabishong, P. (1975). Trends in assistive devices for upper
and lower extremities. In the Proceedings of the 5th Internal Symposium on
External Control of Human Extremities, pp27-31, Dubrovnik.
Tomovic R. & Stojiljkovic, Z. (1975). Multifunctional terminal device with
adaptive grasping force," Automatica, 11(6):567-570.
Vaughan, R. T., Gerkey, B. & Howard, A. (2003). On Device Abstractions
for Portable, Resuable Robot Code. In the Proceedings of the IEEE/RSJ
International Conference on Intelligent Robot Systems (IROS 2003),
pp2121-2427, Las Vegas, Nevada.
Willer, J. (1977). Comparative study of perceived pain and nociceptive
flexion reflex in man. Pain, 3(1):69-80.
Willer, J. (1984). Nociceptive flexion reflex as a physiological correlate of
pain sensation in humans: Pain Measurement in Man. Neurophysiological
Correlates of Pain, 87:110.
Willer, J. (1985).
Studies on pain: Effects of morphine on a spinal
nociceptive flexion reflex and related pain sensation in man.
Brain
Research, 331(1):105-114.
Zehr, E. & Stein, R. (1999). What functions do reflexes serve during human
locomotion? Progress in neurobiology, 58(2):185-205.
Y. Yamashita and J. Tani: "Emergence of functional hierarchy in a multiple
timescale neural network model: a humanoid robot experiment", PLoS
Computational Biology, Vol.4, Issue.11, e1000220, 2008.
Statistical Learning by Imitation of Competing Constraints in Joint Space
and Task Space S. Calinon and A. Billard Advanced Robotics, 23 (2009)
2059-2076
Hester, T., Quinlan, M. & Stone, P. (2010) Generalized Model Learning for
Reinforcement Learning on a Humanoid Robot. In the Proceedings of the
IEEE International Conference on Robotics and Automation (ICRA 2010),
Anchorage, Alaska.