Non-homogeneous servers in emergency medical - (DEP)

ARTICLE IN PRESS
Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
www.elsevier.com/locate/seps
Non-homogeneous servers in emergency medical systems:
Practical applications using the hypercube queueing model
Reinaldo Morabitoa,, Fernando Chiyoshib, Roberto D. Galvãob
a
Departamento de Engenharia de Produc- ão, UFSCar, 13565-905 São Carlos, SP, Brazil
b
Programa de Engenharia de Produc- ão, COPPE/UFRJ, Brazil
Abstract
In this paper, we study the effects of considering homogeneous versus non-homogeneous servers in applications of the
hypercube queueing model. This is important since approximate methods available for solving the model for homogeneous
servers are computationally much less time-consuming than the exact methods required for the non-homogeneous case.
Illustrative examples are initially presented to show the degree to which using homogeneous versus non-homogeneous
servers can differ. Then, two ambulance deployment applications dealing with Brazilian emergency medical systems, in a
city and along a highway, are analyzed. The basic operational characteristics of non-homogeneous systems were compared
to the respective predictions produced under the simplifying assumption of homogeneous servers. It was found that, even
when the degree of non-homogeneity of the servers is not highly significant, homogeneity may lead to poor predictions of
the actual operational characteristics of non-homogeneous systems.
r 2007 Elsevier Ltd. All rights reserved.
Keywords: Emergency medical systems; Queueing; Hypercube model; Ambulance deployment; Non-homogeneous servers
1. Introduction
The hypercube queueing model, proposed by Larson [1], and studied by several authors (e.g., [2–4]) is an important
tool for planning service systems, especially within urban environments where servers travel to offer some type of
service to clients (server-to-customer service). The model takes into account geographical and temporal complexities
of the regions under consideration, and is based on spatial queueing theory and Markovian analysis approximations.
Basically, the idea is to expand the state space description of a queueing system with multiple servers in
order to represent each server individually and incorporate more complex dispatch policies. A number of
system performance measures can be estimated, either region-wide, or for each server or region. These include
workloads, mean response times, and fraction of dispatches of each server to each region, among others.
Applications of the hypercube model in the United States include the location of ambulances in Boston [5]
and Greenville, NC [6], police patrolling in New Haven [7] and Orlando, FL [8], and a program for visits by
the social system (see [9, Chapter 5]). More recently, the hypercube has been considered as a deployment
model for response to terrorism attacks and other major emergencies [10].
Corresponding author.
E-mail addresses: [email protected] (R. Morabito), [email protected] (F. Chiyoshi), [email protected] (R.D. Galvão).
0038-0121/$ - see front matter r 2007 Elsevier Ltd. All rights reserved.
doi:10.1016/j.seps.2007.04.002
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
2
Some examples of the use of the hypercube model in Brazil include the design of a repair service related to
interruptions in the distribution of electrical energy, studies of the public emergency medical system of the City
of Campinas [11], and emergency medical systems that operate on highways in the states of Rio de Janeiro and
São Paulo [12,13]. Other references related to applications and extensions of the hypercube model can be
found in Halpern [14], Jarvis [15], Batta et al. [16], Swersey [4], Chiyoshi et al. [17], Saydam and Aytug [18],
Galvão et al. [19] and Atkinson et al. [20].
To solve the hypercube model, it is necessary to find the solution to a system of 2n linear equations (where n
is the number of servers in the system) that provides the equilibrium probabilities of all possible states of the
system (the state probabilities). For moderately sized systems, this can be done in a number of ways, such as
using Gauss’ elimination method, or the iterative method of Gauss–Seidel [21].
For larger systems (n415), these exact methods become computationally prohibitive. Larson [22] proposed an
approximate method based on a system of n non-linear equations to solve the model, but it is valid only if the
servers are homogeneous. Nevertheless, the method is computationally much less time-consuming than exact
methods, with average errors of less than 4% when the corresponding values are compared with exact solutions.
In the present paper, we assume that the servers are non-homogeneous within applications of the hypercube model.
Since systems with such servers require an exact method of solution, our approach can only be justified if both:
(i) the application under consideration requires the use of non-homogeneous servers; and
(ii) it can be shown that the values of the output measures of the hypercube model differ substantially from the
homogeneous case when non-homogeneous servers are considered.
Regarding the first point, systems with non-homogeneous servers can be found in several practical
applications. For example, in the emergency medical systems of some Brazilian cities, ambulances can be
either advanced support (VSAs) or basic support vehicles (VSBs) [11]. Such servers, even when they share the
workload, may realize average service times that differ substantially as a function of the service offered.
On the other hand, it is possible to find systems with similar servers in terms of vehicle, crew and equipment,
but with different service times when their fixed location is considered. For example, for emergency service
systems available on certain Brazilian highways, the average service times can vary due to travel time, which is
obviously a function of their locations (see [12,13]).
The second point is dealt with in the remainder of the paper. In Section 2, illustrative examples show the
degree to which solutions of the hypercube model using homogeneous and non-homogeneous servers can
differ. Two practical applications of moderate size (no15) are then discussed in Sections 3 and 4. Although the
differences in output measures do not appear to be as significant in these applications as in the illustrative
examples, they still suggest the need to use non-homogeneous servers in selected real-world scenarios. Finally,
in Section 5, we discuss the results of Sections 2–4, and present our conclusions.
2. Illustrative examples
In this section, the effects of non-homogeneity of the service rates are analyzed through three illustrative
models based on a three-atom network with three servers. These structures, referred to as ‘‘toy-models’’ by
some authors, are the smallest, non-trivial models to incorporate the basic characteristics of systems under
analysis. By their very nature, they are used to gain insight into problems of interest, but are generally unable
to address the complexity of real-world systems.
As previously noted, the insight we seek here involves the degree to which the operating characteristics of
the hypercube model depart from predictions offered by homogeneous server-based models. To this end, the
toy-models seem appropriate tools because, as it will be shown below, strictly regular and symmetric
structures can be built into such structures and, in a controlled experiment, the net effect of server nonhomogeneity on the output measures can be evaluated.
The first model (here, Model I) is the classical hypercube, in which the home locations of the servers are
decentralized, and a fixed-preference dispatch policy, with preferences set according to the shortest travel
distance/time criterion, is in use. The other two formulations (here, Models II and III) incorporate those
characteristics of the models used in the case studies of Sections 3 and 4, respectively.
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
3
Model II is a centralized structure in which all servers have the same home location. Further, since distance
is no longer a meaningful criterion for defining priorities, a random dispatch policy is assumed. Model III has
as its distinguishing feature the fact that the backup is partial, in the sense that servers cannot service calls
from all atoms. It is assumed there is no point in dispatching a server when the travel distance/time exceeds a
predefined limit.
Although the decision whether or not to assume a waiting line of calls is not critical in Models I and II, the
decision to not service a call in Model III is supported by the existence of a secondary emergency system to
handle overflows from the main system. In this case, a zero-capacity waiting line model seems appropriate.
The network on which Models I–III are defined consists of a region with three atoms connected by one-way
roads, as shown in Fig. 1. It is assumed that the centroids of the atoms are located at the vertices of an
equilateral triangle with sides of unit length. The inter-atom travel distances are shown in Table 1.
2.1. Model I
As noted earlier, Model I, is a decentralized structure with a shortest distance-based dispatch policy. Each
atom is the home location of a server, and a fixed-preference dispatch policy, with preferences set according to
shortest distances, is in use. The preference matrix is shown in Table 2.
For a zero-capacity waiting line system, the hypercube state probabilities are defined by a set of flowbalancing equations built around the hypercube states [9]:
State
{0 0 0}
{0 0 1}
{0 1 0}
{1 0 0}
{0 1 1}
{1 0 1}
{1 1 0}
{1 1 1}
Equation
lp000 ¼ m1p001+m2p010+m3p100,
(l+m1)p001 ¼ l1p000+m2p011+m3p101,
(l+m2)p010 ¼ l2p000+m1p011+m3p110,
(l+m3)p100 ¼ l3p000+m1p101+m2p110,
(l+m2+m1)p011 ¼ (l1+l2)p001+l1p010+m3p111,
(l+m3+m1)p101 ¼ l3p001+(l1+l3)p100+m2p111,
(l+m2+m3)p110 ¼ (l3+l2)p010+l2p100+m1p111,
mp111 ¼ lp110+lp101+lp011,
1
2
3
Fig. 1. Map of the three-atom region.
Table 1
Inter-atom travel distances
From atom i
1
2
3
To atom j
1
2
3
0
1
2
2
0
1
1
2
0
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
4
Table 2
Server dispatch preferences
Atom i
Server preferences
1
2
3
1st
2nd
3rd
1
2
3
2
3
1
3
1
2
where lj is the arrival rate of calls from atom j; mi is the service rate of server i; m ¼ m1+m2+m3 is the total
service rate; l ¼ l1+l2+l3 is the total arrival rate; and p, followed by a sub-string of 1’s and 0’s, refers to the
probability of the corresponding state, where 1 implies an available server and 0 a busy server.
For instance, p011 is the probability that servers 1 and 2 are busy and server 3 is available (note that the
server states are described from right to left).
Consider the equilibrium equation for state {0 1 1}. Note that on the left side (outflow rate of state {0 1 1}),
the system leaves state {0 1 1} if a call arrives, or a service is completed by servers 1 or 2. Conversely, on the
right side (inflow rate of state {0 1 1}), the system enters state {0 1 1} if: (i) a call arrives from atoms 1 or 2 when
the system is in state {0 0 1}; (ii) a call arrives from atom 1 when the system is in state {0 1 0}; or (iii) a service is
completed by server 3 when the system is in state {1 1 1}. (See Table 2.)
The above set of equations is singular; an additional equation is thus required to determine the hypercube
state probabilities. The standard procedure is to replace one of the equations by an equation related to the
probability normalization condition. The flow equation associated with state {1 1 1} is a convenient candidate
for replacement because the normalization equation can be used to handle a non-zero-capacity waiting line, if
required. For a zero-capacity waiting line, this equation is p000+p001+?+p111 ¼ 1, while, for an infinite
capacity queue, it becomes p000+p001+?+ p111/(1r) ¼ 1, where r ¼ l/m is the average workload of the
system.
It is assumed that the non-homogeneity of the system is characterized by the service rates of the three
servers, these rates being such that m14m24m3. It is further assumed that m1 is symmetric to m3 with respect to
m2, so that the ratio m1/m3, referred to herein as the fast to slow ratio (FSR), is a proper measure of the system’s
non-homogeneity. To build the non-homogeneity measure as an input parameter of the model, the average
service time is taken equal to the time unit, so that m2 ¼ 1 and, thus, m1 ¼ 2(1+1/FSR)1 and
m3 ¼ 2(1+FSR)1.
In order to analyze the effects of the servers’ non-homogeneity on the output measures, we begin by
selecting three of these metrics: workloads (ri), dispatch frequencies (fi), and average travel distances (Dui) of
server i. Consider, for instance, the average travel distance formula (the analysis is similar for travel times):
P
Dui ¼
f ½1
ij d ij þ ðDQ PS mi =mÞ
,
P ½1
j f ij þ ðPS mi =mÞ
j
where f ½1
ij is the fraction of all dispatches that send server i, when available, to atom j; dij is the average travel
distance from server i to atom j (when this server is available); PS is the system saturation probability; and DQ
is the average travel distance to a queued call [9].
A closer look at this formula shows that it is a weighted average of distances in which the dispatch
frequencies are the weights; its numerator is a weighted sum of distances while its denominator is the total
dispatch frequency of server i, fi. The expression is, in fact, the average distance traveled by the server when it
is dispatched to service a call, and could thus be called the average travel distance per dispatch.
If we look at the numerator, it represents the average distance traveled by the server for each call serviced by
the system. While the dispatch frequency shows the fraction of all calls serviced by the server, the average
traveled distance per call is the share of the server of the total distance traveled by the fleet of service vehicles.
The average travel distance per call of server i (the numerator of Dui , denoted by Du0i ) is included in this
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
5
analysis as a fourth measure of performance. Note that the average travel distance per call includes all calls,
and not just those to which a particular server was dispatched.
With the system workload set at 0.5 and evenly distributed demand assumed, the output measures for the
infinite capacity model are shown in Table 3 for three levels of non-homogeneity of the servers (FSR ¼ 1.2, 1.5
and 2.0). Results corresponding to the homogeneous system (FSR ¼ 1.0) are included to show the regular and
symmetric structure of the model; note that in this case every performance measure is the same for all three
servers. The output measures for average server 2 are generally not affected by server non-homogeneity while
those for slow server 3 are roughly symmetric to those for fast server 1.
As one would expect, the fast server takes a greater share of the system’s workload (here, the term workload
bears its original meaning as ‘‘load of work’’ and does not refer to the corresponding output measure). At
FSR ¼ 2.0, the fraction of all calls servicedPby the fast server is f1 ¼ 0.4082, whereas the share of the slow
server is f3 ¼ 0.2599 (note in Table 3 that i f i ¼ 1). This greater workload implies a higher proportion of
calls from outside its preferential area, and, so, its average traveled distance per dispatch (Du1 ¼ 0.6440) is
also higher.
The average distance traveled by the fast server for each call serviced by the system is also higher
(Du0i ¼ 0:2629), due to the combined effect of higher dispatch frequencies and traveled distances per dispatch.
Notice that, for FSR ¼ 2.0, the total distance traveled for each serviced call is 0.5957 (0.2629+
0.1938+0.1390). The share of the fast server of this total is 0.2629 (44.1%), while that of the slow server is
0.1390 (23.3%). As for the output measure ‘‘workload’’ (the average fraction of time the server is busy), we see
that the workload of the fast server, r1, tends to be smaller for higher FSRs; yet, its increased activity in terms
of number of calls serviced does not increase its utilization. In fact, the higher service rate of the fast server
results in a higher idle time for the server.
In order to assess the importance of server non-homogeneity on the performance measures of Table 3,
Table 4 shows these measures expressed as percentage deviations from the corresponding values of the
homogeneous system.
When the output measures are so expressed, we find the least affected to be workload and the average
traveled distance per dispatch. The most heavily affected measure is the average traveled distance per call,
while dispatch frequency falls somewhere in between.
The output measures related to travel distances must be taken cautiously. In a sense, all output measures are
dependent on the network topology since the dispatch preferences are derived from the inter-atom distances.
However, while this dependence is on the ordering of distances for workload and dispatch frequency, the
distance-related measures are dependent on the distances themselves, implying a stronger network topology
dependency.
Table 3
Model I: effects of server non-homogeneity on the performance measures
Performance measure
Server
Fast to slow ratio—FSR
1.0
1.2
1.5
2.0
Workload ri
Fast
Average
Slow
0.5000
0.5000
0.5000
0.4857
0.4983
0.5190
0.4718
0.4974
0.5456
0.4592
0.4980
0.5847
Dispatch frequency fi
Fast
Average
Slow
0.3333
0.3333
0.3333
0.3532
0.3322
0.3145
0.3774
0.3316
0.2910
0.4082
0.3320
0.2599
Travel distance per dispatch Dui
Fast
Average
Slow
0.5789
0.5789
0.5789
0.5953
0.5776
0.5659
0.6162
0.5786
0.5514
0.6440
0.5838
0.5351
Travel distance per call Du0i
Fast
Average
Slow
0.1930
0.1930
0.1930
0.2103
0.1919
0.1780
0.2326
0.1919
0.1604
0.2629
0.1938
0.1390
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
6
Table 4
Model I: effects of server non-homogeneity on performance under homogeneity (% deviation)
Performance measure
Server
FSR (%)
1.2
1.5
2.0
Workload ri
Fast
Average
Slow
2.9
0.3
3.8
5.6
0.5
9.1
8.2
0.4
16.9
Dispatch frequency fi
Fast
Average
Slow
6.0
0.3
5.6
13.2
0.5
12.7
22.5
0.4
22.0
Travel distance per dispatch Dui
Fast
Average
Slow
2.8
0.2
2.3
6.4
0.1
4.8
11.2
0.8
7.6
Travel distance per call Du0i
Fast
Average
Slow
9.0
0.6
7.8
20.5
0.6
16.9
36.2
0.4
27.9
From the data displayed in Table 4, it can be seen that, depending on the degree of server non-homogeneity,
performance using models that assume homogeneous servers may be at great variance with respect to the nonhomogeneous case.
2.2. Models II and III
Model II is a centralized formulation with random dispatch. Thus, all servers are located in the same atom
and there is no preferential service area for them. Since there are no distance differences to distinguish the
incoming calls, a random dispatch policy is assumed. In building the system’s equations, we must argue
differently from Model I. The transition rate, from, say, state {0 0 0} to state {0 0 1}, must be 13 of the total
arrival rate since random selection of the server to be dispatched will assign 13 of all calls to server 1. In a similar
way, the transition rate from state {0 0 1} to state {0 1 1} must be 12 of the total arrival rate for, then, we have
only two servers from which to randomly choose, and, in the long run, server 2 will be chosen half the time.
Keeping this in mind, the complete set of flow equations can be built for this model.
In evaluating the dispatch frequencies of unqueued calls ðf ½1
ij Þ, the random nature of the dispatch policy must
be taken into account, and appropriate formulas constructed. For example, the dispatch frequency of server 2
to atom 3 is obtained by combining:
the probability that the incoming call comes from atom 3, which is l3/l;
1
3 of the probability that all servers are free;
half the probability that server 2 is one of two free servers; and
the probability that server 2 is the only free server.
The resulting expression is thus: f ½1
23 ¼ (l3/l){(1/3)p000 + (1/2)(p001+p100) + p101}.
The dispatch frequencies for servicing queued calls can be evaluated using the usual formulas. Taking again
the dispatch frequency of server 2 to atom 3, this frequency is evaluated as the joint probability that all servers
are busy (PS), that the call to be serviced comes from atom 3 (l3/l), and that the first server to become free is
server 2 (m2/m). This results in: f ½2
23 ¼ PS(l3/l)(m2/m).
Model III is a partial backup model. It assumes that each atom is the home location of a server, and that a
server can only service calls from its preferential atom and from the second closest atom. Server 1, for
example, can take calls from atoms 1 and 2 but not from atom 3. The corresponding dispatch matrix is shown
in Table 5.
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
7
It may be assumed that the decision to not dispatch server 1 to atom 3 is based on the existence of a
secondary service system to handle such a call; a zero-capacity waiting line model thus seems appropriate. The
flow equations of the system can be built in the usual way. Care must be taken, however, when calculating
some of the upward transition rates. Consider state {0 1 1}, for example, where servers 1 and 2 are busy and
server 3 is free. A call from atom 1 will not change the state of the system because it cannot be serviced by
server 3 (Table 5); the call will be lost to the system. The upward transition rate from state {0 1 1} to state
{1 1 1} will therefore be: (l2+l3).
The dispatch frequencies can be evaluated in two steps, bearing in mind that, in this model, all dispatches
are without delay. The dispatch frequency of server 3 to atom 2, for example, will be evaluated as the joint
probability that the incoming call has originated in atom 2, that server 2 is busy, and that server 3 is free: f ½1
32
¼ (l2/l)(p010+p011).
The dispatch frequency of server i to
P atom j is defined simply as the fraction of all dispatches that send
server i to atom j; so we would expect i;j f ½1
ij ¼ 1. Due to lost calls, however, the dispatch frequencies based
on this formula will sum to less than one; in fact, they will sum to the fraction of calls serviced by the system
(or, one minus the fraction of lost calls). For this reason, a second step is required in which results from the
first step are normalized to sum one.
In order to provide a more comprehensive view of the effects of server non-homogeneity, Table 6 shows the
performance measures of the fast server as percentage deviations from the homogeneous system predictions
for each of the three models, at different levels of non-homogeneity, for homogeneous demand and r ¼ 0.50.
From Table 6, the conclusions regarding Model I seem to hold for both Models II and III. Depending on
the degree of server non-homogeneity, model predictions based on the simplifying assumption of homogeneity
may be at significant variance with the non-homogeneous situation. In fact, when the fast server is 20% faster
Table 5
Server dispatch preferences, partial backup
Atom i
Preferences
1
2
3
1st
2nd
1
2
3
2
3
1
Table 6
Models I–III: effects of server non-homogeneity on the performance of server 1 [fast server] (% deviations)
Performance measure
Model
FSR (%)
1.20
1.50
2.00
Workload ri
I
II
III
2.9
3.1
4.2
5.6
6.2
8.5
8.2
9.1
12.7
Dispatch frequency fi
I
II
III
6.0
5.7
4.6
13.2
12.6
10.2
22.5
21.2
17.5
Travel distance per dispatch Dui
I
II
III
2.8
0.0
4.2
6.4
0.0
9.7
11.2
0.0
16.8
Travel distance per call Du0i
I
II
III
9.0
5.7
9.0
20.5
12.6
20.9
36.2
21.2
37.3
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
8
than the slow server, results may be up to 9% in excess of homogeneous system predictions. This excess may
rise to 20.9% and 37.3% if the fast server is, respectively, 50% and 100% faster than the slow server.
2.3. Larger models
The data discussed thus far are based on a three-server hypercube model. It seems reasonable to ask how
useful is the insight provided by the analysis of such a small structure? To answer this question, larger models
with the same structure as the three-server structure were used; namely, a network based on homogeneous
demand areas (atoms) connected by one way ring-roads, one server per atom, with one fast and one slow
server. The number of servers with average service rate is then: (total number of servers—2), the two
remaining servers corresponding, respectively, to a fast and a slow server.
Table 7 shows the percentage deviations from the homogeneous model predictions for workload and total
dispatch rate, as a function of server type and number of servers. For all models, the total workload was set to
0.5, and the degree of non-homogeneity (as measured by FSR) to 1.5.
The data displayed in Table 7 show that the insight provided by the three-server hypercube model holds for
larger models as well. Table 8 shows the computing times (in seconds) needed to run the hypercube model,
using, respectively, the exact and approximate methods. Note that the latter can only be used for
homogeneous servers; the corresponding computing times thus refer to only the homogeneous case. Times for
the exact method, on the other hand, refer to only the non-homogeneous case. All problems were run on a
Pentium IV microcomputer with 3.0 GHz and 512 MB of RAM.
It should be observed that figures shown in the preceding sections were all derived from simple models,
specifically designed to show the effects of server non-homogeneity on performance measures of the hypercube
model. In the next two sections, the same effects will be analyzed for two models associated with real-world
problems.
3. Case study 1: SAMU-Campinas
In this section, we analyze the effects of non-homogeneous servers in a practical application: the public
emergency medical system of the City of Campinas (SAMU-Campinas), studied in Takeda et al. [11]. In 1998,
Table 7
% Deviations from homogeneous model predictions
Performance measure
Server type
Number of servers (%)
3
5
10
15
20
Workload
Fast
Average
Slow
5.6
0.5
9.1
6.0
0.3
10.0
6.3
0.1
10.5
6.4
0.1
10.7
6.4
0.0
10.7
Dispatch frequency
Fast
Average
Slow
13.2
0.5
12.7%
12.9
0.2
12.1
12.4
0.1
11.6
12.4
0.1
11.5
12.2
0.0
11.4
Note: System workload ¼ 0.5. Fast to slow service rates ratio ¼ 1.5.
Table 8
Processing times (s) for solving the hypercube model (exact and approximate methods)
Solution method
Exact
Approx.
Number of servers
3
5
o0.01
o0.01
0.03
o0.01
10
0.06
o0.01
15
1.86
0.03
20
107.36
0.05
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
9
SAMU-Campinas covered an area of 796 km2 with approx. 909,000 inhabitants. This region had, at the time,
10 ambulances dedicated to emergency calls: two vehicles for advanced support (called VSAs), and the other
eight for basic support (VSBs). The area was divided into five reporting regions (Central, North, South, East
and West), while the emergency calls were divided into two classes: those for advanced support (advanced
calls, for short), preferentially served by a VSA, and calls for basic support (basic calls), preferentially served
by a VSB.
The City’s operational policy in 1998 included the following: VSAs and VSBs, when available for service,
were to stay at system base (one of the public hospitals located within the Central region). If an advanced call
arrives, the system dispatches an available VSA; if all VSAs are busy, then the system dispatches an available
VSB; if all VSBs are also busy, the call is kept in a waiting list until either a VSA or a VSB becomes available.
Similarly, if a basic call arrives, the system preferentially dispatches a VSB. The total number of queued calls
(advanced and basic) is limited by the number of vehicles operating in the system, that is, usually up to 10
calls. If a new call arrives when the system is busy (i.e., 10 calls being served plus 10 calls waiting in line), then
this call is immediately transferred to another emergency system.
Takeda et al. [11] analyzed SAMU-Campinas using a hypercube queueing model composed of 10 atoms
(each of the five regions was split into two atoms, one for advanced calls, the other for basic calls), and 10
servers (two VSAs and eight VSBs). The system obeyed the following preference dispatching rule: for
advanced-call atoms (even-numbered atoms 2, 4, 6, 8 and 10), the primary servers are the VSAs with the
backup servers being the VSBs. For basic-call atoms (odd-numbered atoms 1, 3, 5, 7 and 9), the primary
servers are the VSBs with the backup servers being the VSAs.
This policy thus considers priorities on both the type of call and type of server, allowing the separate
computation of performance measures that are of key importance to the system: the mean travel times for each
type of call; the workloads for each type of server and call; the fraction of advanced calls served by VSAs, and so
on. This simple adaptation of the hypercube model to deal with priorities is referred to as the ‘‘layering’’ of
reporting regions [9]. The resulting model involves the solution of systems of 210 linear equations; it was solved
using the Gauss elimination method. Additional details on the model can be found in [11].
The arrival rates (advanced and basic) are considered exponentially distributed with the following values
(calls per hour): l1 ¼ 0.8535, l2 ¼ 0.1035, l3 ¼P0.8535, l4 ¼ 0.0776, l5 ¼ 0.8535, l6 ¼ 0.0776, l7 ¼ 1.5001,
l8 ¼ 0.1293, l9 ¼ 0.8276 and l10 ¼ 0.1293 (l ¼ 10
j¼1 lj ¼ 5:4054), all measured in a certain peak period of the
system [11]. The service times (VSAs and VSBs) are also considered exponentially distributed with values m1,
m2, ..., m10, measured in the same peak period. At the time data were collected, all service rates were similar
(approximately one call served per hour) due mainly to the system configuration (all ambulances located in the
Central Region). The validity of the exponential distribution assumptions here is supported by data presented
in [11].
The mean travel times between atoms and the mean on-scene service times are different for the VSAs and
VSBs: the former travel faster (approximately 20% faster) than the VSBs; on the other hand, the on-scene
times of the VSAs are, on average, higher (5% higher). The combination of these characteristics, together with
the centralized configuration, yields very similar mean service times for the VSAs and VSBs (63 and 66 min,
respectively).
For current purposes, we initially considered all servers (VSAs 1 and 2 and VSBs 3, 4, ..., 10) to have mean
service
P times of 1 h (i.e., as if they were homogeneous with service rates m1 ¼ m2 ¼ ? ¼ m10 ¼ 1 calls per hour;
m ¼ 10
i¼1 mi ¼ 10). We also used the arrival rates for each atom and the mean travel times between atoms as
computed in [11]. The servers share the overall system workload using the priority dispatching policy
mentioned above. (For advanced atoms, the primary servers are the VSAs, and the backup servers the VSBs;
for basic atoms, the primary servers are the VSBs with the backup servers being the VSAs.) The maximum
number of calls allowed in the waiting line is 10.
The first data columns of Tables 9 and 10 show the workloads (ri), and mean travel times (Tui) [Table 9] of
the servers, and the mean travel times to the atoms (Taj) [Table 10], all of which were obtained for what we call
Configuration 1. Note that, similar to the travel distance per dispatch Dui in Section 2, Tui (Taj) is the
weighted average of the travel times (where the dispatch frequencies are the weights). The corresponding
formula’s numerator is the weighted sum of the travel times while the denominator is the total dispatch
frequency of the server. Tui and Taj are therefore mean travel times per dispatch.
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
10
Table 9
Workloads (ri) and mean travel times (Tui) of servers, in minutes: comparison between Configurations 1, 2 and 3—homogeneous versus
non-homogeneous servers
Server i (VSAs and VSBs)
VSA 1
VSA 2
VSB 3
VSB 4
VSB 5
VSB 6
VSB 7
VSB 8
VSB 9
VSB 10
Configuration 1
Configuration 2
Configuration 3
m1 ¼ ? ¼ m10 ¼ 1
m1 ¼ m2 ¼ 0.5, m3 ¼ ? ¼ m10 ¼ 1.125
m1 ¼ m2 ¼ 2, m3 ¼ ? ¼ m10 ¼ 0.75
ri
Tui
ri
Tui
ri
Tui
0.34
0.34
0.59
0.59
0.59
0.59
0.59
0.59
0.59
0.59
10.6
10.6
14.2
14.2
14.2
14.2
14.2
14.2
14.2
14.2
0.50
0.50
0.55
0.55
0.55
0.55
0.55
0.55
0.54
0.54
10.4
10.4
14.2
14.2
14.2
14.2
14.2
14.2
14.2
14.2
0.29
0.29
0.71
0.71
0.71
0.71
0.71
0.71
0.71
0.71
11.3
11.3
14.3
14.3
14.3
14.3
14.3
14.3
14.3
14.3
Table 10
Mean travel times (Taj) to atoms (advanced and basic), in minutes: comparison between Configurations 1, 2 and 3—homogeneous versus
non-homogeneous servers
Atom j (advanced and basic)
Configuration 1
Configuration 2
Configuration 3
m1 ¼ ? ¼ m10 ¼ 1
m1 ¼ m2 ¼ 0.5, m3 ¼ ?
¼ m10 ¼ 1.125
m1 ¼ m2 ¼ 2,
m3 ¼ ? ¼ m10 ¼ 0.75
Taj
Taj
Taj
12.9
12.8
12.8
12.7
4.0
13.4
13.3
13.3
13.2
4.2
12.8
12.8
12.8
12.6
4.3
Advanced
10.6
11.0
10.6
Basic
Basic
Basic
Basic
Basic
16.0
15.9
15.9
15.8
5.3
16.1
16.0
16.0
15.9
5.3
15.8
15.7
15.8
15.5
5.5
Basic
14.1
14.2
14.0
All
13.8
13.9
13.6
Advanced
Advanced
Advanced
Advanced
Advanced
1
3
5
7
9
2
4
6
8
10
As one would expect, the workload is equally shared by all servers of the same type (VSAs and VSBs), where
the mean travel times of the VSAs are, on average, shorter than those of the VSBs since the former are
generally more time efficient (see Table 9). Substantial improvements can be realized in this emergency system
by decentralizing part of the VSBs located in the Central region, as shown in [11].
Tables 9 and 10 compare the results of Configuration 1 with two possible variations: Configuration 2, with
m1 ¼ m2 ¼ 0.5 and m3 ¼ m4 ¼ ? ¼ m10 ¼ 1.125, and P
Configuration 3 with m1 ¼ m2 ¼ 2 and m3 ¼
m4 ¼ ? ¼ m10 ¼ 0.75, both of which also satisfy m ¼ 10
i¼1 mi ¼ 10 calls per hour (i.e., system capacity
remains unchanged). Note that in Configurations 2 and 3 we, respectively, halve and duplicate the VSA service
rates with respect to Configuration 1.
In addition to the expected effect of changes in service rates on server workloads (r) (Table 9), observe that
the impact of servers on mean travel time (Tui in Table 9) and the atoms (Taj in Table 10) are relatively small.
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
11
This is in spite of substantial variations in the service rates. Although demand is not perfectly homogeneous,
and the dispatch priorities are not totally random in the present case (because of the ‘‘layering’’ of regions),
current results are in accordance with those obtained for Model II in Table 6. In the latter case, travel distance
per dispatch of the servers (Dui) were not affected by the non-homogeneity of the servers. (See Table 6, Model
II, for information related to the fast server; results for the slow and average servers were the same.)
We also ran configurations that are not reported in Table 10. Fig. 2 illustrates the variation of mean travel
times to advanced calls (Series 1 in the figure) and to all calls (Series 2) for service rates of the VSAs varying
from 0.1 to 4 calls per hour (or, equivalently, for service rates of the VSBs varying from 1.225 to 0.25 calls per
hour). Note that as we increase the service rates of the VSAs (or, equivalently, reduce the service rates of the
VSBs), the mean travel time to all calls diminishes (see Series 2 of Fig. 2) because the VSAs service more calls
than do the VSBs, and their travel times are shorter.
On the other hand, the mean travel time to an advanced call increases as the service rates of the VSAs
become: (i) too small (less than 0.5 in Configuration 2), as the VSAs become less available to service advanced
calls, or (ii) too large (greater than 2 in Configuration 3), as the VSBs become less available to service basic
calls. It is worth mentioning that the behavior of travel times could not be evaluated with this level of detail if
the analysis were limited to homogeneous servers.
Another important advantage of considering non-homogeneous servers in the hypercube model involves the
calibration process of mean service times (m1
i ). In certain emergency systems, travel time can represent a
significant proportion of total service time. In SAMU-Campinas, for instance, travel time from the
operational base to the locale of the call corresponds, on average, to 12% of the total service time of VSAs and
VSBs [11]. As this proportion increases, adjustment of the mean service time of each ambulance becomes more
important in order to reflect the effects of relevant geographic factors. This may also have importance for the
analysis of scenarios where some, or all, ambulances, are decentralized.
The approximate method of Larson [22] to solve the hypercube model assumes that the mi’s are the same for
all servers and inputs to the model, independent of both server location and dispatching policy. With exact
methods (considering the servers as non-homogeneous), it is possible to compute the mi’s as outputs of the
model (as functions of the system configuration). This can involve a calibration process using the iterative
procedure of Larson and Odoni [9].
As a matter of illustration, Table 11 presents the results obtained for Configuration 1 following the
calibration process. In this experiment, we considered as model inputs the on-scene service times, assumed
exponentially distributed with means of 50 min (Configuration 1a) and 40 min (Configuration 1b); that is,
Mean travel times
15
14
Travel times (min)
13
12
11
10
9
8
7
Series1
6
Series2
5
0
2
4
Service rate of the VSA's
Fig. 2. Mean travel times to advanced calls and to all calls.
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
12
Table 11
Workloads (ri), mean server travel times (Tui) and mean travel times to atoms (Taj), in minutes: comparison between Configurations 1a
and 1b
Server i
VSA 1
VSA 2
VSB 3
VSB 4
VSB 5
VSB 6
VSB 7
VSB 8
VSB 9
VSB 10
Configuration 1a
Configuration 1b
ri
Tui
ri
Tui
0.37
0.37
0.63
0.63
0.63
0.63
0.63
0.63
0.63
0.63
10.8
10.8
14.3
14.3
14.3
14.3
14.3
14.3
14.3
14.3
0.27
0.27
0.54
0.54
0.54
0.54
0.54
0.54
0.54
0.54
10.3
10.3
14.2
14.2
14.2
14.2
14.2
14.2
14.2
14.2
Atom j
Advanced
Advanced
Advanced
Advanced
Advanced
Advanced
Basic 1
Basic 3
Basic 5
Basic 7
Basic 9
Basic
All
2
4
6
8
10
Configuration 1a
Configuration 1b
Taj
Taj
13.0
12.9
13.0
12.8
4.2
10.7
16.0
15.9
16.0
15.8
5.4
14.1
13.8
12.5
12.5
12.5
12.4
3.6
10.3
15.9
15.9
15.9
15.9
5.0
14.1
13.7
approximately 83% and 67% of the mean service time of Configuration 1 (m1
i ¼ 1 h), respectively (in SAMUCampinas, the mean was 53 min [11]). We sought the mean travel times (Tui’s) and service rates (mi’s) of the
ambulances.
After calibration (five iterations were necessary for a precision of 106 in the mi’s), the initial values
m1 ¼ m2 ¼ ? ¼ m10 ¼ 1 changed to m1 ¼ m2 ¼ 0.9861, m3 ¼ m4 ¼ ? ¼ m10 ¼ 0.9337 (Configuration 1a), and
to m1 ¼ m2 ¼ 1.1928, m3 ¼ m4 ¼ ? ¼ m10 ¼ 1.1079 (Configuration 1b). Note that, in both cases, the overall
capacity of the system was modified with respect to Configuration 1: Configuration 1a resulted in m ¼ 9.4
(o10) calls per hour, whereas Configuration 1b yielded m ¼ 11.2 (410) calls per hour.
Comparing Tables 9 and 10 with Table 11, we observe that changes in the server workloads can be
substantial. The effect on more aggregate performance measures, such as mean travel time, however, was
relatively small. This was expected given the results of Model II in Section 2 (see the 0% deviation in ‘‘travel
distance per dispatch’’ for the fast server in Table 6; results for the slow and average servers were the same).
4. Case study 2: Anjos do Asfalto
In this section, we revisit the case of an emergency medical system on a Brazilian highway, named Anjos do
Asfalto, which was analyzed in Mendonc- a and Morabito [12]. In 1997–1998, this system consisted of a central
station and six fixed bases located along the highway connecting Rio de Janeiro and Sao Paulo. Each base had
an ambulance that, when available for service, was in the base.
The allocation of ambulances to service accidents is coordinated in the following way: when there is an
accident, the central station is called and immediately dispatches, to the site of the accident, the ambulance of
the base that is closest to the scene. If that ambulance is busy (unavailable), the central station dispatches the
ambulance of the second nearest base. If this ambulance is also busy, the central station transfers the call to
another emergency system, usually a local system (generally unable to provide service with the quality
available from the highway emergency medical system). Under these circumstances, the call is said to be lost.
Notice that given this dispatching policy, the system will never have a queue of waiting calls.
Mendonc- a and Morabito [12] analyzed the system Anjos do Asfalto using a hypercube queueing model
composed of 10 atoms and six servers, which followed the partial backup dispatching policy described above.
The model involves systems of 26 linear equations, which was solved using the Gauss elimination method.
(More details of the model can be found in [12].)
In that paper, the user interarrival times were considered exponentially distributed with the following rates
(arriving calls per minute): l1 ¼ 0.0028, l2 ¼ 0.0008, l3 ¼ 0.0020, l4 ¼ 0.0011, l5 ¼ 0.0018, l6 ¼ 0.0001,
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
13
P
l7 ¼ 0.0038, l8 ¼ 0.0018, l9 ¼ 0.0023 and l10 ¼ 0.0018 (i.e., l ¼ 10
j¼1 lj ¼ 0:018), measured during a selected
peak period of the system. The service times were also considered exponentially distributed with the following
ratesP
(served calls per minute): m1 ¼ 0.019, m2 ¼ 0.014, m3 ¼ 0.017, m4 ¼ 0.010, m5 ¼ 0.024 and m6 ¼ 0.013 (i.e.,
m ¼ 6i¼1 mi ¼ 0:097), measured in the same peak period. The validity of the exponential distribution
assumptions in this system is supported by data presented in [12]. The loss probability of a call and the overall
mean travel time to a call were approximately 5% and 8.6 min, respectively.
Table 12 compares the original results (Original Configuration) with those obtained assuming
homogeneous servers (Homogeneous Configuration), with all servers having the same service rate:
m1 ¼ m2 ¼ ? ¼ m6 ¼ 0.0162 (note that the original overall capacity m ¼ 0.097 is maintained). Note that
differences between the server workloads in the two configurations can be substantial (e.g., 29% for server 4
and +29% for server 5). On the other hand, differences in more aggregate performance measures, such as
mean server travel time, were not necessarily significant (Table 12). Thus, system loss probability and overall
mean travel time vary less than 5% between the two configurations.
In order to highlight the effect of non-homogeneous servers in the current study, we arbitrarily chose a
configuration with very different servers (Very Heterogeneous Configuration), in which the service rate of the
ambulance in base i is twice that of the ambulance in base (i1), that is: m1 ¼ 0.0015, m2 ¼ 0.0031, m3 P
¼ 0.0062,
m4 ¼ 0.0123, m5 ¼ 0.0246 and m6 ¼ 0.0493 (note that the overall capacity is maintained, i.e., m ¼ 6i¼1 mi ¼
0:097 calls per minute).
Table 13 compares results of the Homogeneous Configuration with those obtained for the Very
Heterogeneous Configuration. Observe that for some servers the difference in mean travel time is substantial
(e.g., from 7.5 to 12.2 min for server 2). In some sense, this was expected; refer to Model III in Section 2. In this
model, the demand is perfectly homogeneous and the calculations were made for r ¼ 0.5. Notice that in Table
12 the demand is not homogeneous and the workloads are below 0.5. Although the Very Heterogeneous
Configuration and Model III are not strictly comparable, the trend observed in Table 12 is to some extent
present in that model (see Table 6). System loss probability and the overall mean travel time vary from 4.8%
to 16.7%, and from 8.3 to 9.7 min, respectively.
Finally, in order to illustrate the benefits of using non-homogeneous servers in the process of calibrating
average service times (mi1), we performed an experiment assuming that the on-scene service times are
exponentially distributed with means of 60 min (Configuration calibrated with 60 min) and 50 min
(Configuration calibrated with 50 min).
The mean on-scene service time of Anjos do Asfalto belongs in this interval, as reported in [12]. After
calibration, the initial values: m1 ¼ m2 ¼ ? ¼ m6 ¼ 0.0162 changed to those presented in Table 14. Notice that
in both cases the overall system capacity differs from that of the Homogeneous Configuration: In the first case,
the capacity is m ¼ 0.0880 (o0.0970), whereas in the second this capacity is m ¼ 0.1037 (40.0970).
It should be noted that in Table 14, although some values of mi and ri have deviations larger than 10% from
corresponding values in the Homogeneous Configuration, the deviations between mean travel times, Tui, are
relatively small (less than 5%). The variations in Tui are consistent with the results of Section 2 (observe the
4.2% variation shown for the fast server of Model III in column FSR ¼ 1.2 of Table 6). System loss
probability and the overall mean travel time of the configurations calibrated with 60 and 50 min are 5.8% and
Table 12
Workloads (ri) and mean server travel times (Tui), in minutes: Comparison between the Original and the Homogeneous Configurations
Server i
1
2
3
4
5
6
Original Configuration
Homogeneous Configuration
% Deviations
ri
Tui
ri
Tui
ri
Tui
0.14
0.19
0.16
0.30
0.18
0.15
4.7
7.5
5.9
11.5
10.5
9.4
0.15
0.17
0.17
0.22
0.24
0.13
4.6
7.5
5.7
11.5
9.1
10.1
13
10
1
29
29
16
1
0
3
0
13
7
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
14
Table 13
Workloads (ri) and mean server travel times (Tui), in minutes: comparison between the Homogeneous and the Very Heterogeneous
Configurations
Server i
Homogeneous Configuration
1
2
3
4
5
6
Very Heterogeneous Configuration
Deviations (%)
ri
Tui
ri
Tui
ri
Tui
0.15
0.17
0.17
0.22
0.24
0.13
4.6
7.5
5.7
11.5
9.1
10.1
0.68
0.61
0.40
0.28
0.17
0.04
5.7
12.2
7.0
11.7
9.6
9.4
345
255
138
29
27
66
23
62
22
2
6
6
Table 14
Service rates (mi), workloads (ri) and mean server travel times (Tui), in minutes: comparison between the Homogeneous Configuration and
Calibrated Configurations
Server i
1
2
3
4
5
6
Homogeneous Configuration
Configuration calibrated with 60 min
Configuration calibrated with 50 min
mi
ri
Tui
mi
ri
Tui
mi
ri
Tui
0.0162
0.0162
0.0162
0.0162
0.0162
0.0162
0.15
0.17
0.17
0.22
0.24
0.13
4.6
7.5
5.7
11.5
9.1
10.1
0.0155
0.0148
0.0152
0.0139
0.0144
0.0142
0.16
0.19
0.18
0.25
0.26
0.14
4.7
7.6
5.8
11.7
9.6
10.3
0.0183
0.0175
0.0180
0.0163
0.0169
0.0167
0.14
0.16
0.15
0.21
0.23
0.12
4.6
7.3
5.7
11.5
9.1
10.0
4.4%, and 8.5 and 8.2 min, respectively. They also do not differ substantially from values reported for the
Homogeneous Configuration.
5. Concluding remarks
In this paper, we studied the effects of considering homogeneous versus non-homogeneous servers in
practical applications of the hypercube model. We initially presented illustrative examples with centralized and
decentralized home locations of servers, fixed and random preference dispatch policies, total and partial
backup, and infinite and zero-capacity waiting lines. Experiments with these examples showed that, depending
on the degree of non-homogeneity of the servers, model output measures can differ substantially from the
homogeneous case to when non-homogeneous servers are considered. These results imply that, in order to
appropriately analyze the situations mentioned above, as well as other situations that may arise in practical
applications, decision- and policy-makers should at least consider the use of non-homogeneous hypercube
models.
We then analyzed two practical applications of the model requiring the use of non-homogeneous servers. In
these applications, the servers had different average service times depending on the type of service offered and
their home locations. Variations in average service times had their expected impact on server workloads. The
effects on aggregate output measures such as mean travel time, however, were relatively small when
considering homogeneous versus non-homogeneous servers. Although the effects on aggregate output
measures were more significant in the illustrative examples of Section 2, the results presented in the current
paper still suggest use of non-homogeneous servers in selected real-world applications.
Also noteworthy is that it would have not been possible to evaluate the behavior of travel times at the level
of detail done here had the analysis been limited to homogeneous servers.
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
15
Another advantage of considering non-homogeneous servers in hypercube models is the calibration of mean
service times. In certain emergency systems, travel times may represent significant fractions of total service
times. Further, as these fractions increase, the adjustment of mean service times becomes important in
reflecting the effects of various geographic factors and dispatching policies on servers. An accurate service time
calibration in such cases thus suggests the need to consider non-homogeneous servers.
It is worth mentioning that in order to solve the hypercube model for non-homogeneous servers, highly
time-consuming exact methods must be used since the approximate method of Larson [22] assumes that
servers are homogeneous. For larger systems, exact methods become computationally prohibitive, especially
when embedded into optimization schemes that require repeated solutions of the hypercube model.
An alternative for evaluating systems with non-homogeneous servers that are too large for hypercube
techniques is discrete simulation. Extensions of the approximate method to deal with non-homogeneous
servers are currently limited to situations with no waiting lines [15]. An interesting topic for future research
would thus be the development of approximate methods for other non-homogeneous applications, such as
those considered in this paper. Another possible line of research would involve the embedding of these
methods into optimization procedures for the design and planning of large emergency systems.
Policy implications of the present work suggest that a detailed and accurate statistical analysis of the data
pertaining to a given application must precede the design of server-to-customer systems. Issues such as
appropriate home locations of servers, the nature of dispatch policies, backup systems, the relative magnitude
of travel times, and the nature of resulting waiting lines must be carefully considered in light of the
corresponding data. It would then be possible to determine whether a simplified model with homogeneous
servers could suffice to correctly describe the system under design, or whether a more sophisticated model with
non-homogeneous servers would need to be considered.
Acknowledgments
The authors thank two anonymous referees and Prof. Barnett R. Parker for their useful comments and
suggestions. This research was supported by the Brazilian National Research Council (CNPq Grants 471919/
01-2, 305042/03-3, 522973/95-7) and by FAPERJ, the Research Foundation of the state of Rio de Janeiro
(FAPERJ Grant E26/151.934/2000).
References
[1] Larson RC. A hypercube queueing model for facility location and redistricting in urban emergency services. Computers & Operations
Research 1974;1:67–95.
[2] Brotcorne L, Laporte G, Semet F. Ambulance location and relocation models. European Journal of Operational Research
2003;147:451–63.
[3] Kolesar P, Swersey AJ. The deployment of urban emergency units: a survey. In: Swersey A, Ignall E, editors. Delivery of urban
services—TIMS studies in the management sciences, vol. 22. Amsterdam: Elsevier Science B.V.; 1986. p. 87–119.
[4] Swersey AJ. The deployment of police, fire and emergency medical units. In: Pollock SM, editor. Handbooks in OR & MS, vol. 6.
Amsterdam: Elsevier Science B.V.; 1994. p. 151–200.
[5] Brandeau ML, Larson RC. Extending and applying the hypercube model to deploy ambulances in Boston. In: Swersey A, Ignall E,
editors. Delivery of urban services—TIMS studies in the management sciences, vol. 22. Amsterdam: Elsevier Science, B.V.; 1986.
p. 121–53.
[6] Burwell TH, Jarvis JP, McKnew MA. Modeling co-located servers and dispatch ties in the hypercube model. Computers &
Operations Research 1993;20:113–9.
[7] Chelst KR, Barlach Z. Multiple unit dispatches in emergency services: models to estimate system performance. Management Science
1981;27(12):1390–409.
[8] Sacks SR, Grief S. Orlando Police Department uses OR/MS methodology, new software to design patrol districts. OR/MS Today
1994;February:30–2.
[9] Larson RC, Odoni AR. Urban Operations Research. EnglewoodCliffs, N.J: Prentice Hall, Inc.; 1981.
[10] Larson RC. OR models for homeland security. OR/MS Today 2004;31:22–9.
[11] Takeda RA, Widmer JA, Morabito R. Analysis of ambulance decentralization in an urban emergency medical service using the
hypercube queueing model. Computers & Operations Research 2007;34(3):727–41.
[12] Mendonc- a F, Morabito R. Analysing emergency medical service ambulance deployment on a Brazilian highway using the hypercube
model. Journal of the Operational Research Society 2001;52:261–70.
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002
ARTICLE IN PRESS
16
R. Morabito et al. / Socio-Economic Planning Sciences ] (]]]]) ]]]–]]]
[13] Iannoni AP, Morabito R. A multiple dispatch and partial backup hypercube queueing model to analyze emergency medical systems
on highways. Transportation Research, Part E, doi:10.1016/j.tre.2006.05.005, 2007, to appear.
[14] Halpern J. Accuracy of estimates for the performance criteria in certain emergency service queueing systems. Transportation Science
1977;11:223–42.
[15] Jarvis JP. Approximating the equilibrium behavior of multi-server loss systems. Management Science 1985;31:235–9.
[16] Batta R, Dolan JM, Krishnamurthy NN. The maximal expected covering location problem: Revisited. Transportation Science
1989;23:277–87.
[17] Chiyoshi FY, Galvão RD, Morabito R. A note on solutions to the maximal expected covering location problem. Computers &
Operations Research 2003;30(1):87–96.
[18] Saydam C, Aytug H. Accurate estimation of expected coverage: revisited. Socio-Economic Planning Sciences 2003;37:69–80.
[19] Galvão RD, Chiyoshi FY, Morabito R. Towards unified formulations and extensions of two classical probabilistic location problems.
Computers & Operations Research 2005;32:15–33.
[20] Atkinson JB, Kovalenko IN, Kuznetsov N, Mykhalevych KV. Heuristic solution methods for a hypercube queueing model of the
deployment of emergency systems. Cybernetics and Systems Analysis 2006;42(3):379–91.
[21] Atkinson KE. An introduction to numerical analysis. New York: Wiley; 1978.
[22] Larson RC. Approximating the performance of urban emergency service systems. Operations Research 1975;23:845–68.
Please cite this article as: Morabito R, et al. Non-homogeneous servers in emergency medical systems: Practical applications using the
hypercube queueing model. Socio-Economic Planning Sciences (2007), doi:10.1016/j.seps.2007.04.002