Acta Biotheor (2011) 59:53–66 DOI 10.1007/s10441-010-9107-8 REGULAR ARTICLE The Sunk-cost Effect as an Optimal Rate-maximizing Behavior Theodore P. Pavlic • Kevin M. Passino Received: 7 March 2009 / Accepted: 30 June 2010 / Published online: 24 July 2010 Springer Science+Business Media B.V. 2010 Abstract Optimal foraging theory has been criticized for underestimating patch exploitation time. However, proper modeling of costs not only answers these criticisms, but it also explains apparently irrational behaviors like the sunk-cost effect. When a forager is sure to experience high initial costs repeatedly, the forager should devote more time to exploitation than searching in order to minimize the accumulation of said costs. Thus, increased recognition or reconnaissance costs lead to increased exploitation times in order to reduce the frequency of future costs, and this result can be used to explain paradoxical human preference for higher costs. In fact, this result also provides an explanation for how continuing a very costly task indefinitely provides the optimal long-term rate of gain; the entry cost of each new task is so great that the forager avoids ever returning to search. In general, apparently irrational decisions may be optimal when considering the lifetime of a forager within a larger system. Keywords Solitary animal behavior Patch residence time Rationality Concorde fallacy Escalation error Optimal foraging theory T. P. Pavlic (&) K. M. Passino Department of Electrical and Computer Engineering, The Ohio State University, 205 Dreese Labs, 2015 Neil Avenue, Columbus, OH 43210, USA e-mail: [email protected] K. M. Passino e-mail: [email protected] K. M. Passino Department of Evolution, Evology, and Organismal Biology, The Ohio State University, 318 W. 12th Avenue, Columbus, OH 43210, USA 123 54 T. P. Pavlic, K. M. Passino 1 Introduction Influential work by Schoener (1971) postulates that natural selection favors behaviors on a continuum from foraging time minimization to net energetic gain maximization. Techniques developed by Pyke et al. (1977) and Charnov (1973) use gain-to-time rate maximization to quantitatively predict which behavior from this continuum will be favored in particular cases. These techniques have been popularized by Stephens and Krebs (1986) as the prey and patch models of classical optimal foraging theory (OFT). They respectively describe which prey foragers should include in their diet and how long foragers should exploit a patch of prey, which are the two central questions of solitary foraging theory. The rate-maximizing prey and patch models have mixed success in explaining behavioral observations in the field. The prey model accurately describes patterns of preference, and the patch model makes accurate predictions about how foraging durations should change when background parameters change, but the patch model has poor success in many cases when predicting actual foraging durations (Stephens and Krebs 1986; Nonacs 2001; Sih and Christensen 2001). In a review by Nonacs (2001), several examples are picked that show foraging durations tend to be longer than expected by the classical patch model. Nonacs concludes that OFT is incomplete and that optimal behavior is not described by rate maximization. However, recent studies show how observed behaviors that are inconsistent with classical OFT are indeed rate maximizing under an adjusted foraging model. For example, shorebirds that appear to violate classical OFT are shown to be rate maximizers when explicitly modeling digestive constraints (van Gils et al. 2005) and the value of information discovery (van Gils et al. 2003). Here, we show how explicitly modeling foraging costs can answer Nonacs’ criticisms of the classical OFT’s underpredictions of foraging duration. Additionally, we show how the same modifications lead to foraging theoretic explanations of the sunk-cost effect (Arkes and Blumer 1985; Arkes and Ayton 1999; Kanodia et al. 1989; Staw 1981), which describes behaviors that invest more time in the more costly of two otherwise equivalent resources. This effect is also known as the Concorde fallacy (Dawkins and Carlisle 1976) because it describes an apparent logical fallacy analogous to the continued development of a supersonic jet that never returns a profit. However, by giving a foraging theoretic explanation, we show that the fallacy is actually an optimal behavior. This explanation is also consistent with observations of animals that commit longer feeding times after moving into areas where prey requires greater energy to acquire (Nolet et al. 2001). Additionally, if a forager’s uncertainty in estimating patch quality is modeled as an initial average cost associated with each patch type, the extended foraging durations predicted by our augmented OFT is consistent with the ‘‘wait for good news’’ behaviors predicted and observed in supposed Bayesian foragers (Olsson and Holmgren 1998; van Gils et al. 2003; Olsson and Brown 2006). This paper is organized as follows. First, in Sect. 2, we summarize the principal results from classical OFT. Next, in Sect. 3, we present a common empirical criticism of OFT and provide a mathematical response supporting OFT. In Sect. 4, 123 The Sunk-Cost Effect as an Optimal Rate-Maximizing Behavior 55 we augment the traditional graphical analysis method from OFT to include effects of search and in-patch costs. This analysis shows how explicitly including costs leads to longer patch residence times that are more consistent with observed behavior. We use similar graphical methods in Sect. 5 to provide a detailed explanation of the sunk-cost effect as an adaptive rate-maximizing behavior. Finally, in Sect. 6, we give some concluding remarks. 2 Classical Optimal Foraging Theory Optimal foraging theory models a forager that faces n 2 N types of prey. The forager encounters prey of type i [ {1, 2, …, n} according to an independent Poisson counting process with rate ki [ 0. Each encountered prey item of type i is picked for handling with probability pi [ [0,1], and those chosen items are exploited for si C 0 time on average. The forager pays a cost (e.g., energy) of cs per unit search time, but it receives an average reward of gi(si) after handling an item of type i. Assuming that natural selection favors foragers that maximize their lifetime gain, behaviors should trade increased reward per encounter for increased number of encounters in a lifetime. So the optimal behavior will maximize the rate P cs þ ni¼1 ki pi gi ðsi Þ P : ð1Þ 1 þ ni¼1 ki pi si The prey model treats the case where si is fixed for each type i and the forager must make a binary choice to handle or ignore each encountered prey. The patch model treats the opposite case where p1 = p2 = _ = pn = 1 (i.e., all encountered prey are exploited) and the forager must choose when to stop exploiting each prey. A central result of the patch model is the marginal value theorem (MVT) (Charnov 1973, 1976), which states that nonzero exploitation times s1 ; s2 ; . . .; sn that make P cs þ ni¼1 ki gi ðsi Þ 0 P gi ðsi Þ ¼ and g00i ðsi Þ\0 ð2Þ 1 þ ni¼1 ki si for each type i will correspond to an optimal patch foraging behavior. That is, behaviors that equate per-prey rate of gain with per-lifetime rate of gain will appropriately balance present reward with future opportunity cost so as to be optimal over a lifetime. 3 OFT Criticism and Explicit Processing Costs Nonacs (2001) bases his criticisms on the observation that Eq. 2 implies that g01 ðs1 Þ ¼ g02 ðs2 Þ ¼ ¼ g0n ðs Þ: That is, if behaviors in nature could be described by the MVT, then every per-type rate of average gain (i.e., ‘‘speed’’ of gain) should be equal to some global giving-up density (GUD). In the studies that Nonacs reviews, foragers stop exploiting different prey types at different speeds, which leads him to the conclusion that the MVT does not hold in reality. 123 56 T. P. Pavlic, K. M. Passino As discussed by Stephens and Krebs (1986) the function gi is not meant to be a gross observable reward to the forager. Instead, it models the energetic reward to the forager minus the internal handling cost, which is usually not externally observable. To make this distinction clearer, we explicitly introduce an average handling cost ci(si) for exploiting prey of type i for an average of si time, and we let gi(si) be the corresponding average observable reward. Thus, the forager should maximize P cs þ ni¼1 ki pi ðgi ðsi Þ ci ðsi ÞÞ P : ð3Þ 1 þ ni¼1 ki pi si In this framework, the MVT states that times s1 ; . . .; sn that make P cs þ ni¼1 ki gi ðsi Þ ci ðsi Þ 0 0 P gi ðsi Þ ci ðsi Þ ¼ and g00i ðsi Þ c00i ðsi Þ\0 ð4Þ 1 þ ni¼1 ki si will correspond to an optimal patch behavior. From Eq. 4, it must be that g01 ðs1 Þ c01 ðs1 Þ ¼ g02 ðs2 Þ c02 ðs2 Þ ¼ ¼ g0n ðsn Þ c0n ðs2 Þ: The rare case when the observable rewards are such that g01 ðs1 Þ ¼ g02 ðs2 Þ ¼ ¼ g0n ðsn Þ is when the internal handling cost rates are also all equal, and so a global GUD is exceptional. Differences in terminal handling densities across types reflect differences in the handling burden of those types. For example, for types with the same reward–time curve and linear handling cost functions (i.e., ci ðsi Þ,ci si for type i and constant ci [ 0), smaller marginal handling costs (i.e., c0i ðsi Þ ¼ ci for type i) should lead to longer optimal exploitation times—foragers spend relatively less time in patches with steep handling-cost functions. 4 Graphical Optimization and Long Residence Times The intuitive graphical analysis approach frequently used by Stephens and Krebs (1986) for the single-type case can be extended to our model, which makes all costs explicit and uses multiple types. To do so, we define k,k1 þ k2 þ þ kn to be the average Poisson encounter rate for all prey types combined. Then 1/k is the average search time between encounters, and n n n X X X ki ki ki pi gi ðsi Þ; c, pi ci ðsi Þ; and s, pi s i g, k k k i¼1 i¼1 i¼1 are the average per-encounter handling gain, cost, and time. Equation 3 then becomes s g c ck : 1 kþs ð5Þ Graphical optimization of Eq. 5 is shown in Fig. 1. Each point in the shaded area corresponds to a particular choice of preferences, p1, p2, …, pn, and exploitation times, s1,…, sn. The upper boundary of the shaded area is the optimal frontier on which all optimal behaviors are found. Optimization consists of rotating a ray originating from (-1/k,cs/k) counter-clockwise from a 123 The Sunk-Cost Effect as an Optimal Rate-Maximizing Behavior 57 Fig. 1 Effect of searching on optimal patch residence times. As search cost cs or average interarrival time 1/k increases, the slope R* of the optimal ray will decrease, and the optimal average exploitation time s will increase -90 angle to the least upper bound of the angles less than 90 where the ray intersects the shaded region. The slope of the optimal ray R* is the optimal value of Eq. 5, and the corresponding values ðg cÞ and s are the optimal per-encounter average net handling gain and exploitation time. Hence, this graphical interpretation of rate maximization allows for predictions of how perturbing parameters of the model will impact the average handling time s: An increase in the average handling time may be due to an increase in the optimal exploitation time for each patch (i.e., changes in si for each type i) or due to additional patch types being added to the optimal diet (i.e., changes in pi for each type i) or a mixture of the two effects. This graphical approach does not suggest how to predict each individual si or pi ; it is only meant to investigate how certain parameter perturbations will affect general foraging trends. Specific optimal diet content and patch exploitation times can be found using algorithms from classical OFT (Charnov 1973, 1976; Stephens and Krebs 1986; Pavlic 2007). When Stephens and Krebs (1986) consider graphical solutions to the single-type case, they frequently discuss how changes in encounter rate k1 lead to changes in optimal behavior. For multiple types, the overall rate k can be changed while holding the density ki/k constant for each type i. The graphical example in Fig. 1 shows that the optimal average exploitation time s will increase if either the overall encounter rate k decreases or the search cost rate cs increases. In both cases, to maximize lifetime reward, the forager must increase present gains to compensate for future search losses—the opportunity cost of more exploitation decreases when search time or cost increase. Likewise, if the average per-encounter handling cost c is increased, the g c curve will be shifted down the vertical axis and the optimal average exploitation time s will increase. This prediction suggests an explanation for the ostensibly anomalous observation by Nolet et al. (2001) that tundra swans spend more time feeding in areas of deep water where more energy is required to acquire similar prey as in shallow water. The increased costs necessarily decrease the maximal long-term rate of gain, which is the opportunity cost of increased exploitation, and so patches are exploited longer. This effect is investigated in more detail in Sect. 5. 5 The Sunk-Cost Effect Graphical optimization of Eq. 3 helps explain apparently irrational behaviors discussed in the fields of economics, psychology, and biology in terms of lifetime 123 58 T. P. Pavlic, K. M. Passino gain maximization. In economics, the propensity to continue a costly task after paying some initial cost is sometimes called an escalation commitment, escalation behavior, or escalation error, but it is more commonly known as the sunk-cost effect (Kanodia et al. 1989; Staw 1981). This nomenclature is also used by psychologists (e.g., Arkes and Blumer 1985). In fact, psychologists Arkes and Ayton (1999) note that the sunk-cost effect is equivalent to the Concorde fallacy first described by biologists Dawkins and Carlisle (1976). Although none of these terms are used, the same phenomena is also observed by Nolet et al. (2001). In particular, tundra swans must expend more energy to ‘‘up-end’’ to feed on deep-water tuber patches than they do to ‘‘head-dip’’ to feed on shallow-water patches; however, contrary to the expectations of Nolet et al., the swans feed for a longer time on each high-cost deep-water patch. In every context, the observation of the sunk-cost effect is an enigma because intuition suggests that this behavior is suboptimal. Here, we show how optimization of Eq. 3 predicts the sunk-cost effect for certain scenarios; a common element of every case is a large initial cost. 5.1 Initial Costs: Recognition, Acquisition, Reconnaissance For simplicity, we make a graphical argument with the assumptions that n = 1, cs = 0, and p1 = 1 (i.e., the patch case). We also revert to interpreting g1 as the average net handling gain (i.e., the sum of an observable gain and an internal cost). Under these assumptions, g1 ðs1 Þ 1 k 1 þ s1 ð6Þ is the objective function for optimization. Similar arguments can be made about the general multivariate form in Eq. 3. Consider the case when gain function g1 is initially: (1) (2) (3) negative (i.e., g1(0) \ 0), increasing (i.e., g01 ð0Þ [ 0), and decelerating (i.e., g001 ð0Þ\0Þ Functions meeting items (2) and (3) are treated by Stephens and Krebs (1986). However, these functions are all initially zero. Here, we let the average handling gain g1(s1) be initially negative to reflect some initial cost. For example, rather than treating recognition cost as a separate quantity (e.g., Stephens and Krebs 1986, pp. 79–81), we consider them to be a characteristic of the gain returned to the forager from handling each item (e.g., an initial energy expenditure required simply for access to or identification of a prey). Alternatively, as in the case of the tundra swans observed by Nolet et al. (2001), the initial cost may model the extra energy required to acquire prey (e.g., by ‘‘up-ending’’ instead of simply ‘‘head-dipping’’). Furthermore, just as Olsson and Brown (2006) show how Bayesian foragers receive a relative ‘‘foraging benefit of information’’ for spending extra time in a patch, the initial cost here may be viewed as an average penalty due to uncertainty about the patch gain function. 123 The Sunk-Cost Effect as an Optimal Rate-Maximizing Behavior 59 Fig. 2 Different recognition costs r2 [ r1 [ r0 have different optimal exploitation times s12 [ s11 [ s10 and optimal rates R2 \R1 \R0 : For two equally shaped gain functions, the one with the higher initial cost will also have the higher optimal residence time (a) (b) (c) The graphical optimization procedure for such a initially negative gain function is shown in Fig. 2. The same gain function is shown in Figs. 2a–c except that initial recognition cost is low, moderate, and high, respectively. When the initial cost increases from r0 to r1, the optimal exploitation time increases from s10 to s11 : Even when the initial cost increases from r1 to r2 and makes the gain function strictly negative, the exploitation time still increases from s11 to s12 : This propensity to exploit items longer when the initial cost is increased is exactly the sunk-cost effect (Arkes and Ayton 1999; Arkes and Blumer 1985). However, it also maximizes the long-term net rate of gain, and so it is the rational behavior. This result can be explained using an opportunity cost interpretation (Houston and McNamara 1999; Stephens and Krebs 1986). The optimal rate R* is an opportunity cost for spending additional time exploiting an item. It represents the maximum expected gain possible per unit time, and so it is costly to exploit items of a particular type for so long that the type’s rate of average gain falls below R*. When the initial cost of handling increases on all encounters, the total gain decreases, and so the opportunity cost for additional exploitation decreases (e.g., R1 \R0 ). Thus, the marginal handling gain must decrease further in order for marginal costs and 123 60 T. P. Pavlic, K. M. Passino marginal benefits to equalize. This effect manifests itself through the increase in optimal exploitation time. By increasing exploitation time when initial cost increases, the forager reduces the amount of time it spends paying high costs. Leaving patches at an earlier time is equivalent to volunteering for paying recognition costs more frequently. It is not the increased initial cost on a single encounter that is important; it is the increased initial cost on all encounters that causes the decrease in opportunity cost for additional exploitation. 5.2 Human and Nonhuman Examples 5.2.1 Feeding Time Increases in Areas of High-Cost Prey In a study by Nolet et al. (2001), the foraging behavior of tundra swans in shallow water is compared to the behavior in deep water. The swans forage on belowground tubers, and thus the swans need only to ‘‘head-dip’’ in shallow water whereas they need to ‘‘up-end’’ in deep water to find and retrieve the tubers. The model that the authors use to explain the tundra swan behavior predicts that there will be a decrease in feeding time in areas where feeding has larger power requirements. However, in deep water where the power requirements to up-end are apparently larger than the requirements to head-dip, the observed tundra swans spent longer times feeding on tuber patches. Although this result is a contradiction of the model used by Nolet et al., it follows directly from the discussion in Sect. 5.1. In particular, let n C 1 (i.e., multiple tuber patch types are available) and p1 = _ = pn = 1 (i.e., encounters are not ignored). For each patch type i, the average net gain function gi is assumed to be initially negative in order to model the energetic burden of acquiring the belowground tubers. Hence, we use the classical marginal value theorem result in Eq. 2 applied to initially negative gain functions. The curve in Fig. 3a models the average value gi of a tuber patch of type i after exploiting the patch for time si. The average cost ci of a patch entry reduces the maximum possible long-term rate of gain R* from all patches. The MVT predicts that the optimal exploitation time si occurs when g0i ðsi Þ ¼ R ; which is depicted in Fig. 3b for three different R* maximum rates corresponding to three different ci entry costs. As the average entry cost of a patch type increases, the maximum rate decreases, and so average exploitation time increases. When R* is very high either due to low cost ci or high return rates from other patches, the optimal commitment time drops to zero because the shallow value function is dominated by steep returns from other options. 5.2.2 Human Preference for High Price One recorded example of the sunk-cost effect in humans comes from studying behavior at the cinema. In a controlled study, Arkes and Blumer (1985) charge movie patrons different prices for tickets to see a particular movie. They show that the likelihood that people attend the movie is positively correlated with the cost of the ticket, and they conclude that this behavior is an irrational mistake. However, 123 The Sunk-Cost Effect as an Optimal Rate-Maximizing Behavior 61 (a) (b) Fig. 3 Optimal exploitation time for patch type i with acquisition cost ci. Increased costs lead to decreased overall return rate R*, which leads to greater exploitation time in each costly patch. That is, when most encountered patches are costly to enter, exploitation time in each of these high-cost patches will be long in order to reduce their frequency. However, when enough other low-cost patches are available to raise the overall return rate sufficiently high, exploitation time to patches with relatively shallow marginal returns may decrease to zero. That is, when there are enough high-quality patches, time is better spent searching for them this result can also be predicted by the classical marginal value theorem result in Eq. (2) applied to initially negative gain functions.1 Again, consider the case where n C 1 (i.e., multiple types are available) and p1 = _ = pn = 1 (i.e., encounters are not ignored). For each activity type i, let the gain function gi(si) be parameterized by commitment time si that represents the average time the individual is committed to participating in that type of activity. That is, if the commitment time for a type of activity is low, there is a low probability that the individual will complete the activity, and thus the average gain returned from this type of activity will also be low. Likewise, if the commitment time for the activity is high, it will likely be attended, and so the value returned will level off as it returns no value after it is attended. Hence, the analysis summarized in Fig. 3 also applies here. As ticket price 1 Here, to be consistent with Arkes and Blumer (1985), we do not allow encounters to be ignored, and so initial costs are forced and the pure patch model predicts the optimal behavior. The combined prey–patch model better fits reality as ticket purchasing opportunities can be ignored. 123 62 T. P. Pavlic, K. M. Passino increases for this set of individuals who are forced to buy the ticket, they become more strongly committed to attending the movie.2 This example matches classical economic intuition. As the price a consumer pays increases, the overall purchasing frequency decreases. The longer commitment to a higher-priced movie reflects reluctance to make more purchases. Over the long term, this behavior maximizes the net value the consumer holds collectively in goods and capital. Behaving in any other way would be reckless when viewed from a long-term opportunity-cost perspective. In particular, exploitation times are longer in order to accumulate more gain to justify returning to a search that will likely result in finding more ‘‘prey’’ that force high initial costs. Thus, the sunk-cost effect may seem nonsensical for any single purchase, but it is adaptive when considering an economic or ecological system view. 5.3 Escalation Behavior The propensity to continue a costly task ad infinitum is an extreme form of the sunkcost effect known as escalation behavior or escalation error (Kanodia et al. 1989; Staw 1981). In fact, the Concorde fallacy gets its name from the example given by Dawkins and Carlisle (1976) of continued government investment on a supersonic jet that has no promise of ever returning a profit. Here, we show how escalation behavior can be an adaptive rate-maximizing strategy under special circumstances. These examples presume that the optimal long-term rate of gain R* is negative, and so they are a better fit for economic examples where prolonged deficits are allowed. However, as several studies show that foragers can dynamically adjust behavior to maximize the present estimate of long-term rate of gain (Giraldeau and Livoreil 1998; Giraldeau and Caraco 2000; Sih and Christensen 2001), these examples may also be consistent with short-term trends in behavior when foragers are temporarily subjected to high-cost environments. Additionally, this analysis may apply to similar mathematical models of value maximization when behaviors must choose from a set of costly options. For example, in spiders that choose to join existing webs or spin their own (Jakob 2004), small spiders face potentially dangerous competition with large spiders in new webs but also face high costs in building new webs. Also, studies of women in abusive relationships suggests their decisions to leave the relationship are affected by fear of violence toward pets that remain (Faver and Strand 2003). In both of these cases, the optimal behavior may be accompanied by costs that might seem prohibitive in isolation. As in Sect. 5.1, we use the simplified optimization objective given by Eq. 6. In Fig. 4, a positive constant gain function is shown as the limit of asymptotic gain functions of increasing steepness. Because of the asymptotic upper limit, sharp increases in gain correspond to sharp decreases in marginal returns. So the optimal patch residence time decreases as steepness increases because marginal returns fall so quickly. In the limiting case, when the gain function is constant, the optimal behavior is to leave immediately because staying longer cannot result in any 2 If the experimenters allowed for encounters to be ignored (i.e., if participants could choose to not purchase a ticket), movies with zero commitment times would also have zero ticket sales. 123 The Sunk-Cost Effect as an Optimal Rate-Maximizing Behavior 63 Fig. 4 Analysis of positive constant gain function. The gain function is the limit of smooth asymptotic gain functions of increasing steepness. The limiting optimal behavior is to leave the patch immediately. Staying longer returns no additional gain, and leaving immediately decreases the time between subsequent high-quality patches additional gain. In particular, it is better to invest time searching for new patches that guarantee an entry reward g at regular rate k1 (i.e., long-term rate of gain k1g) rather than staying in any single patch that provides no additional gain (i.e., longterm rate of gain g/?& 0). The opposite effect is demonstrated in Fig. 5, which shows a negative gain function as the limit of asymptotic gain functions of decreasing steepness. As the steepness of each gain function is reduced, the optimal patch residence time increases without bound. So the optimal behavior for a constant gain function that is negative is to remain forever in each encountered patch. It is better to pay a single recognition cost once and remain in the patch forever rather than paying the recognition cost repeatedly at each new encounter. As shown in the lower portion of Fig. 5, even when the gain function is initially decreasing and convex, the optimal behavior is not to immediately leave the patch unless the initial slope is sufficiently negative. It is better for the forager to pay additional costs within the patch rather than face the -k1g lifetime rate of gain Fig. 5 Analysis of negative constant gain function. Escalation error can be viewed as the limit of increasingly shallow gain functions that are initially negative. Zero patch residence time is not restored until steepness is sufficiently negative. For this class of functions, optimal points will never be found in the shaded region between the horizontal axis and the constant gain function 123 64 T. P. Pavlic, K. M. Passino Fig. 6 Initially negative concave gain function has latent escalation behavior that dominates the early local maximum. Leaving the patch immediately returns a higher long-term gain than staying a short time longer, but it returns a lower gain than staying for any time greater than T. Hence, escalation behavior is optimal because it prevents paying any future entry costs associated with immediately leaving the patch. However, escalation behavior is avoided in this case because the convex gain function eventually becomes too steep. A different effect occurs for initially negative and initially steep concave functions like the one shown in Fig. 6. Here, immediate withdrawal from the patch is a locally optimal behavior. That is, the objective function decreases as the exploitation time is increased slightly. However, as exploitation time increases, the objective function reaches its global minimum and then starts increasing. Moreover, there exists a time T such that exploiting these patches for any time s1 C T will be favorable to immediate withdrawal. Again, it is better to exploit patches with a shallow negative gain longer than to search for more patches that have a steep gain. In the supersonic jet construction example (Dawkins and Carlisle 1976), the costly behavior may be adaptive because it prevents starting other costlier ventures. More generally, successful politicians in power may induce endless and massive spending in one area for no other reason than to avoid new spending in other areas. In a foraging context, a forager that must pay a large energetic cost to enter a patch (e.g., due to high predation risk, having to swim upstream, or high uncertainty about patch quality) may hesitate to ever leave that patch if the habitat contains few other types of patches. Similarly, social group members may resist the temptation to leave a low-quality group if joining any other group is accompanied by costly antagonistic violence (e.g., Faver and Strand 2003; Jakob 2004). In general, in a set of apparently bad choices, the choice that has the lowest eventual marginal losses will minimize the very long-term losses. 6 Conclusions Optimal foraging theory introduces quantitative analysis into the study of behavior in a way that complements intuition. We have shown how explicitly modeling foraging costs can improve the accuracy of rate-maximizing OFT methods and answer the criticisms of Nonacs (2001) that rate maximization too often underestimates patch residence time. As verified by van Gils et al. (2003), the Bayesian forager modeled by Olsson and Holmgren (1998) accurately predicts observed patch residence times. This result is investigated by Olsson and Brown (2006) who derive a ‘‘foraging benefit of information’’ that is an additional reward 123 The Sunk-Cost Effect as an Optimal Rate-Maximizing Behavior 65 that Bayesian foragers receive for staying longer in patches to gather information. As these models investigate foraging behavior using numerical dynamic programming techniques, they lack the intuition of classical OFT. However, if the discovery penalty of the information gathering process is modeled as an initial in-patch cost, then relatively simple classical OFT methods also predict longer exploitation times. We also show how explicitly including large initial in-patch costs in classical OFT methods explains why sunk-cost effects like those observed by Nolet et al. (2001) are rational rate-maximizing foraging behaviors. This result seems counterintuitive, but it follows from using a farsighted decision-making model based on comparing present returns to the opportunity cost of not searching. When encounters with costly patches are likely, it is better to spend more time in each patch in order to reduce the accumulation rate of said costs. This interpretation may also explain human preference for high price (e.g., Arkes and Blumer 1985) and the escalation of costly behaviors ad infinitum (e.g., Dawkins and Carlisle 1976; Faver and Strand 2003). Sunk-cost behaviors are criticized because they are examples of making decisions based on past investments; however, they should be better understood as decisions that reduce the frequency of similar future investments. Acknowledgments We thank Thomas A. Waite for his helpful insights and instruction and Ian M. Hamilton for his comments on this paper. We also thank two anonymous referees for their help in improving this paper. Additionally, the comments of three anonymous reviewers on a related submission have also been influential in the presentation of this work. References Arkes HR, Ayton P (1999) The sunk cost and Concorde effects: are humans less rational than lower animals? Psychol Bull 125(5):591–600 Arkes H, Blumer C (1985) The psychology of sunk cost. Organ Behav Hum Decis 35:124–140 Charnov EL (1973) Optimal foraging: some theoretical explorations. PhD thesis, University of Washington Charnov EL (1976) Optimal foraging: the marginal value theorem. Theor Popul Biol 9(2):129–136 Dawkins R, Carlisle TR (1976) Parental investment, mate desertion and a fallacy. Nature 262(5564): 131–133 Faver CA, Strand EB (2003) To leave or to stay? J Interpers Violence 18(12):1367–1377. doi:10.1177/ 0886260503258028 Giraldeau LA, Caraco T (2000) Social foraging theory. Princeton University Press, Princeton Giraldeau LA, Livoreil B (1998) Game theory and social foraging. In: Dugatkin LA, Reeve HK (eds) Game theory and animal behavior. Oxford University Press, New York, pp 16–37 Houston AI, McNamara JM (1999) Models of adaptive behavior. Cambridge University Press, Cambridge Jakob EM (2004) Individual decisions and group dynamics: why pholcid spiders join and leave groups. Anim Behav 68(1):9–20 doi:10.1016/j.anbehav.2003.06.026 Kanodia C, Bushman R, Dickhaut J (1989) Escalation errors and the sunk cost effect: an explanation based on reputation and information asymmetries. J Account Res 27(1):59–77 Nolet BA, Langevoord O, Bevan RM, Engelaar KR, Klaassen M, Mulder RJW, Dijk SV (2001) Spatial variation in tuber depletion by swans explained by differences in net intake rates. Ecology 82(6):1655–1667 doi:10.1890/0012-9658(2001)082[1655:SVITDB]2.0.CO;2 Nonacs P (2001) State dependent behavior and the marginal value theorem. Behav Ecol 12(1):71–83 Olsson O, Brown JS (2006) The foraging benefits of information and the penalty of ignorance. Oikos 112(2):260–273 doi:10.1111/j.0030-1299.2006.13548.x Olsson O, Holmgren NMA (1998) The survial-rate-maximizing policy for bayesian foragers: wait for good news. Behav Ecol 9(4):345–353 123 66 T. P. Pavlic, K. M. Passino Pavlic TP (2007) Optimal foraging theory revisited. Master’s thesis, The Ohio State University, Columbus. http://www.ohiolink.edu/etd/view.cgi?acc_num=osu1181936683 Pyke GH, Pulliam HR, Charnov EL (1977) Optimal foraging: a selective review of theory and tests. Q Rev Biol 52(2):137–154 Schoener TW (1971) Theory of feeding strategies. Annu Rev Ecol Syst 2:369–404 Sih A, Christensen B (2001) Optimal diet theory: when does it work, and when and why does it fail? Anim Behav 61(2):379–390 doi:10.1006/anbe.2000.1592 Staw BM (1981) The escalation of commitment to a course of action. Acad Manag Rev 6(4):577–587 Stephens DW, Krebs JR (1986) Foraging theory. Princeton University Press, Princeton van Gils JA, Schenk IW, Bos O, Piersma T (2003) Incompletely informed shorebirds that face a digestive constraint maximize net energy gain when exploiting patches. Am Nat 161(5):777–793. doi:10.1086/ 374205 van Gils JA, de Rooij SR, van Belle J, van der Meer J, Dekinga A, Piersma T, Drent R (2005) Digestive bottleneck affects foraging decisions in red knots shape Calidris canutus. I. prey choice. J Anim Ecol 74(1):105–119. doi:10.1111/j.1365-2656.2004.00903.x 123
© Copyright 2026 Paperzz