Choice, Consideration Sets and Attribute Filters∗ Mert Kimya † Koç University, Department of Economics, Rumeli Feneri Yolu, Sarıyer 34450, Istanbul, Turkey August 26, 2016 Abstract It is well known that decision makers do not always consider all of the available alternatives when making a choice. When the alternatives have attributes, these attributes provide a natural way to form the consideration set. I assume a procedure in which the decision maker uses the relative ranking of the alternatives on each attribute to reduce the size of the choice set. I provide a characterization of the procedure and illustrate how to identify the underlying preference and consideration set. The model explains certain choice anomalies such as the attraction and compromise effects. JEL classification: D01; D03; D11 Keywords: Bounded Rationality; Choice; Consideration Set; Revealed Preference 1 Introduction Suppose you are searching online for a hotel in a popular destination. Once you enter the dates and the city you will realize that there are too many alternatives to choose from. In which case you will probably use the filters the website conveniently provides for you. A typical website will have filters on price, star rating, review rating and distance. Most likely you will use these filters to remove the hotels that rank poorly on some of these dimensions. Say hotels with a review rating less than 6 out of 10, with a price more than 150 dollars and distance to a point of interest more than 5 miles. Having removed some ∗ I am grateful to Mark Dean for guidance, support and encouragement. I would also like to thank Geoffroy de Clippel and two anonymous referees for helpful comments and suggestions. † E-mail Address: mert [email protected] 1 of the available alternatives this way, you would then select your preferred option from those that remain. It is well known that decision makers do not always consider all of the available alternatives when making a choice. Although there are many models that characterize how the consideration sets might be constructed, one thing that has been overlooked is in many situations alternatives have observable attributes. When the alternatives have attributes it seems natural that the decision maker (DM) uses these attributes to form the consideration set. In this paper, I formalize and characterize such a choice procedure. I model a DM who has a stable preference over a set of alternatives. The alternatives have a set of attributes and the DM has an ordering of the alternatives on each of these attributes. The ordering indicates the DM’s preference on each attribute. In the above example, an attribute is price and the DM ranks cheaper hotels higher than expensive ones on this attribute. Another attribute is distance and the DM ranks hotels with a shorter distance to the point of interest higher than the hotels with a longer distance. Our DM sets a threshold on each attribute ranking and does not consider any alternative that stays below that threshold. She maximizes her preference on the set of remaining alternatives. The thresholds are not determined arbitrarily, but instead they depend on the relative ranking of alternatives in the sense that inclusion of an alternative above (below) the threshold cannot move the threshold down (up) and it can only go up (down) to that alternative.1 This restriction ensures that if an alternative stays above (below) the threshold in some attribute and a new set is obtained by adding alternatives worse (better) than this alternative, then this alternative will again stay above (below) the threshold in this new set. I call the choice procedure that satisfy these conditions choice through attribute filters (CAF). The model has several intuitive interpretations: • As in the hotel choice example, the set might contain too many elements to consider, in which case the DM might use the objectively defined attributes to reduce the size of the set. • The DM might be using a ‘rule of thumb’ to eliminate certain alternatives based on their attribute rankings even when the size of the set is small. For instance the DM might have a fixed cutoff (see Example 5 in Section 2.2) or she might be using satisficing on one of the attribute rankings (see Example 6 in section 2.2) or she might eliminate non-dominant alternatives in a set containing asymmetrically 1 Proposition 2 in Appendix A shows that if we do not put any restriction on how the thresholds are determined then the model has little empirical content. 2 dominated alternatives (see Sections 3.2 and 4). Further examples of this kind is available in Section 2.2. CAF does not provide testable predictions on standard choice data: every observed choice function is rationalizable as CAF (see Proposition 1 in Appendix A). I consider a richer dataset, which I call ‘multicriteria choice data’. This data includes not only the final choice but also the attributes and the rankings of the DM on these attributes. Data that includes observable attributes has been used before both in theoretical and experimental papers (see de Clippel and Eliaz (2012), Gabaix et al. (2006), Huber et al. (1982), Simonson (1989), Tversky and Simonson (1993)). Multicriteria choice data is easy to collect in many settings, especially in controlled laboratory environments, where the alternatives can be defined with a set of attributes.2 I characterize the conditions under which multicriteria choice data is consistent with CAF. This is done through an understanding of what revealed consideration and revealed preference means for the model. The choice of one alternative over another does not indicate preference as the DM may have eliminated the unchosen alternative. However, if we can conclude that the unchosen alternative is considered (revealed consideration), this would imply a preference in favor of the chosen alternative (revealed preference). For example, suppose the choice of the DM is consistent with CAF and she chooses x from some set S. This would trivially imply that x is considered in S. Now assume that as we go from set S to set T , the relative position of x improves in every attribute, but y is chosen in set T . As x is not eliminated by any attribute in set S and as its relative position in every attribute improves as we move from S to T , x should not be eliminated by any attribute in set T . Hence, x is revealed to be considered in set T (revealed consideration). Furthermore, as y is chosen in T and as x is considered in T this would imply that y is revealed to be preferred to x (revealed preference). The necessary and sufficient condition for multicriteria choice data to be consistent with CAF is the acyclicity of the revealed preference. Section 3 provides the necessary and sufficient conditions for the data to be consistent with CAF. It also fully characterizes what we can learn about the DM’s preference (revealed preference) and the consideration set (revealed consideration) if the choice is CAF. The CAF model is consistent with a number of intuitive choice procedures and it is able to explain certain consistently observed choice anomalies such as the attraction and the compromise effects. While explaining these effects, I show that CAF is also inconsistent with the ‘opposites’ of each of these effects. 2 It is important to note that multicriteria choice data and CAF does not assume all of the available attributes and the rankings on these attributes is observable. It only requires that those attributes that are used for the construction of the consideration set are observable. 3 Furthermore, not only that CAF is inconsistent with the opposites of these effects while consistent with the effects themselves, it is shown that an independent model that only allows for choice anomalies in the spirit of these two effects is completely characterized by a simple class of CAF. This is shown in Section 4, where I take the axiomatic approach and directly put restrictions on what kind of choice anomalies are allowed in the data. It turns out that a model that only allows choice anomalies in the spirit of the attraction and compromise effects is completely characterized by a simple class of CAF in which there is only a single undominated alternative in each consideration set. I start with the formal description of the model in Section 2, which also includes a number of choice procedures that are examples of CAF. In Section 3, I provide the necessary and sufficient conditions for the data to be consistent with CAF and show what can be deduced about the preference and the consideration set of the DM if her choice is consistent with CAF. Section 4 studies an axiomatic model that only allows for choice anomalies in the spirit of attraction and compromise effects. This section also shows how this model is related to CAF. Section 5 includes the literature review, it also shows the independence of CAF from several influential models in the literature. Appendix A contains additional auxiliary results and Appendix B contains the proofs. 2 Model 2.1 Multicriteria Choice Data To study CAF I use multicriteria choice data. This data includes not only the choice but also the attributes and the rankings of the alternatives on these attributes. Let Z ⊂ Rk++ be a finite set of alternatives, Z denotes the set of all nonempty subsets of Z. Each l ∈ {1, 2, ..., k} denotes an attribute and zl denotes the value of alternative z on attribute l.3 For convenience I assume no ties in attribute rankings, i.e. for any z, z 0 ∈ Z with z 6= z 0 and l ∈ {1, 2, ..., k}, we have zl 6= zl0 . A choice function is a mapping c : Z → Z such that c(S) ∈ S for all S ∈ Z. It is interpreted as the choice of the DM from each set. Multicriteria choice data includes Z and c. The assumption of no ties in attribute rankings and the restriction to choice functions is only made for convenience. In Appendix A, I show that the main results are still valid with minor modifications even if we relax these assumptions by looking at choice correspondences and ties in attribute values. 3 The model is ordinal. Hence, only the order induced matters. 4 2.2 Choice Through Attribute Filters The DM has a consideration set, a mapping Γ : Z → Z, where Γ(S) ⊆ S for each S ∈ Z. Γ(S) denotes the set of alternatives she considers. She has a complete, transitive, strict and monotone preference on Z.4 Once she forms the consideration set, she is assumed to maximize on this consideration set. It is easy to see that without any restriction on Γ any observed choice would be rationalizable with this procedure.5 The existence of attributes and rankings over these attributes provide a natural way to form the consideration sets. I will consider the case where for each set S the DM has a threshold on each attribute ranking and eliminates everything below this threshold. Furthermore, the thresholds weakly move up (down) when we add an alternative above (below) the threshold. And the extent of this movement is limited to the alternative we include. This last condition ensures that the elimination depends on the relative ranking of the alternatives in the sense that if an alternative x is (not) eliminated by an attribute i in set S. And if set T can be obtained from set S by adding an alternative that ranks (worse) better than x in attribute i then x should (not) be eliminated in set T by attribute i. The following definition formalizes these restrictions. Let Ei (S) denote the set of eliminated alternatives because of attribute i. The consideration set will be given by Γ(S) = S \ (∪i=1,..,k Ei (S)). Let > 0 be a small enough number such that |zl − zl0 | > for all z, z 0 ∈ Z with z 6= z 0 and for all l ∈ {1, 2, .., k}. Definition 1. Γ is an attribute filter if there exists a threshold tS ∈ Rk for each S ⊆ Z and a mapping Ei : Z → Z for each i ∈ {1, .., k}, where Ei (S) ⊆ S, Γ(S) = S \ (∪i=1,2,..,k Ei (S)), Γ(S) 6= ∅ such that (A) Ei (S) = {z ∈ S|zi ≤ tSi } (B) If F = S ∪ x where x ∈ Z and xi ≥ tSi then xi + ≥ tFi ≥ tSi (C) If F = S ∪ x where x ∈ Z and xi ≤ tSi then xi − ≤ tFi ≤ tSi Condition (A) states that anything that stays below the threshold is eliminated in each attribute. Condition (B) ((C)) states that if an alternative is added above (below) the threshold in attribute i, then the threshold can only move up (down). However, this movement is limited by the included alternative; that the threshold can move up (down) 4 A preference is monotone if whenever zl > zl0 for all l ∈ {1, ..., k} we have z z 0 . This assumption is made only because it is reasonable, Remark 2 in Appendix A explains how the results would change if we do not make this assumption. 5 We can claim that Γ(S) = c(S) for all S. 5 enough to eliminate (save) this alternative, but not so much that it eliminates (saves) the alternative ranking above (below) this alternative. The reason we include is that the condition does not dictate whether the included alternative should or should not be eliminated. The implication of dropping the latter two conditions will be discussed in Appendix A. The following examples illustrate the wide range of procedures that lead to an attribute filter. I will show how to set these up as attribute filters in Appendix B. Example 1. (Rational Choice) Eliminate nothing Example 2. (Top n) Eliminate nothing if the set contains less than n elements. If the set contains more than n elements then consider the best n elements according to a single attribute ranking. For example, consider the n cheapest alternatives. Example 3. (Eliminate Worst n) Choose an attribute i. If the set contains less than (or equal to) n elements then consider only the best element according to attribute i. If the set contains more than n elements then eliminate the worst n elements in attribute i. For example, consider only the cheapest if the set contains less than 3 elements and eliminate the most expensive 3 elements if the set contains more than 3 elements. Example 4. (Eliminate Worst n on each attribute) The DM eliminates the worst n alternatives in each attribute ranking. If everything is eliminated then she eliminates worst n − 1 instead, if still everything is eliminated she eliminates the worst n − 2 and so on. Example 5. (Fixed Cutoff ) The DM has a fixed cutoff a∗i for each attribute i and for every set S she eliminates the alternatives that stay below the cutoff at each attribute ranking. The DM also has a fixed ordering of attributes and if everything is eliminated then she removes the cutoff of the attributes in that order until the set contains at least one element. Example 6. (Satisficing (Simon (1955))) The DM has stable preference and a threshold a∗ ∈ Z, she chooses an attribute i. She considers the best element in attribute i. If this element is not preferred to a∗ then she also considers the second best element in attribute i. She continues doing this until she finds an alternative that is preferred to a∗ . If there is nothing preferred to a∗ then she considers everything. For example, she considers the cheapest alternative. If it is better than a∗ then she doesn’t consider anything else. If not, then she considers the second cheapest alternative and so on. At this point, it is important to note that attribute filters are independent of other prominent consideration sets used in the literature. In Section 5 this is formally shown with a comparison to attention filters (Masatlioglu et al. (2012)) and consideration filters (Lleras et al. (2010)). 6 Now, we are ready to define CAF. Definition 2. Choice Through Attribute Filters (CAF) A choice function c is a choice through attribute filters (CAF) if there exists a strict, complete, transitive and monotone preference relation over Z and an attribute filter Γ such that c(S) is the -best element in Γ(S) for every S ∈ Z. 3 Identification and Representation What are the necessary and sufficient conditions for c to be CAF? And if c is CAF then what can we confidently conclude about the preference and the consideration set of the DM? This section aims to answer these questions. I will follow Masatlioglu et al. (2012) in defining the revealed preference and consideration: Definition 3. Suppose c is CAF and there are m different preference, elimination sets and consideration sets representing c, {(Γ1 , {Ei1 }i={1,..,k} , 1 ), (Γ2 , {Ei2 }i={1,..,k} , 2 ), ..., (Γm , {Eim }i={1,..,k} , m )}. Note that this is an exhaustive list of all possible representations consistent with CAF. Then • x is revealed to be preferred to y iff x j y for all j = 1, 2, ...., m • x is revealed to be considered in S iff x ∈ Γj (S) for all j = 1, ..., m • x is revealed to be non-eliminated by attribute i if x ∈ / Eij (S) for all j = 1, ..., m. This is a conservative definition as it requires for all representations to agree to conclude something about the preference or the consideration set of the DM. The following condition will be helpful in our quest to understand these. Condition 1. (RNi ) y ∈ T satisfies RNi in set T if there exists z ∈ T and a set S including y and z such that c(S) = z, yi ≥ zi 6 and • If x ∈ T and xi ≥ yi , then x ∈ S. • If x ∈ S and zi ≥ xi , then x ∈ T . Assume that c is CAF and y satisfies RNi in set T . Then z is chosen in S, so we know that z is not eliminated by attribute i in S. Consider the set F obtained from set S by removing everything that is not in T and that ranks better than z attribute i. Observe that as we go from set S to set F we are just removing alternatives that rank better than 6 Since there are no ties in attribute rankings, this means that either y = z or yi > zi 7 z in attribute i. Therefore, by (B) and (C) in Definition 1, z is not eliminated in set F .7 Since yi ≥ zi , by (A) in Definition 1, y is not eliminated in set F . But, as we go from set F to set T , we are only adding alternatives that rank below y in attribute i, together with (B) and (C) in Definition 1 this implies that y is not eliminated by attribute i in set T .8 This shows that if c is CAF and if y satisfies RNi in set T , then y is revealed to be noneliminated in set T by attribute i. Furthermore, if y satisfies RNi for each i ∈ {1, 2, ..., k} then y is revealed to be considered in set T . But if y is revealed to be considered in set T and if c(T ) 6= y then c(T ) is revealed to be preferred to y. Also by monotonicity, for any two alternatives, x is preferred to y if x dominates y. Lets define the relation P with these observations. Let x D y if x dominates y, i.e. xl > yl for all l ∈ {1, 2, ..., k}. Definition 4. xP y if x >D y or if there exists a set T containing y such that c(T ) = x, x 6= y and y satisfies RNi in set T for each attribute i. What conditions on P would ensure that the choice is consistent with CAF? The following theorem shows that acyclicity is necessary and sufficient. Theorem 1. c is CAF iff P is acyclic. Now that we know which data is consistent with CAF, the question is if the choice is consistent with CAF then what can we conclude about the preferences and the consideration sets of the DM? The next section provides a definitive answer to this question. 3.1 Identification The discussion above has shown that if xP y then x is revealed to be preferred to y. Note that since the underlying preference is transitive we can also conclude that if xP y and yP z then x is revealed to be preferred to z. Which means that if we denote by PR the transitive closure of P , then x is revealed to be preferred to y if xPR y. It turns out that PR is actually all the revealed preference we can get. Furthermore we have seen that if x satisfies RNi in set S then x is revealed to be non-eliminated by attribute i in set S. And if x satisfies RNi for each attribute i, then x is 7 Suppose z is eliminated in set F by attribute i. As we move from set F to set S we are adding alternatives that rank better than z in attribute i. But, by (B) and (C) the movement of the threshold is limited to the alternative added to the set, so none of these alternatives will be able to drive the threshold below z and hence z is eliminated in set S. A contradiction to c(S) = z. 8 As we move from set F to set T we are only adding alternatives that rank below y in attribute i, since by (B) and (C) the movement of the threshold is limited to the alternative we add, none of these alternatives can drive the threshold above y. 8 Attribute 2 x y z Attribute 1 c(xy) = c(xz) = x, c(yz) = z and c(xyz) = z Figure 1: Data corresponding to Example 7 revealed to be considered in set S. It turns out that this is all the revealed non-elimination and consideration we can get. The following theorem summarizes the results. Theorem 2. Suppose c is CAF then • x is revealed to be preferred to y iff xPR y. • x is revealed to be non-eliminated in attribute i in set S iff x satisfies RNi in set S. • x is revealed to be considered in set S iff x satisfies RNi in set S for all i ∈ {1, ..., k}. The proof of the theorem is in Appendix B. The theorem above provides a definitive answer to what we can confidently say about the consideration set and the preference of the DM by looking at her choice if the choice is CAF. The example below demonstrates revealed consideration and preference. Example 7. Suppose that Z = {x, y, z}, there are two attributes and the attribute rankings are given in Figure 1. Suppose the DM has the following (unobservable) preference, x z y and uses the following choice procedure. She always considers two alternatives and she does this by eliminating the worst |S| − 2 alternatives in attribute 1 if |S| ≥ 3. Then she maximizes her preference in the corresponding consideration set. The data corresponding to this example is in Figure 1. It is easy to see that this choice is consistent with CAF. Table 1 lists the revealed preference, revealed non-elimination, revealed consideration and the actual consideration sets. Note that Ni (S) corresponds to the revealed non-elimination by attribute i. Lets look at the set {x, z}. As the choice is x, we know that x trivially satisfies RN1 and RN2 . As z ranks higher than x in attribute 1 this implies that z also satisfies RN1 . Finally, as z is chosen in {x, y, z} and as we go from {x, y, z} to {x, z} we are removing an alternative above z in attribute 2, z also satisfies RN2 in set {x, z}. This 9 S {x, y} {x, z} {y,z} {x, y, z} xz Revealed Preference Revealed N1 (S) {x, y} {x, z} {z} {z} Revealed N2 (S) {x} {x, z} {y, z} {x, y, z} Revealed Consideration {x} {x, z} {z} {z} Actual Γ(S) {x, y} {x, z} {y, z} {y, z} Table 1: Revealed Preference and Consideration for Example 7 is all the revealed non-elimination we can get in this set. Revealed consideration follows by Theorem 2. Furthermore as x is chosen and z satisfies RN1 and RN2 , by Definition 4 we have xP z and by Theorem 2, x is revealed to be preferred to z. It turns out that this is the only revealed preference we can get in this example. A similar exercise can be repeated to get every cell of the table above. Note that for each set the set of alternatives that are revealed to be considered is a subset of the actual consideration set. This will always be the case as revealed consideration includes only those alternatives that are included in the consideration set of every possible representation. Furthermore, in this example the revealed preference is incomplete, which implies that there are multiple preferences that can rationalize the choice. For example, the actual consideration set and the preference of the DM would rationalize the choice, but also the following would: x y z, E2 (S) = {f ∈ S|f2 ≤ 0} = ∅ for all S, E1 (S) = {f ∈ S|f1 ≤ 0} = ∅ for S ∈ {{x, z}, {x, y}}, E1 (S) = {f ∈ S|f1 ≤ y1 } for S ∈ {{x, y, z}, {y, z}} , leading to Γ(xyz) = Γ(yz) = {z}, Γ(xy) = {x, y} and Γ(xz) = {x, z}. An example that is not consistent with CAF is available in the next section. 3.2 Choice Anomalies and CAF The attraction and compromise effects are two of the most consistently observed violations of WARP. The attraction effect has first been demonstrated by Huber et al. (1982), it refers to the observation that inclusion of an asymmetrically dominated alternative may lead to a shift in choice in favor of the dominant alternative. Compromise effect has first been demonstrated by Simonson (1989), it refers to the observation that inclusion of an extreme alternative may lead to a shift in choice in favor of an intermediate alternative. A typical data that corresponds to these effects can be found in Figure 2. CAF is able to explain both of these as resulting from the same choice procedure. First, lets look at the data corresponding to the attraction effect. This data can be 10 Attribute 2 Attribute 2 x x y y z z Attribute 1 c(xy) = c(xz) = x, c(yz) = y, c(xyz) = y Attribute 1 c(xy) = c(xz) = x, c(yz) = z, c(xyz) = y Figure 2: Data corresponding to the attraction effect (on the left) and the compromise effect (on the right) rationalized with the following preference, x y z and the following elimination sets, Ei (S) = ∅ for i = 1, 2 if |S| < 3, E1 (xyz) = x and E2 (xyz) = z. The elimination sets satisfy the restrictions and c maximizes the preference in the corresponding consideration set. The compromise effect can be rationalized with x z y and with the same elimination sets. The idea is simple, in the binary choice between x and y nothing is eliminated, so the DM maximizes her preference in this set. In the attraction effect, when we include z, it makes x look relatively poor on attribute 1, which leads x to be eliminated and y is chosen. In the compromise effect, when faced with all three alternatives, she eliminates the extremes, which leads to the choice of the intermediate one. It is no use for a model to explain such anomalies if it does not put reasonable restrictions on what is consistent with the model. We will now see that neither the opposite of attraction nor the opposite of compromise can be explained with CAF. So, assume that c(xy) = y and c(xyz) = x for both of the set of alternatives given in Figure 2. The following argument shows that this is not compatible with CAF. First note that y satisfies RN1 and RN2 in the set {x, y} trivially, as it is chosen. As we go from {x, y} to {x, y, z} we are adding an alternative that is worse than y in attribute 2. So, y satisfies RN2 in {x, y, z}. As x is chosen in {x, y, z} and as x ranks lower than y in attribute 1, y also satisfies RN1 in {x, y, z}. Therefore y is revealed to be considered in {x, y, z} implying xP y. As c(xyz) = x we know that x satisfies RN1 in {x, y, z}, but as we go to {x, y} we are removing an alternative that ranks better than x in attribute 1. So, x satisfies RN1 in {x, y}. As x ranks higher than y in attribute 2 and y is chosen in {x, y}, x also satisfies 11 RN2 in {x, y}. Hence, x is revealed to be considered in {x, y}. As c(xy) = y this implies yP x. But then we have xP yP x. Attraction or Compromise9 4 The previous section has shown that CAF provides an explanation for the attraction and compromise effects. Given this feature of the model, one might be interested in how these effects generalize to more than three alternatives and whether a class of attribute filters can be characterized with such generalizations. To answer these questions, in this section I will take an axiomatic approach and come up with an independent model that only allows for choice anomalies in the spirit of the attraction and compromise effects. I will show that this model is completely characterized by a particularly simple class of CAF. 4.1 The Model Throughout this section I will assume that there are only two attributes, i.e. Z ⊂ R2++ . Axiom 1. A choice function c satisfies Axiom 1 if whenever c(S) = x and c(S ∪ z) = y, where y 6= x and y 6= z we have either 1. y D z and x 6D z or 2. xi > yi > zi and zj > yj > xj , where i 6= j, i, j ∈ {1, 2} The idea is that if choice satisfies Axiom 1 then any choice change will be either in favor of a dominant alternative when an asymmetrically dominated alternative is added to the set (1) or in favor of an intermediate alternative (2). Hence every choice anomaly is either in the spirit of the attraction effect or the compromise effect. Axiom 2. A choice function c satisfies Axiom 2 if whenever c(S) = x, c(S ∪ z) = y, where y D z , x 6D z and if there exists z 0 ∈ S with z 0 D z then either • y D z 0 or • xi > yi > zi0 and zj0 > yj > xj , where i 6= j, i, j ∈ {1, 2} Axiom 2 is a consistency condition on the ‘attraction effect’. It requires that if z leads to the ‘attraction effect’ in favor of y, but if there is another alternative z 0 that dominates z then it should be the case that either y dominates z 0 or y benefits from being in between 9 I am grateful to an anonymous referee for suggesting this section. 12 z 0 and the previously chosen alternative x. In other words, y either benefits from being dominant or from being intermediate. Note that this Axiom has no bite when there are only 3 alternatives. Definition 5. A choice function c is AoC if it is monotonic10 and satisfies Axiom 1 and Axiom 2. Unlike CAF, AoC takes the axiomatic approach and directly puts restrictions on the kind of choice anomalies allowed in the data. It mentions neither the preferences nor the consideration sets and it is far from clear whether it is related to CAF. Surprisingly, it turns out that AoC is equivalent to a particularly simple class of CAF. For any set S ⊆ Z, let S nd ⊆ S denote the set of undominated alternatives in S, i.e. S nd = {z ∈ S| 6 ∃z 0 ∈ S with z 0 D z}. CAF with total elimination (CAFTE) is CAF when Γ(S)nd includes a single alternative: Definition 6. (CAF T E) c is CAFTE if c is CAF and |Γ(S)nd | = 1 for all S ⊆ Z. If choice is CAFTE then choice is CAF and every consideration set includes a single undominated alternative. An example of data that is CAFTE is the data corresponding to the attraction and compromise effects (see Figure 2). Indeed, the compromise effect can be rationalized with Ei (S) = {z 0 ∈ S|zi0 ≤ c(S)i −} for i ∈ {1, 2} and all S ⊆ Z. The attraction effect can be rationalized with Ei (S) = {z 0 ∈ S|zi0 ≤ c(S)i − } for i ∈ {1, 2} and for S 6= {x, y, z}, E1 (xyz) = {z 0 ∈ S|z10 ≤ z1 −} and E2 (xyz) = {z 0 ∈ S|z20 ≤ y2 −}. These elimination sets lead to an attribute filter in which Γ(S) = c(S) for all S ⊆ Z. And hence the choice is CAFTE. Furthermore, it is immediate that the data corresponding to the attraction and compromise effects is AoC, as all the choice changes are either (1) or (2) from Axiom 1. For an example of data that is CAF, but not CAFTE see the data in Figure 1. We have already seen that this is CAF. However, it cannot be rationalized as CAFTE, as the revealed consideration set for the set {x, z} is {x, z} itself and contains multiple undominated alternatives. It is also easy to see that the data in Figure 1 is not AoC, as when we go from {x, z} to {x, y, z}, y leads to a choice change that is neither consistent with the attraction nor the compromise effects (so, neither 1) nor 2) in Axiom 1). It turns out that what we see in these two examples hold in general. In particular, CAFTE is equivalent to AoC. Theorem 3. c is AoC iff c is CAFTE. 10 c is monotonic if whenever x D y for x, y ∈ S, we have c(S) 6= y. 13 This result further establishes the strong connection between the attraction and compromise effects and CAF. It shows that not only CAF allows for these effects, while not allowing for their opposites, but a model that only allows for the attraction and compromise effects is completely characterized by a simple class of CAF. 5 Related Literature There is evidence that the decision makers do not consider all of the available alternatives when making a choice. This phenomena has been extensively studied in the marketing literature, see Hauser and Wernerfelt (1990), Roberts and Lattin (1991) and Wright and Barbour (1977) for examples. There are several models where choice is made as a result of a two-stage procedure, first stage of which can be interpreted as the formation of the consideration set. Manzini and Mariotti’s (2007) rational shortlist method considers sequential maximization of several rationales. In Cherepanov et al.’s (2013) rationalization, the DM chooses the best alternative among the ones she can rationalize. In Manzini and Mariotti (2012) the DM categorizes the alternatives, and given a set she maximizes her preference among the ones in the ‘winning categories’. These models are closely connected to each other, as their characterization involves similar axioms. In particular, a condition called weak WARP is necessary for the choice to be consistent with any of these models. In Section 5.2, we will see that CAF might violate weak WARP. Masatlioglu et al. (2012) and Lleras et al. (2010) are more closely connected to this work. Just like this paper, they assume that the DM forms a consideration set and maximizes her stable preference on this consideration set. The model considered here and these differ on the properties of the consideration sets involved. Masatlioglu et al. (2012) consider attention filters, which require that if an alternative is not considered in a given set then removal of this alternative doesn’t affect the consideration set. Whereas Lleras et al. (2010) consider consideration filters, which require that if an alternative is considered in a set then it should be considered in every subset of this set including the alternative. In Section 5.2, I show that CAF is independent of both of these models. Caplin and Dean (2011), Caplin et al. (2011) and Masatlioglu and Nakajima (2013) look at the cases in which the consideration set is formed as a result of search through the set of alternatives. Both of these models look at data that is richer than choice data, in particular Caplin and Dean (2011) and Caplin et al. (2011) consider ‘choice process data’, which includes how the choice changes through the process of search. The data Masatlioglu and Nakajima (2013) considers includes the starting point of search. Both the motivation and the empirical implications of these models are quite different from 14 CAF. De Clippel and Eliaz (2012) model choice as the cooperative solution to a bargaining problem among the different selves of the individual. In their model, each attribute ranking is interpreted as the preference of one self. Each self assigns each alternative a score equal to the number of alternatives in the lower contour set of that alternative and the solution selects the alternative whose minimum score is highest. Hence, they do not model the choice as a two step procedure and there is no underlying single preference ordering that represents the preference of the DM. As with CAF, their model can also explain both the attraction and compromise effects. Another model that is able to explain both the attraction and compromise effects is Natenzon (2015), where the author takes a fundamentally different approach and shows that these effects can arise from Bayesian updating. Ok et al. (2015) is able to explain the attraction effect, but not the compromise effect. In their explanation, in the attraction effect the dominated alternative acts as a reference for the agent and makes the dominant alternative more attractive. One important distinction between their model and mine is that they use standard choice data, whereas I use data that includes observable attributes. The approaches should be seen as complementary as the former has the advantage of identifying the attraction effect and the attributes when they are subjective, while the latter can make firmer predictions when the attributes are objective (for instance by ruling out the opposites of these effects). Aside from these Lombardi (2009) has a model that explains the attraction effect. In this model, the DM first eliminates the dominated alternatives according to a fixed, but possibly incomplete relation. And from the remaining alternatives, she eliminates the ones whose lower counter set is strictly contained in another option’s. Finally, Gerasimou (2012) also has a model that incorporates the attraction effect. In this model the DM again eliminates the dominated alternatives according to a partial order, but from the remaining set she chooses the ones that are at least preferred to some other alternative. Mandler et al. (2012) study a choice procedure in which the DM sequentially goes through a checklist of properties, where at each stage she eliminates the alternatives that do not have the property. They show that this procedure has close connections to utility maximization. Unlike in this paper, the properties are unobservable and there is no underlying preference that the DM maximizes. Tversky (1972) considers a probabilistic choice model in which the alternatives are characterized by a set of aspects, at each stage an aspect is selected and all the alternatives that do not have this aspect are eliminated. This continues until all, but one of the alternatives are eliminated. Unlike this model, this is a probabilistic model and the DM 15 does not have a stable preference. Furthermore, unlike Tversky (1972), the data I consider here allows for the movement of a threshold relative to the set under consideration. Manzini and Mariotti (2014) consider a stochastic model of consideration set formation in which the DM considers each alternative with a certain unobservable probability and then maximizes her preference among the alternatives she considers. In their model the elimination is done stochastically given set-independent probabilities, whereas in CAF elimination is deterministic and depends on the choice set. Aside from these there are applied theory papers that study the effects of limited consideration on markets. Eliaz and Spiegler (2011a) provides one of the earliest applications, where the firms can use costly marketing devices to influence the set of alternatives considered by the consumer. In a similar vein, Eliaz and Spiegler (2011b) consider a model in which products are used to influence what the consumer pays attention to. 5.1 Consideration Sets In this section, I will compare the attribute filters to the consideration sets studied in the literature. I will make the comparison to attention filters (Masatlioglu et al. (2012)) and consideration filters (Lleras et al. (2010)). We say that a consideration set is an attention filter if when an alternative is not considered then the removal of this alternative does not affect the consideration set (Masatlioglu et al. (2012)). Formally Γ is an attention filter if for any S, Γ(S) = Γ(S \ x) whenever x ∈ / Γ(S). The corresponding choice procedure is called choice with limited attention (CLA). We say that a consideration set is a consideration filter if when an alternative is considered in set S then it will be considered in every subset of S that includes this alternative (Lleras et al. (2010)). Formally, Γ is a consideration filter if x ∈ S ⊆ T and x ∈ Γ(T ) then we have that x ∈ Γ(S). The corresponding choice procedure is called choice with limited consideration (CLC). Remark 1. Attribute filters are independent of attention filters and consideration filters. To show the remark, consider Example 3 in Section 2.2 above. Suppose that the DM eliminates the n most expensive alternatives. This is not an attention filter as when the nth most expensive alternative is removed from the set an alternative that was not originally eliminated will be eliminated in the set we obtain. The same argument also shows that it is not a consideration filter either, as an alternative is considered in a set, but not considered in one of its subsets. Example 8 below is an example of an attention and consideration filter that is not an attribute filter. 16 Attribute 2 t z y x Attribute 1 Figure 3: Attribute rankings for Example 9 Example 8. (Top n on each attribute) The DM considers the best n alternatives on each attribute ranking. It is easy to see that this is both an attention filter and a consideration filter. To see that it is not an attribute filter consider the following example. Z = {a, b, c}, there are two attributes K = {1, 2}, the rankings are a1 > b1 > c1 and c2 > b2 > a2 . Suppose the DM considers the top alternative on each ranking, then Γ(abc) = {a, c}. If this is an attribute filter then b is eliminated either by attribute 1 or 2. But if b is eliminated by attribute 1 then by condition (A) c is also eliminated. Similarly, if b is eliminated by attribute 2 then by condition (A) a is also eliminated. So, this cannot be an attribute filter. And this proves Remark 1. 5.2 Comparison to Other Models We have seen that attribute filters are independent of both attention and consideration filters, but the question of whether the models are independent still remains. Here, I will show that they are independent. The examples also show independence of CAF from Rationalization (Cherepanov et al. (2013)) and Categorize then Choose (Manzini and Mariotti (2012)) .11 Example 9. Suppose that Z = {x, y, z, t}, there are two attributes and the attribute rankings are given in Figure 3. Suppose the DM has the following preference, x y t z and uses the following choice procedure. In a binary set she maximizes her preference, but given a set containing three or more alternatives she first eliminates the worst in each attribute ranking then she maximizes her preference in the remaining set. This choice is consistent with CAF. But it is neither a CLA nor a CLC. 11 Note that the data I consider is richer than the choice data, which all the other models mentioned here consider. 17 To see that it is not a CLA observe that c(xyz) = y and c(yx) = x, but this implies that if the choice is a CLA then the DM paid attention to z in {x, y, z} implying y z. Similarly, c(yzt) = z and c(zt) = t implies z y, a contradiction. To see that this is not a CLC, observe that c(xyzt) = y and c(yzt) = z. But if the choice is a CLC then since y is considered in {x, y, z, t}, y is also considered in {y, z, t}, so z y. Similarly, c(yzt) = z and c(yz) = y implies that y z, a contradiction. This example shows that CAF is not a special case of CLA and CLC. Another widely used axiom, which is a necessary condition in several models discussed in the Literature Review is Weak WARP. Definition 7. Weak WARP c satisfies weak WARP if x 6= y, {x, y} ⊆ S ⊆ T , c(xy) = c(T ) = x, then c(S) 6= y. For c to be consistent with the Rational Shortlist Method (Manzini and Mariotti (2007)), Rationalization (Cherepanov et al. (2013)) and Categorize then Choose (Manzini and Mariotti (2012)) it has to satisfy Weak WARP. The above example also shows that c might be consistent with CAF, but violate Weak WARP. To see this, observe that in the above example we have c(xyzt) = y, c(zy) = y, but c(yzt) = z. So, CAF is also not a special case of any of these models. Finally, Weak WARP has no bite when |Z| = 3. CLA and CLC also do not put any restrictions on c when |Z| = 3. But then, all of these models are consistent with the ‘opposites’ of the attraction and compromise effects studied in Section 3.2, while CAF is not consistent with these. Hence with Example 9, this observation proves that CAF is independent from CLA,CLC, Rationalization and Categorize then Choose. 6 Conclusion It is well known that the decision makers do not always consider all of the alternatives when they are making a choice. In this paper I introduced choice through attribute filters, which postulates that attribute rankings can provide a natural way to form the consideration sets. I have characterized CAF and shown what can be inferred about the preference and the consideration set of the DM if the choice is consistent with CAF. Appendix A. Additional Results Unobservable Attributes Here I will show the model has no empirical content if the attributes are unobservable. The definition below assumes that attributes and attribute rankings are unobservable. 18 So let Z be any finite set of alternatives.12 Definition 8. A choice function c is a choice through attribute filters with unobservable attributes if there exists an integer k > 0, a function f : Z → Rk++ such that (c, f (Z)) is CAF. Proposition 1. Any c is a choice through attribute filters with unobservable attributes. Proof. Take any c. I will show the result by constructing |S| − 1 attributes for each set S. For any set S and x ∈ {S \c(S)} construct an attribute ranking with the property that the elements in S rank strictly higher than the elements in Z \ S and x is the alternative that ranks the lowest among the alternatives in S. In set S the threshold on this attribute will be at x and in any other set the threshold will be below the lowest ranking alternative on this attribute. So, x and only x will be eliminated using this attribute ranking in set S. And nothing will be eliminated by this attribute in any set T 6= S. We can do this for every alternative x ∈ S \ c(S) and for every set S ∈ Z. It is easy to see that this leads to an attribute filter and Γ(S) = c(S) for all S ⊆ Z. Hence, any linear order will rationalize the choice.13 Pre-Attribute Filters As mentioned in the Introduction a model with Condition (A) alone has little empirical content as any monotone c will be rationalizable with such a model. Here I will show this. Note that I am assuming that the attributes and the rankings on these attributes are observable, as otherwise there is nothing to show by Proposition 1. Definition 9. Pre-Attribute Filter Γ is an pre-attribute filter if there exists a threshold tS ∈ Rk for each S ⊆ Z and a mapping Ei : Z → Z for each i ∈ {1, .., k}, where Ei (S) ⊆ S, Γ(S) = S \ (∪i∈K Ei (S)), Γ(S) 6= ∅ such that (A) Ei (S) = {z ∈ S|zi ≤ tSi } If the choice is rationalizable with a pre-attribute filter I will call it choice through pre-attribute filters (CPF). 12 In the paper we have taken Z ⊂ Rk . This specification incorporates the attributes and the rankings on these attributes. Here, Z is any finite set of alternatives, so the attributes are unobservable. 13 Note that monotonicity has no bite, as there are no dominated alternatives. 19 Definition 10. Choice Through Pre-attribute Filters (CPF) c is a CPF if there exists a strict, complete, transitive and monotone preference relation over Z and a pre-attribute filter Γ such that c(S) is the -best element in Γ(S) for every S ∈ Z. Proposition 2. Any monotone c is a CPF. Proof. Take any monotone c. Let be any complete, strict, monotone and transitive preference relation. For every set S, let tS = (c(S)1 − , c(S)2 − , ..., c(S)k − ), i.e. Ei (S) = {z ∈ S|c(S)i − ≥ zi }. The resulting consideration set is a pre-attribute filter, furthermore everything in S that does not dominate c(S) is eliminated. Since c is monotone, c(S) maximizes in Γ(S). Remark 2. This brings the question of how monotonicity affects the main results in the paper, indeed the main results in the paper is robust in this sense. If we do not assume that the underlying preferences are monotonic and if we drop the condition that xP y if x >D y, then all of the main results (Theorems 1 and 2) will still go through with exactly the same proofs. Remark 3. Finally, if we do not require the monotonicity of the underlying preferences, then for this particular result we will have that any c will be a pre-attribute filter. The elimination sets in the proof will stay the same, but the preferences will have the peculiar property that that x y whenever y >D x. Allowing for Indifference In this section I will allow for indifference in the preference of the DM and in the attribute rankings. We will see that results go through with slight modification. As these are changes in the fundamentals I will quickly go through the data and choice procedure again. Z ⊂ Rk++ is a finite set of alternatives, Z denotes the set of all nonempty subsets of Z. Now it can be the case that zi = xi for some z, x ∈ Z with z 6= x and some attribute i. A choice correspondence is a mapping c : Z → Z such that c(S) ⊆ S for all S ∈ Z. It is interpreted as the choice of the DM from each set. It will be assumed that Z and c is observable. Definition 11. Γ is an attribute filter if there exists a threshold tS ∈ Rk for each S ⊆ Z and a mapping Ei : Z → Z for each i ∈ {1, .., k}, where Ei (S) ⊆ S, Γ(S) = S \ (∪i∈K Ei (S)), Γ(S) 6= ∅ such that 20 (A) Ei (S) = {z ∈ S|zi ≤ tSi } (B) If F = S ∪ x where x ∈ Z and xi ≥ tSi then xi + ≥ tFi ≥ tSi (C) If F = S ∪ x where x ∈ Z and xi ≤ tSi then xi − ≤ tFi ≤ tSi We say that x D y if xi ≥ yi for all i ∈ {1, 2, ..., k}. A preference relation is monotone if whenever x D y then x y. Here is the definition of CAF. Definition 12. Choice Through Attribute Filters (CAF) A choice correspondence c is a choice through attribute filters (CAF) if there exists a complete, transitive, monotone and reflexive preference relation over Z and an attribute filter Γ such that c(S) = {x ∈ Γ(S)|x y for every y ∈ Γ(S)} RNi needs a very slight modification to reflect the changes. Condition 2. (RNi ) y ∈ T satisfies RNi for attribute i in set T if there exists z ∈ T and a set S including y and z such that z ∈ c(S), yi ≥ zi and • If x ∈ T and xi ≥ yi , then x ∈ S. • If x ∈ S and zi ≥ xi , then x ∈ T . Exactly the same argument following the original definition of RNi shows here that whenever choice is CAF and x satisfies RNi in set S then x is revealed to be non-eliminated by attribute i. If c is CAF and if y is revealed to be non-eliminated by attribute i in set T for all attributes i then y is revealed to be considered in set T . Which means that if x ∈ c(T ) then x is revealed to be preferred to y. Furthermore if y ∈ / c(T ) then x is revealed to be strictly preferred to y. Lets define the relations P and S with these observations and by incorporating monotonicity. Let x D y if x D y, but y 6D x. Definition 13. xSy if x D y or if there exists a set T such that y ∈ / c(T ), x ∈ c(T ) and y satisfies RNi in set T for all i ∈ {1, ..., k}. xRy if x D y or if there exists a set T , x ∈ c(T ) and y satisfies RNi in set T for all i ∈ {1, ..., k}. Note that S is the asymmetric part of R. We say that R satisfies only weak cycles (OWC) if given any x1 , x2 , ...., xn ∈ Z whenever x1 Rx2 R...xn Rx1 , for no j = 1, ..., n − 1 can it be the case that xj Sxj+1 nor can it be the case that xn Sx1 . As the underlying preference is transitive OWC is a necessary condition for c to be CAF. The following theorem states that it is also sufficient. 21 Theorem 4. c is CAF iff R satisfies OWC. Proof. Necessity already shown, for sufficiency observe that as R satisfies OWC, there exists a complete preorder extension of R. Let R be one such extension, note that R is transitive, reflexive, complete and monotone. The rest of the proof follows the same lines as the proof of Theorem 1, so I will not be providing it here. Appendix B. Proofs Attribute Filters for Examples 1 to 6 Here I will formally set the attribute filters of the examples discussed in Section 2. Example 1. (Rational Choice) For each S, tS = (0, 0, ..., 0), i.e. Ei (S) = {z ∈ Z|zi ≤ 0} = ∅ for all i ∈ {1, ..., k} and S ∈ Z. Example 2. (Top n) Suppose the DM considers the top n alternatives in attribute 1. Let z n+1 (S) be the (n + 1)th best alternative in set S according to attribute 1 if |S| ≥ n + 1, otherwise let z n+1 (S) = 0. Then tS = (z1n+1 (S), 0, ..., 0). Example 3. (Eliminate Worst n) Suppose the DM eliminates the worst n alternatives in attribute 1. Let ẑ n (S) be the (n)th worst alternative in set S according to attribute 1 if the set contains more than n alternatives, otherwise it is the second best alternative according to attribute 1. Then for each S, we have tS = (ẑ1n (S), 0, 0, ..., 0) Example 4. (Eliminate Worst n on each attribute) Let zil (S) be the value of the lth worst alternative in attribute i in set S if l > 0, otherwise zil (S) = 0. Let tS (l) = (z1l (S), z2l (S), ...., zkl (S)). Let l∗ be the largest integer l smaller or equal to n such that the consideration set induced by tS (l) is nonempty. Then tS = tS (l∗ ). Example 5. (Fixed Cutoff ) Without loss of generality let the ordering of the attributes be {k, k − 1, ..., 1}. Let tS (l) be such that tSi (l) = a∗i for i < l and tSi (l) = 0 for i ≥ l. Let l∗ be the largest integer l smaller than or equal to k + 1 for which the consideration set induced by tS (l) is nonempty. Then tS = tS (l∗ ). Example 6. (Satisficing (Simon (1955))) Assume that the DM searches through the attribute ranking of attribute 1. Let N1 (S) = {x ∈ S| ∃y ∈ Z such that y1 > x1 and y a∗ }. Let n1 (S) be the element of N1 (S) with the highest value of attribute 1 if N1 (S) 6= ∅, otherwise n1 (S) = 0. Then tS = (n1 (S), 0, 0, ..., 0). Proofs Lemma 1. Take any T ⊆ Z and x ∈ T 22 (i) Suppose y ∈ T , yi > xi and x satisfies RNi in set T . Then y satisfies RNi in set T . (ii) If x satisfies RNi in set T and if yi < xi then x satisfies RNi in set {T ∪ y}. (iii) If x does not satisfy RNi in set T and if yi > xi then x does not satisfy RNi in set {T ∪ y} Proof. Suppose x satisfies RNi in set T 0 . Then by definition, there exists z ∈ T 0 and a set S including x and z such that c(S) = z, xi ≥ zi and • If y 0 ∈ T 0 and yi0 ≥ xi then y 0 ∈ S, and • If y 0 ∈ S and zi ≥ yi0 then y 0 ∈ T 0 For part (i), take T 0 = T . Since yi > xi , by the first bullet y ∈ S and furthermore any y 0 ∈ T such that yi0 ≥ yi also satisfies yi0 ≥ xi and hence by the first bullet y 0 ∈ S. But then y also satisfies RNi in set T . For part (ii), take T 0 = T . In set {T ∪ y}, the first bullet point is satisfied as no alternative is added above x, furthermore the second bullet point is satisfied whether zi ≥ yi or not, hence the result follows. For part (iii), take T 0 = {T ∪ y} and towards a contradiction assume that x satisfies RNi in this set. But then since as we go from T 0 to T an alternative above x is removed, we have that x also satisfies RNi in set T , a contradiction. Proof of Theorem 1. Necessity already shown, here we will show sufficiency. Suppose that P is acyclic. Step 1: Constructing the preference Since P is acyclic, there exists a linear order extension of P . Let R be a linear order extension of P . Then R is a strict, transitive, complete and monotone preference relation. Step 2: Forming the Elimination Sets and the Consideration Set For any set S, we will include everything that does not satisfy RNi in set S in Ei (S). Let N RNi (S) ⊆ S be the set of alternatives that does not satisfy RNi in set S.For any set S let tSi = 0 if N RNi (S) = ∅, otherwise let tSi = max{z∈N RNi (S)} zi So, for any set S we have Γ(S) = S \ ∪i∈K Ei (S), where for each i ∈ {1, ..., k} , Ei (S) = {z ∈ S|zi ≤ tSi } By Lemma 1 (i), Ei (S) eliminates an alternative x iff x does not satisfy RNi in set S. Step 3: Show that c(S) maximizes R in Γ(S) c(S) satisfies RNi in every attribute i in set S, hence c(S) ∈ Γ(S). Now, we need to show that c(S) R maximizes Γ(S). Towards a contradiction, suppose there exists y ∈ S such that y R c(S) and y ∈ / Ei (S) for all i ∈ {1, ..., k}. But then y satisfies RNi in set S by every attribute. By the definition of P we have c(S)P y implying c(S) R y, a contradiction. 23 Step 4: Show that Γ is an attribute filter, i.e. elimination sets satisfy (A), (B) and (C) in Definition 1. (A) follows from Lemma 1(i). To show (B), assume that yi > tSi and F = {S ∪ y}. Towards a contradiction assume tFi < tSi or tFi > yi . In the former, there exists z ∈ F such that z satisfies RNi in set F and z does not satisfy RNi in set S and yi > zi . This is a contradiction to Lemma 1(iii). In the latter, there exists z ∈ S that satisfies RNi in S with zi > yi , but z does not satisfy RNi in set F , which is a contradiction to (ii). A symmetric argument shows (C). Hence, it is indeed an attribute filter. Step 5: The result We have established that Γ is an attribute filter and c(S), R -maximizes Γ(S). Therefore (Γ, R ) rationalizes the choice. Proof of Theorem 2. We already showed that if xPR y then x is revealed to be preferred to y. For the other way, suppose that it is not the case that xPR y, then there exists an extension of PR , say R , that ranks y over x. The proof of Theorem 1 shows that there exists an attribute filter ΓR such that (ΓR , R ) rationalizes the choice as CAF. We have also shown that if x satisfies RNi then x is revealed to be non-eliminated by attribute i. For the other way suppose that x does not satisfy RNi in set S, again the proof of Theorem 1 shows that there exists elimination sets that rationalizes the choice and includes x in Ei (S). Revealed consideration follows the same argument. Lemma 2. If c is CAFTE then c is AoC. Proof. Suppose c is CAFTE. Take any S ⊆ Z, suppose c(S) = x 6= c(S ∪ z) = y 6= z. Since c is CAF, it is monotone, so without loss of generality assume x2 > y2 and y1 > x1 . Trivially, we have y ∈ / E2 (S ∪ z) and since c is CAFTE, we have y ∈ E2 (S). But then by (B) and (C) in Definition 1, z2 < y2 . Similarly x ∈ / E1 (S) and since c is CAFTE, we have x ∈ E1 (S ∪ z). But then by (B) and (C) in Definition 1, z1 > x1 . These two observations show that Axiom 1 is satisfied. We need to show Axiom 2. So, suppose y D z and there exists z 0 ∈ S with z 0 D z. Assume that y 6D z 0 , we will show that y is in between z 0 and x. First suppose z20 > y2 , but then since c is CAFTE we have z 0 ∈ E1 (S ∪ z). Also since x ∈ / E1 (S), by (A), z0 ∈ / E1 (S).14 By (B) and (C) this implies that z1 > z10 , a contradiction. So, y2 > z20 . Since y 6D z 0 , we also have z10 > y1 . But then, z10 > y1 > x1 and x2 > y2 > z20 . 14 This is because z10 > x1 . And that is because by Axiom 1 (which is shown to hold above) x 6D z (so z1 > x1 ), but z 0 D z. 24 Lemma 3. Suppose c is AoC. If c(S) = x and T ⊆ Z is obtained from S by removing alternatives that rank better than x in attribute i, then xi ≥ c(T )i . Proof. Towards a contradiction suppose c(T )i > xi . As we go from T to S we are adding alternatives that rank better than x in attribute i. Lets assume that one by one we start adding these alternatives. Let z be the first alternative that leads to a choice change and F be the resulting set. Then either c(F ) = z or the choice change is due to 1) in Axiom 1 or 2) in Axiom 1, but in all of these cases c(F )i > xi , this is because both zi > xi and c(T )i > xi . But then we can continue adding alternatives until we reach the set S and by the same argument the choice will never change in favor of x. A contradiction. Proof of Theorem 3. By Lemma 2 above, if c is CAFTE then c is AoC. Here, we will show the other way. So, suppose that c is AoC. Let be any monotone preference relation and let the elimination sets be defined as in the proof of Theorem 1. From the proof of Theorem 1, we already know that these lead to an attribute filter. So, all we need to show is that for any S ⊆ Z, the resulting consideration set Γ(S) satisfies |Γ(S)nd | = 1.15 Towards a contradiction, suppose not. So, there exists T ⊆ Z with c(T ) = x, but there exists y ∈ Γ(T ), y 6= x that is not dominated by x. Since the choice is AoC, x is also not dominated by y, so without loss of generality assume x2 > y2 and y1 > x1 . Then y satisfies RN2 in set T . Then, by the definition of RN2 , there exists z ∈ T and a set S including y and z such that c(S) = z, y2 ≥ z2 and • If t ∈ T and t2 ≥ y2 , then t ∈ S. • If t ∈ S and z2 ≥ t2 , then t ∈ T . Let F be the set obtained from S by removing everything that is not in T and that ranks above z in attribute 2. By Lemma 3, z2 ≥ c(F )2 . Furthermore since c(F ) cannot be dominated, we have c(F )1 ≥ y1 . Set T is obtained from F by adding alternatives that rank worse than y in attribute 2. Lets divide this set into two: (T \ F )D , which includes all the alternatives in (T \ F ) that are dominated by x and (T \ F )U , which includes the rest. Note that everything in (T \ F )D is also dominated by y. Lets start adding the alternatives in (T \ F )D to set F in any order , let z1 be the first alternative that leads to a choice change and let Z1 be the corresponding set. As z1 is dominated, the choice cannot be z1 . If the choice change is due to 1) in Axiom 1, then c(Z1 )1 ≥ y1 . This is because of Axiom 2, as both x and y dominate z1 , but y is in between c(F ) and x, x or 15 Note that if this holds, then maximization of the preference trivially holds. 25 anything that ranks worse than y in attribute 1 cannot be chosen. Finally, if the choice change is due to 2) in Axiom 1, then c(F )1 > c(Z1 )1 > z1 and z2 > c(Z1 )2 > c(F )2 . Note that as c(Z1 ) cannot be dominated by y, we also have c(Z1 )1 > y1 . So, in either case c(Z1 )1 ≥ y1 . Now, we can continue adding the remaining alternatives in (T \ F )D in any order, whenever there is a choice change, the same argument shows that the chosen alternative ranks at least as good as y in attribute 1, but then c(F ∪ (T \ F )D )1 ≥ y1 . Note that everything in (T \ F )U ranks better than x in attribute 1, so by Lemma 3, c(T \ (T \ F )U )1 ≤ x1 . But then we have c(T \ (T \ F )U )1 ≤ x1 < y1 ≤ c(F ∪ (T \ F )D )1 = c(T \ (T \ F )U )1 . The last inequality is because (T \ (T \ F )U ) = (F ∪ (T \ F )D ). A contradiction. References Caplin, A., Dean, M. (2011). Search, Choice, and Revealed Preference. Theoretical Economics, 6(1), 19-48. Caplin, A., Dean, M., Martin, D. (2011). Search and Satisficing. The American Economic Review, 2899-2922. Cherepanov, V., Feddersen, T., Sandroni, A. (2013). Rationalization. Theoretical Economics, 8(3), 775-800. De Clippel, G., Eliaz, K. (2012). Reason Based choice: A Bargaining Rationale for the Attraction and Compromise Effects. Theoretical Economics, 7(1), 125-162. Eliaz, K., Spiegler, R. (2011a). Consideration sets and competitive marketing. The Review of Economic Studies, 78(1), 235-262. Eliaz, K., Spiegler, R. (2011b). On the strategic use of attention grabbers. Theoretical Economics, 6(1), 127-155. Gabaix, X., Laibson, D., Moloche, G., Weinberg, S. (2006). Costly information acquisition: Experimental analysis of a boundedly rational model. The American Economic Review, 1043-1068. Gerasimou, G. (2016). Asymmetric dominance, deferral, and status quo bias in a behavioral model of choice. Theory and Decision, 80(2), 295-312. Hauser, J. R., Wernerfelt, B. (1990). An Evaluation Cost Model of Consideration Sets. Journal of Consumer Research, 393-408. 26 Huber, J., Payne, J. W., Puto, C. (1982). Adding Asymmetrically Dominated Alternatives: Violations of Regularity and the Similarity Hypothesis. Journal of Consumer Research, 90-98. Lleras, J. S., Masatlioglu, Y., Nakajima, D., Ozbay, E. (2010). When More is Less: Limited Consideration. Unpublished paper. Lombardi, M. (2009). Reason-based choice correspondences. Mathematical Social Sciences, 57(1), 58-66. Mandler, M., Manzini, P., Mariotti, M. (2012). A Million Answers to Twenty Questions: Choosing by Checklist. Journal of Economic Theory, 147(1), 71-92. Manzini, P., Mariotti, M. (2007). Sequentially Rationalizable Choice. The American Economic Review, 1824-1839. Manzini, P., Mariotti, M. (2012). Categorize Then Choose: Boundedly Rational Choice and Welfare. Journal of the European Economic Association, 10(5), 1141-1165. Manzini, P., Mariotti, M. (2014). Stochastic Choice and Consideration Sets. Econometrica, 82(3), 1153-1176. Masatlioglu, Y., Nakajima, D., Ozbay, E. Y. (2012). Revealed Attention. The American Economic Review, 102(5), 2183-2205. Masatlioglu, Y., Nakajima, D. (2013). Choice by Iterative Search. Theoretical Economics, 8(3), 701-728. Natenzon, P. (2015). Random Choice and Learning. Mimeo Ok, E. A., Ortoleva, P., Riella, G. (2014). Revealed (p) reference theory. The American Economic Review, 105(1), 299-321. Roberts, J. H., Lattin, J. M. (1991). Development and Testing of a Model of Consideration Set Composition. Journal of Marketing Research, 429-440. Simon, H. A. (1955). A Behavioral Model of Rational Choice. The Quarterly Journal of Economics, 99-118. Simonson, I. (1989). Choice Based on Reasons: The Case of Attraction and Compromise Effects. Journal of Consumer Research, 158-174. Tversky, A. (1972). Elimination by Aspects: A Theory of Choice. Psychological Review, 79(4), 281. 27 Tversky, A., Simonson, I. (1993). Context-dependent Preferences. Management Science, 39(10), 1179-1189. Wright, P., Barbour, F. (1977). Phased Decision Strategies: Sequels to an Initial Screening. Graduate School of Business, Stanford University. 28
© Copyright 2026 Paperzz