Imperceptible Difference and Categorization: A Rational Problem Simplification Strategy∗ André d’Almeida Monteiro† Gávea Investmentos First version: September, 2003 This version: February, 2006 Abstract This paper proposes a multidimensional rational preference that incorporates imperceptible difference on a countable alternative set whose elements are described by n attributes based on the notions of similarity and categorization. Preference invariants are investigated as well as conditions that assure consistency among multidimensional similarity and n similarities over the attributes. If the conditions for the existence of the proposed preference are provided, imperceptible difference induces a rational strategy to simplify a problem choice: reduction of the alternative set by categorization. ∗ I have benefited from discussions with Dionísio Dias Carneiro, Vinicius Carrasco, Carlos Kubrusly, José Alexandre Scheinkman and Wei Xiong. The first version of this paper was written when I was visiting the Bendheim Center for Finance and The Economics Department of Princeton University in 2003. † e-mail: [email protected]. Tel: + 55 21 3206-9018 1 1 Introduction When faced with certain types of decision tasks, the decision maker may be indifferent among some elements just because they are “very close” to each other. It is not a question of incompleteness of the preference or lack of decision. He cannot express his strict preference among them because their differences are not enough to be perceived by him. This is called imperceptible or just noticeable difference. The difference between two elements may be imperceptible for the decision maker for some reasons. One reason is his incapacity to recognize that two objects are different. Some people, for instance, do not perceive the difference between blue and red. Luce (1956) provides a classical example: a person is not able to detect the difference caused by one grain of sugar in a cup of coffee. Perception accuracy may also vary among people because of expertise (training and inherit ability). There is no judgement but lack of discrimination capacity or accuracy. MasColell, Whinston and Green (1995) give an example: a person is deciding on the color of his apartment walls. He may be indifferent between two shades of gray because their difference is imperceptible for him. However, the choice of these shades makes the difference for an interior designer. A sommelier is able to recognize the difference between two years of the same wine, whereas a person who is not used to drink wine probably is not able to distinguish between them. Another reason comes from a judgment of merit about the difference motivated by some source of cost in evaluating a large-number alternative set. In this case, the decision maker recognizes that two objects are different, but judges their difference as not noteworthy or negligible. Bernstein (1995) and Swensen (2000) report that some investors allocate funds among groups of similar assets, instead of among individual assets. These groups are called styles and this investment process, style investing. These investors’ motivation is their incapacity to evaluate all the available assets (only in NYSE there are approximately 3,600 2 listed stocks). Proceeding like that, they perceive the difference among assets within the same style as noteworthy compared with the cost of evaluating all the assets. Although it seems to be reasonable that decision makers take into consideration the notion of “very close” in their choices, imperceptible difference brings two problems for a classical rational preference: the failure of transitiveness and the perception discontinuity at the mark of “very close”.1 Both are related to the agglomerative nature of imperceptible difference: its capacity to create chains of elements in which differences between two immediate neighbors are imperceptible. If the decision maker is not indifferent among all elements in the same chain, the symmetric part of the preference is not transitive. The mark of “very close” generates a discontinuity point in the judgment: all of a sudden, after two alternatives having achieved a certain level of difference, the decision maker is able to distinguish between them. In addition, this perception discontinuity mark seems to be meaningless in terms of decision theory. Luce (1956) was the first to axiomatize imperceptible difference. He introduces the concept of semiorder. The main idea is that the decision maker judges two elements as indifferent if their associated utility values are close enough to each other. In other words, indifference is not transitive. The main goal of the present paper is to reconcile imperceptible difference and rational preference by using the notions of similarity and categorization. Similarity here has the meaning of proximity measured on the alternative attributes. Categorization is the grouping of objects into non-empty and mutually exclusive categories based on similarity. More precisely, a category is a subset of elements that are more similar to each other than to elements of other subsets. This definition is crucial for the whole paper. Categorization is a choice behavior deeply documented and investigated by the psychological literature: the decision makers exhibit the tendency to simplify a problem through the reduction of the number of 1 A binary relation is said to be a rational preference, or a complete pre-order, if and only if it is complete and transitive. 3 alternatives by categorizing them.2 It is also in the core of pattern recognition literature.3 The major contribution of the present paper is to show that imperceptible difference can be incorporate into a rational multidimensional preference on a countable alternative set if and only if the decision maker perceives his alternatives through categories generated by similarity. If the alternative set can be partitioned into categories, imperceptible difference is a transitive relation, and the perception discontinuity mark just captures the disconnectedness existing in the alternative set, that is, the differences among the categories. Let the proposed preference be called Similarity Based Preference (SBP). The present paper also contributes in giving choice rationality for certain kinds of categorization. Usually categorization arises to deal with a source of cost that leads the decision maker to reduce his alternative set by grouping it such as monitoring, research, store, time and subjective cost. However, the present paper proves that, even in the absence of any source of cost, the decision maker is allowed to categorize the alternative set and to be indifferent among the elements within the same category. In the sequence, he can take just one element per category to the following steps of the decision making process. Therefore, it is shown that imperceptible difference may be able to induce a rational strategy to simplify a problem choice by reducing the alternative set. The preference is modeled as follows. There are no probabilities involved in the decision and the modeling is descriptive. Every element of the alternative set is described by n attributes and all of them are relevant for the choice and the choice is exhausted by them. The notion of imperceptible difference is formalized by a binary relation, called similarity. Similarity is a function of two decision maker’s parameters: the way he measures difference between elements based on their n attributes (a metric function, d), and his perception accuracy for differences (a threshold, ε). The decision maker evaluates two elements as similar if and only if their difference is imperceptible — below his perception accuracy. 2 3 For example, see Medin and Aguilar (1999). See, for example, Duda and Hart (2002) and Theodoridis and Koutroumbas (1999). 4 According to Armstrong (1950, page 122), “the imperfect powers of discrimination of the human mind whereby inequalities become recognizable only when of sufficient magnitude.” Gilboa and Lapson (1995) advocate the existence of such thresholds for physical perception problems. Rubinstein (1988) applies a threshold to model decision maker perception on prices and probabilities of a lottery. Here, the threshold will be used in a general decision problem. As the modeling is descriptive, the reasons that lead the decision maker to not distinguish between two elements do not matter and the decision maker’s perception parameters (d and ε) are taken as given. SBP puts the question of “very close” in a different place than a semiorder does. In a semiorder relation, two elements are not distinguishable if their utility values are close enough. Then, the decision maker evaluates the very-close question in the real line. On the other hand, under SBP approach, two elements are imperceptible different if they are close enough based on their attributes. The very-close question is, then, evaluated in the space of the alternative attributes. There is also a point of primitiveness here: if the decision maker judges the difference between two elements as imperceptible, he should not be able to attach different utility values for them. Fixed the asymmetric part of the preference, the analysis of invariances of SBP for the decision maker’s perception parameters brings two interesting conclusions. First, two decision makers with different perception of the world (different d and ε) may end up perceiving exactly the same categories. Second, if the decision maker varies his perception accuracy by enough, the resulting SBPs form an unique hierarchical structure. That is, in order to reduce the number of indifference classes, the decision maker can only merge the already existing indifference classes. The decision maker deals with the notion of similarity simultaneously judging the differences of the n attributes. Since it is reasonable that he also evaluates similarity on each attribute, n similarity relations are defined, one on each attribute independently. Assuming 5 that there is no association among similarity relations on the attributes, the decision maker’s multidimensional judgment and his n judgments over each attribute should be consistent. Thus, two conditions are required for the system of n + 1 similarity relations to be said consistent: similarity in all the n attributes should imply multidimensional similarity and vice-versa. The first condition is due to Gilboa and Lapson (1995). If the system is consistent, the n + 1 perception accuracy thresholds are mutual-dependent and more structure is required from d. Nevertheless, if there are channels of association among similarity relations on the attributes (as the correlated similarity proposed by Aizpurua et al., 1991, for example), the second condition becomes meaningless and there is room only for the first condition. In this case, SBP is flexible enough to accommodate interdependence among similarity relations on attributes. The papers that most relate to the present one, regarding the idea of similarity, are Rubinstein (1988) and Vila (1997). Rubinstein (1988) defines similarity relations on the two dimensions of lotteries space independently by using semiorders. Vila (1997) extends Rubinstein results to a general three-dimensional choice problem.4 The imperceptible-difference-driven categorization can be viewed as a selection process of the alternative set: only a subset of the alternative set is considered in the next steps of the decision process. In this sense, this paper relates to Kreps (1979) and Ergin (2003). The idea of categorization has been explored in the economics literature by some authors such as Esteban and Ray (1994), Barberis and Shleifer (2003), Keely (2003), Fryer and Jackson (2004) and Peng and Xiong (2005). The rest of this paper is divided into five sections. The second section introduces the basic concepts and builds the necessary and sufficient conditions for the existence of SBP. The third one explores the connections between imperceptible difference and categorization. 4 Recently, some authors have used the concept of similarity in game theory, such as Camerer, Hisa and Ho (2002), Chen and Khoroshilov (2003), and Sarin and Vahid (2004). 6 The next section focuses on the invariances of SBP for the parameters of the decision maker’s perception. The fifth section analyzes the connections among the similarity relations and preferences in the multidimensional space and on each attribute dimension. The sixth section concludes the paper. All the proofs are provided in the Appendix. 2 Similarity Based Preference This section establishes the decision making setup and introduces the basic definitions: difference between elements and sets, perception accuracy, similarity, chains, sub-chains and group of chains. It also presents the necessary and sufficient conditions for the existence of a SBP. The decision maker faces a countable alternative set X. Each x ∈ X is a n-tuple: x = (x1 , ..., xn ), n ∈ N. The decision maker perceives the difference between a pair of elements of X through a metric function: d : X → R+ . That is, (X, d) is metric space. It is a constraint on the way he evaluates differences. The diameter of a set is defined as the maximum distance between its own elements. The distance between two sets is the minimum distance between elements belonging to the different sets. Let X1 and X2 be subsets of a set X. diam (X1 ) = max d (x, y) x,y∈X1 d (X1 , X2 ) = min x∈X1 ,y∈X2 d (x, y) The difference between two elements in X is imperceptible for the decision maker if and only if it is lower than his perception accuracy ε. This concept is formalized by a binary relation called similarity. Assume that the decision maker’s parameters, d and ε, are exogenous. 7 Definition 1 Two elements x, y ∈ X are said to be similar, xSd,ε y, if and only if their difference measured by d is lower or equal to ε. xSd,ε y ⇐⇒ d (x, y) ≤ ε (1) where ε ∈ ]0, diam (X)[. Two alternatives are similar, if their difference is imperceptible. The higher the perception accuracy, the smaller the threshold ε is. The definition of a metric function requires that ε ≥ 0. Since the case of ε = 0 and ε = diam (X) are not interesting, ε lies on the open interval ]0, diam (X)[. Naturally, the meaning of ε depends on the metric d. The following proposition characterizes similarity relation. Proposition 1 Similarity, Sd,ε , is a binary relation on X, Sd,ε ⊂ X × X, which is complete, reflexive and symmetric regardless d or ε. Additionally, it is not transitive in general, and this property depends on (X, d) and on ε. An important remark is that Sd,ε fails to be an equivalence relation only because it is not transitive in general. The core idea of this section is to find the cases where this relation is transitive and to explore the preferences in which imperceptible difference means indifference in terms of choice theory. The similarity relation has an agglomerative nature: starting from a pair of similar elements, it can create chains of similarity. Definition 2 A chain generated by the similarity relation Sd,ε , called Sd,ε -chain, is a non-empty subset of X composed of a sequence of pairs of similar elements: Sd,ε -chain = {x1 Sd,ε x2 , x2 Sd,ε xj−k , ..., xj−1 Sd,ε xj }. The transitivity problem is clearly seen through a chain: the decision maker may judge x1 and x2 as similar, x2 and xj−k as well, and may not judge x1 and xj−l in this way. Then, imperceptible difference is not transitive, and it cannot mean indifference. 8 A Sd,ε -chain is cycle if x1 = xj . A sub-Sd,ε -chain is a Sd,ε -chain contained by another one. Two chains are connected if they share at least one element. A Sd,ε -group is a collection of connected Sd,ε -chains. Two Sd,ε -groups are said disconnected if they do not share any Sd,ε -chain. The next theorem establishes the necessary and sufficient conditions for the existence of a SBP. Under these conditions, similarity relation is transitive and, consequently, imperceptible difference is compatible with rational preferences. Let be a weak order on X, an asymmetric and negatively transitive binary relation. Let R be a equivalence relation on X. Let Ω (R) be the collection of weak orders on X such that xRy ⇐⇒ (not x y, not y x) for all x, y ∈ X. Theorem 1 The union between the similarity relation Sd,ε and a weak order is a rational preference d,ε on X, represented by an ordering-preserving real-valued function ud,ε , if and only if the decision maker perceives X through disconnected Sd,ε -groups; these groups are composed only by cycle sub- Sd,ε -chains; and Ω (Sd,ε ). Additionally, fixed a weak order belonging to Ω (Sd,ε ), d,ε is unique. The intuition behind the above conditions is that the decision maker has to perceive the set X through categories and the asymmetric and symmetric parts of the SBP are compatible. Theorem 1 controls the similarity chains such that they become indifference classes of a preference by requiring spatial characteristics from them on the attribute space. This spatial requirement makes an indifference class a category. This point will be explored in the next section. A SBP is properly specified by d,ε . Its primitive is its symmetric part, which is determined by d and ε. Thus, two decision makers may have the same perception parameters (the same indifference classes) and end up with different rankings because of the different weak orders belonging to Ω (Sd,ε ). The fulfillment of the conditions depends not only on X, but also on d and ε. SBP is not locally insatiable exactly because of perception inaccuracy: 9 given x ε X, there is no y ε X such that d (x, y) ≤ ε and x y or y x. SBP is a complete preorder, not a semiorder. SBP puts the question of “very close” in a different place than a semiorder does. In a semiorder relation, two elements are not distinguishable if their utility values are close enough. Then, the decision maker evaluates the very-close question in the real line. On the other hand, under SBP approach, two elements are imperceptible different if they are close enough based on their attributes. The very-close question is, then, evaluated in the space of the alternative attributes. There is also a point of primitiveness here: if the decision maker judges the difference between two elements as imperceptible, he should not be able to attach different utility values for them. Theorem 1 deals only with the case in which the indifference classes of a preference are solely generated by similarity. This assumption can be relaxed and the Theorem 1 restated. Another equivalence relation R may be defined on X that contains the similarity relation, xRy =⇒ xSd,ε y for all x, y ∈ X. In other words, the evaluation of similarity should be the first step in the decision process. If the weak order is redefined based on R, xRy ⇐⇒ (not x y, not y x) for all x, y ∈ X, the conditions of Theorem 1 are still satisfied. Thus, d,ε is the union of Sd,ε , R and . Vila (1997) extends Rubinstein results to a general three-dimensional choice problem and concludes (page 287) that “transitive preferences can not be consistent with similarity relations”. The fact that this conclusion disagrees with Theorem 1 is not surprising, since the decision making setups are not comparable. Rubinstein’s similarity is defined on each dimension independently as semiorders. Here, similarity is originated on a multidimensional space and the present paper models the case in which similarity is an equivalence relation. The expression chose by Rubinstein to represent similarity is different from the definition of similarity in (1). Notice that Rubinstein’s similarity can reduce the number of dimensions that really matter for the choice, whereas SBP allows for the reduction of the number of alternatives. 10 ε = max diam(C i ) i C2 α = min d (C i , C j ) C1 i≠ j C3 α>ε Figure 1: The indifference classes (C1 , C2 and C3 ) of a SBP 3 SBP and Categorization This section explores how the notion of category can shed light to the transitivity and perception discontinuity problems presented in imperceptible difference. It also investigates the cases where categorization can be rationally induced by imperceptible difference. The first two conditions of Theorem 1 can be restated if similarity is seen as the distance between elements: X has to be partitioned in such way that the maximum diameter among its categories, ε, is strictly inferior than the minimum distance between two different categories, α. Figure 1 illustrates this condition on an alternative set whose attributes are in the R2 and d is the Euclidean metric. Let Ci be the i-th category of X. The condition above implies that every element belonging to the same category is only similar to all others in the same category. The equivalence in (2) formalizes this idea. α > ε ⇐⇒ x, y ∈ Ci and z ∈ Cj , i = j ⇒ d (x, y) < d (x, z) and d (x, y) < d (y, z) 11 (2) The existence of categories makes similarity and imperceptible difference transitive. The disconnectedness inherent to the presence of categories makes ε meaningful in terms of decision making regardless the merit of perception accuracy. Returning to the Mas-Colell, Whinston and Green (1995) example, if the person perceives the difference among shades of the same color as smaller than the difference among shades of different colors, then each color is an indifference class and imperceptible difference can be modeled by a rational preference. Each color may be viewed as a category and ε reflects the discontinuity inherent to different colors. Imperceptible difference, in its turn, gives rationality for special cases of categorization. In costly choices, categorization arises as a tool to control cost by reducing the opportunity set that will really be manipulated by the decision maker. Theorem 1 can also be understood as a representation theorem for categorization in costless choices. Because the elements inside a same category are so alike in comparison with other elements, the decision maker is rationally allowed to be absolutely indifferent among alternatives inside a category. His opportunity set is no longer the entire X, but it is composed by one alternative per category. In this sense, imperceptible indifference, under Theorem 1 conditions, leads to an opportunity-set-selection process. Therefore, imperceptible difference is able to rationally simplify the choice. Consider an investor in the US stock market, who applies the mean-variance investment criterion, building his efficient portfolio with no cost. His forecasts for mean, variance and covariances of next period return of a stock are based on the company fundamentals. He evaluates similarity between two companies based on their fundamentals: he measures differences between the fundamentals through a metric d an has a perception accuracy of ε. Although he knows that the companies are different, if they are similar, he produces the same forecasts for the moments of their expected returns. Because the companies are similar, they contribute exactly in the same manner to the investor’s portfolio. This analysis is extended for the whole set of stocks. If he can partition the companies into categories, he is indifferent among the stocks in the same category. Therefore, he is allowed to build his 12 efficient portfolio considering just one stock per category, simplifying his job. 4 Decision Maker’s Parameters This section works on the influence of the decision maker’s perception parameters on the resulting SBP. Proposition 2 tells that more than one value of perception accuracy thresholds can produce the same SBP. Although the perception accuracy changes, similarity relation may be unchanged and, consequently, the indifference classes are the same. Proposition 2 The preference d,ε is invariant for any decision maker’s perception accuracy threshold belonging to the interval: Td,ε = [diammax , dmin [ (3) where diammax is the maximum diameter among the disconnected Sd,ε -groups and dmin is the minimum distance among them. At first sight, the previous proposition makes suspicious the uniqueness of a SBP claimed in Theorem 1. It is not the case, since a decision maker, at least at same time, can have only one difference measure and only one perception accuracy threshold. Notice that the interval is defined by four elements of X. If the decision maker varies his perception accuracy beyond the limits of (3), he may have preferences with different number of indifferent classes. Because the similarity relation changes, the previous weak order is no longer valid. From a theoretical point of view, the asymmetric part of the preference has to be consistent regardless the number of indifference classes. Therefore, the preference structure has to count with an underlying strict order, o . Theorem 2 claims that as long as the decision maker perception accuracy shrinks (the threshold enlarges), if a different SBP exists, its indifference classes are the results of merges 13 C 5ε ε C 4ε C 3ε ε C1 γ C 2γ C1γ C 2ε Figure 2: Indifference classes of two SBPs with the same d and thresholds ε and γ. of previous SBPs indifference classes. This structure represents an unique hierarchy over the possible sets of indifference classes. Theorem 2 A decision maker who has his perception accuracy strictly decreased, keeping the same underlying strict order o and difference measure d, has different SBPs only if they form a strictly decreasing hierarchical structure. The above hierarchical structure should not be taken for granted: it strongly depends on the definition of both distance between two subsets and similarity relation. Figure 2 shows the indifference classes of two different SBPs with the same d (the Euclidean metric) on alternative set whose attributes are in the R2 . The SBPs differ only on the perception accuracy thresholds, ε < γ. Notice the hierarchical structure involving the indifference classes: C1γ = C1ε ∪ C2ε ∪ C3ε , C2γ = C4ε ∪ C5ε . On the other hand, nothing interesting happens in the case of strictly increasing perception accuracy. 14 The next proposition states that SBP is invariant for same transformations over the metric d if the threshold is properly adjusted. The key point is the notion of equivalence between metrics. Two metrics, d1 and d2 , on the same set X are equivalent if a subset is open in (X, d1 ) if and only if it is open in (X, d2 ). Let I : (X, d1 ) → (X, d2 ) be the identity mapping of (X, d1 ) onto (X, d2 ). Proposition 3 Consider a SBP d1 ,ε and a metric d2 on X. There exist a SBP d2 ,γ that is equal to d1 ,ε if d2 is equivalent to d1 , Td2 ,γ = I (Td1 ,ε ) and the weak order is the same. Proposition 3 tells that, once the topological neighborhoods are the same, the four elements of X which determine Td1 ,ε and Td2 ,γ are the same. The next result states that decision makers with different perception of the world (different d and ε) may end up perceiving the same categories. And, if both share the same weak order on X, the resulting SBP are the same. It is a corollary of Proposition 3. Corollary 1 Consider a decision maker A who has a preference d1 ,ε on X. Consider another decision maker B who has a preference d2 ,γ on X . These two preferences are equal if d2 is equivalent to d1 , Td2 ,γ = I (Td1 ,ε ) and the weak order is the same. 5 Similarities and Preferences on Attributes The SBP on X is originated taking into consideration all the n alternative attributes simultaneously. The multidimensionality of the problem brings a question of decision theory consistency between the overall judgment and the judgments on each attribute. This section investigates the linkages between the (multidimensional) similarity relation and the n similarity relations on the attributes based on the hypothesis that there is no association among the latter n similarity relations. The final question to be addressed is the connections among SBPs on X and on each attribute. 15 The bridges between the multidimensional space and the n attribute spaces are orthogonal projections and norms. Therefore, it is necessary to equip the metric space with linear and geometric structures. The price paid is a stronger requirement over d. Let χ be a real linear space over R and ; : χ × χ → R an inner product on χ defined in n the usual way. Then, (χ, ; ) is a real-valued inner-product space. Let B = {B i }i=1 be its orthonormal base and X be a countable subset of vectors belonging to it. The norm and the metric d on X are induced by the inner product: d (x, y) ≡ x − y ≡ x − y; x − y2 . Then, each x ∈ X is redefined as a n-dimension vector: x = [x1 , ..., xn ]. Assume that χ = Rn and each attribute is represented by one dimension. The definition of similarity relation on the attributes follows Definition 1. Because χ = Rn , the decision about the difference measure on the attribute dimension is natural: the usual metric on the real line. Then, the difference measure is not a parameter of similarity on the attributes. Define X i ⊂ R as the set composed by the i-th attribute of the elements in X, that is X i = (xi , y i , z i , ...) . Definition 3 Let x, y ∈ X. They are said similar regarding to attribute i, xi Sρi i y i , if and only if the absolute value of their difference is lower or equal to ρi . where ρi ∈ ]0, diam (X i )[. xi Sρi i y i ⇐⇒ xi − y i ≤ ρi (4) This definition is extended to all attributes. As these n relations are special cases of Sd,ε , Proposition 1 applies to them. Notice that the thresholds ρi ’s might not be equal and each Sρi i would result in the ε-difference similarity defined by Rubinstein (1988), on page 148, if X was the unitary hypercube in Rn . The conditions for the existence of a SBP on each attribute in isolation are stated by Theorem1 in its one-dimensional-similarity version. Assume that there is no association among similarity relations on the attributes. One example of association among similarity is the correlated similarity introduced by Aizpurua 16 et al. (1991). The authors model the lottery choice with similarity. However, they do not define similarity relations on prize and probability dimensions independently. The idea is that similarity relation on the probability space depends on the prize: the decision maker’s discrimination capacity on probabilities increases as the prize increases. Once the hypothesis above has been made, it makes sense to require that the judgment about similarity on the multidimensional space and on the n attributes be consistent. In order to formalize this idea, two conditions will be imposed on the n + 1 similarity relations. The first one is due to Gilboa and Lapson (1995). They argue that it seems to be unreasonable that the decision maker who does not separately perceive difference among the n attributes of x and y judges them as perceptible different. The second condition is the converse of the first one: if the decision maker does not perceive the difference between x and y, he does not perceive the differences across their n attributes separately. These two consistence conditions make the n + 1 perception-accuracy thresholds mutualdependent. Proposition 4 analyzes the first condition, where the n similarity relations are the primitives. That is, the values of ρi s are given and the proposition imposes a restriction over ε. Proposition 5 deals with the converse case: the threshold ε is given and the proposition restricts the n thresholds ρi s. Both propositions require more structure from d. Proposition 4 Let x and y be two vectors in X which are similar regarding all their n attributes, xi Sρi i y i for i = 1, . . . , n, and d is a metric function induced by a norm on X. n Then, x and y are similar, xSd,ε y, only if ε ≥ ρi . i=1 Proposition 5 Let x and y be two similar vectors in X, xSd,ε y, and d a metric function induced by an real-valued inner product on X. Then, x and y are similar regarding all their n attributes, xi Sρi i y i , for i = 1, . . . , n only if ρi ≤ ε, for i = 1, . . . , n. Proposition 4 demands a stricter condition regarding the perception accuracy thresholds and less structure from the difference measure than Proposition 5. Definition 4 selects the tougher part of the last two propositions and establishes the conditions on both the n + 1 17 thresholds and d in order for the system to be consistent. Equipped with a consistent system of similarity relations, similarity can consistently transit from the multidimensional space to each attribute and vice-versa. Definition 4 A system of n + 1 similarity relations, composed by Sd,ε and n Sρi i , is said to be consistent if and only if the decision maker’s difference measure d is a metric function n induced by an real-valued inner product on X and ε ≥ ρi . i=1 Relaxing the assumption that the judgments about similarity on the attributes are independent, the second condition becomes meaningless. Requiring only the Gilboa and Lapson’s condition, SBP is flexible enough to accommodate interdependence among similarity relations on attributes. Now, it is possible to explore the connections among preferences. Since the primitive of SBP is its symmetric part, the first step is to check the links among the indifference classes of d,ε and of the n preferences iρi . Notice that the disconnectedness of two Sd,ε -groups does not guarantee the disconnectedness of the associated Sρi i -chains (generated by orthogonal projections) on each attribute dimensions. Therefore, even if the system of similarity relation is consistent, the existence of d,ε does not necessarily provide the conditions for the existence of the n iρi preferences. Consider the Figure 2 for a simple example. If the indifference classes of d,ε were only C1ε , C2ε and C3ε , the projections of their points on the horizontal axis do not fulfill the conditions of Theorem 1. Conversely, the existence of the n iρi preferences doe not guarantee the existence of d,ε . 6 Conclusion The present paper reconciled imperceptible difference and rational preference by using the notions of similarity and categorization. It was shown that imperceptible difference can be incorporate into a rational multidimensional preference on a countable alternative set if 18 and only if the decision maker perceives his alternatives through categories generated by similarity. Similarity is a binary relation that models imperceptible difference by controlling the decision maker’s capacity for perceiving difference between alternatives based on their n attributes. It is function of the way the decision maker measures difference (a metric function) and his perception accuracy for differences (a threshold). Two alternatives are similar if and only if their difference is imperceptible — lower than the threshold. A category is defined as a subset of elements that are more similar to each other than to elements of other subsets. This definition was crucial for the whole paper. If the alternative set can be partitioned into categories, imperceptible difference is a transitive relation. And the perception discontinuity mark just captures the disconnectedness existing in the alternative set, that is, the differences among categories. Let the proposed preference be called Similarity Based Preference (SBP). Its primitive is similarity relation. SBP is not a semiorder. SBP puts the very-close question to the decision maker in the n-dimensional space of the attributes. Imperceptible difference, in its turn, gives rationality for special cases of categorization. Theorem 1 can also be understood as a representation theorem for categorization in costless choices. Imperceptible difference is able to rationally simplify the choice under Theorem 1 conditions leading to an opportunity-set-selection process: the decision maker’s opportunity set is no longer the entire X, but it is composed by one alternative per category. Fixed the asymmetric part of the preference, the analysis of invariances of SBP for the decision maker’s perception parameters brings two interesting conclusions. First, two decision makers with different perception for differences of the alternatives (different d and ε) may end up having the same SBP. Second, if the decision maker varies his perception accuracy by enough, the resulting SBPs form a hierarchical structure. That is, in order to reduce the number of indifference classes, the decision maker can only merge the already existing indifference classes. In addition, this hierarchical structure is unique. 19 The linkages between the (multidimensional) similarity relation and the similarity relations on each n attributes, based on the hypothesis that there is no association among the latter n similarity relations, were explored. In order to assure the consistent transit of similarity from the multidimensional space to each attribute and vice-versa a consistent system was defined. A system composed by n + 1 similarity relations is consistent if and only if the decision maker’s difference measure d is a metric function induced by a real-valued inner product on X, and the threshold ε is greater than the sum of the n similarity-thresholds on the attributes. Finally, even if the system of similarity relation is consistent, the existence of d,ε does not assure the existence of the n iρi preferences and vice-versa. References [1] Aizpurua, J. M., Ichiishi, T., Nieto, J. and Uriarte, J. R. (1991), “Decision Making under Risk: Non-Transitiveness and Correlated Similarities”, Department of Economics, Public University of Navarra, Working Paper. [2] Armstrong, W. E. (1950), “A note on the theory of consumer’s behavior. Oxford Economic Papers, 2, 119-122. [3] Barberis, N. and Shleifer, A. (2003), “Style Investing”, Journal of Financial Economics, 68, 161-199. [4] Bernstein, R. (1995), Style Investing, Ney Work: Wiley. [5] Bewley (1973), “Edgeworth’s Conjecture”, Econometrica, 41, 425-453. [6] Camerer, C., Hisa, D. and Ho, T. (2002), “EWA Learning In Bilateral Call Markets”, in Zwick, R. and Rapoport, A. Experimental Business Research, New York: Kluwer Academic Publishers. 20 [7] Chen, Y. and Khoroshilov, Y. (2002), “Learning under Limited Information”, Games and Economic Behavior. Vol. 44, Issue 1, July , 1-25 . [8] Esteban, J. M. and Ray, D. (1994), “On the Measurement of Polarization”, Econometrica, 62, 816-851. [9] Ergin, H. (2003), "Costly Contemplation", Department of Economics, MIT, Working Paper. [10] Fishburn, P. C. (1970), Utility Theory for Decision Making. New York: John Wiley & Sons. [11] Fryer, R. G. and Jackson, M. O. (2004), “A Categorical Model of Cognition and Biased Decision-Making”, Department of Economics, Harvard University, Working Paper. [12] Gilboa, I. and Lapson, R. (1995), “Aggregation of semiorders: intransitive indifference makes a difference”, Journal of Economic Theory, 5, 109-126. [13] Goldstone, R. L. (1999). “Similarity”. In R. A. Wilson & F. C. Keil (Eds.) MIT Encyclopedia of the Cognitive Sciences, 763-765. Cambridge, MA: MIT Press. [14] Keely (2003), “Exchanging Good Ideas”, Journal of Economic Theory, 111, 192-213. [15] Kreps, D. M. (1979), “A Representation Theorem For Preference For Flexibility”, Econometrica, 47, 565-577. [16] Kubrusly, C. S. (2000), Elements of Operator Theory. Boston: Birkhauser. [17] Luce, R. D. (1956), “Semiorders and a Theory of Utility Discrimination”, Econometrica, 24, 178-191. [18] Mas-Colell, A., Whinston, M.D. and Green, J. R. (1995), Microeconomic Theory, Oxford University Press. 21 [19] Medin, D. L. and Aguilar, C. (1999), “Categorization”, In R. A. Wilson & F. C. Keil (Eds.) MIT Encyclopedia of the Cognitive Sciences. MIT Press, Cambridge, Mass. [20] Peng, L. and Xiong, W. (2005), “Investor Attention, Overconfidence and Category Learning”, Journal of Financial Economics, forthcoming. [21] Rubinstien, A. (1988), “Similarity and Decision making under Risk”, Journal of Economic Theory, 46, 145-153. [22] Sarin, R. and Vahid, F. (2004), “Strategy Similarity and Coordination”, Economic Journal, 114, 506-527 [23] Swensen, D. (2000), Pioneering Portfolio Management, New York: The Free Press. [24] Theodoridis, S. and Koutroumbas, K. (1999), Pattern Recognition, London: Academic Press [25] Vila, X. (1997), “On the Intransitivity of Preferences Consistent with Similarity Relations”, Journal of Economic Theory, 79, 281-287. 22 Appendix Proposition 1 Similarity, Sd,ε , is a binary relation on X, Sd,ε ⊂ X × X, which is complete, reflexive and symmetric regardless d or ε. Additionally, it is not transitive in general, and this property depends on (X, d) and on ε. Proof: Because d is a metric on X, the relation applies for all x, y ∈ X: d (x, y) ≤ ε or d (x, y) > ε. Thus, it is a complete relation. It is reflexive: xSd,ε x ⇐⇒ d (x, x) = 0 ≤ ε. It is symmetric, since d is a symmetric function by definition. These three properties are independent of d and ε. It is not generally true that d (x, y) ≤ ε and d (y, z) ≤ ε =⇒ d (x, z) ≤ ε. Thus, it is not transitive in general and its property depends on (X, d) and ε. Theorem 1 The union between the similarity relation Sd,ε and a weak order is a rational preference d,ε on X, represented by an ordering-preserving real-valued function ud,ε , if and only if the decision maker perceives X through disconnected Sd,ε -groups; these groups are composed only by cycle sub- Sd,ε -chains; and Ω (Sd,ε ). Additionally, fixed a weak order belonging to Ω (Sd,ε ), d,ε is unique. Proof: To perceive the set X through subsets means to partition X into these subsets. Necessary condition. Suppose that there exist m disconnected Sd,ε -groups. The distance between two disconnected Sd,ε - groups is strictly greater than ε. Take one of the Sd,ε -groups. If all of its sub-Sd,ε -chains are cycle, then all elements belonging to this Sd,ε -group are similar to each other. This argument is applied to every m disconnected Sd,ε -groups. In a set partitioned in this way, similarity is transitive and Proposition 1 can be used to show that it is an equivalence relation. Thus, since X is countable , Sd,ε induces a countable quotient space, X/Sd,ε . The indifference classes of X/Sd,ε are the disconnected Sd,ε -groups. Let be a weak order (an asymmetric and negatively transitive binary relation) on X. Let Ω (Sd,ε ) be the collection of weak orders on X such that xSd,ε y ⇐⇒ (not x y, not y x) for all x, y ∈ X. Let d,ε be the union of Sd,ε and a ∈ Ω (Sd,ε ). This union is complete and transitive. Therefore, d,ε is a rational preference relation on X. Fishburn (1970), Theorem 2.2, shows 23 that there is an ordering-preserving real-valued function that represents the similarity-based preference such that: x d,ε y ⇐⇒ ud,ε (x) ≥ ud,ε (y) . Sufficient condition. Suppose that d,ε is rational. Since the weak order is asymmetric, the similarity relation Sd,ε must quotient X out. Then, it has necessarily to be an equivalence relation. Proposition 1 states that Sd,ε is reflexive, symmetric, and, in general, intransitive. According to Definitions 1 and 2, X has to be composed of disconnected Sd,ε -groups and these ones have to be composed only by cycle sub- Sd,ε -chains in order to be transitive. Fixed , d and ε, the uniqueness of d,ε results from the fact that the collection of all equivalence relations on a set is in a one-to-one correspondence with the collection of all partitions of X. Proposition 2 The preference d,ε is invariant for any decision maker’s perception accuracy threshold belonging to the interval: Td,ε = [diammax , dmin [ where diammax is the maximum diameter among the disconnected Sd,ε -groups and dmin is the minimum distance among them. Proof: Suppose the number of disconnected Sd,ε -groups is m. By Theorem 1, the diameter of a Sd,ε - group may be strict less than ε. Let diammax be the maximum diameter j of the disconnected Sd,ε -groups: diammax = max diam Sd,ε − group ≤ ε. On the other 1≤j≤m hand, the same theorem implies that the difference between two disconnected Sd,ε -groups is strictly greater than ε. Define dmin as the minimum difference among the disconnected Sd,ε i j groups: dmin = mind Sd,ε − group, Sd,ε − group > ε. As long as the similarity relation is i=j redefined by any real value belonging to the interval Td,ε , defined by (2), the m disconnected Sd,ε -groups are the same and they observe the conditions of Theorem 1. Thus, the following two rational preferences, d,ε = Sd,ε ∪ and d,γ = Sd,γ ∪ , are equal: if γ ∈ Td,ε . 24 Theorem 2 A decision maker who has his perception accuracy strictly decreased, keeping the same underlying strict order o and difference measure d, has different SBPs if they form an unique strictly decreasing hierarchical structure. Proof: As the decision maker’s perception accuracy decreases, the threshold ε increases. Let ξ = {εi }ni=1 be a strictly increasing sequence of positive real numbers belonging to the allowed interval for ε: ]0, diam (X)[. A strictly decreasing hierarchical structure means that x d,εi y ⇒ x d,εj y and d,εi = d,εj for i < j and all x, y ∈ X. Suppose that Theorem 1 applies for each element of ξ in order to generate a similarity-based preference. Let P = {d,εi }ni=1 be the collection of these n preferences. Assume that they are different. The underlying strict order o is the same across the preferences, that is, o is a strict order on the n quotient spaces X/Sd,εi , i=1, . . ., n. Then, the source of the difference has to be their symmetric parts. More precisely, the n quotient spaces must differ among them. Let M = {mi }ni=1 be the sequence composed of the number of indifference classes associated with each d,εi . Since d is kept unchanged, the perception accuracy thresholds are the responsible for the difference. Proposition 2 requires that εi ∈ / Td,εj for i = j. For any x, y ∈ X and εi , i=1,. . . , n-1, xSd,ε y ⇒ xSd,εi+1 y, because d (x, y) ≤ εi ⇒ d (x, y) ≤ εi+1 . It means that elements belonging to a same disconnected Sd,ε -group are in the same disconnected Sd,εi+1 -group, for i=1,. . . , n-1. Because the uniqueness of the m-partition that satisfies the conditions of Theorem 1, mi > mi+1 . Therefore, the increasing of the perception accuracy threshold implies only in merger of the existing indifference classes. The uniqueness of the m-partition can be used again to show that P = {d,εi }ni=1 is unique. Proposition 3 Consider a SBP d1 ,ε and a metric d2 on X. There exist a SBP d2 ,γ that is equal to d1 ,ε if d2 is equivalent to d1 , Td2 ,γ = I (Td1 ,ε ) and the weak order is the same. Proof: Let m be the number of indifference classes of d1 ,ε . By definition, if d1 ˜ d2 , the topologies of (X, d1 ) and (X, d2 ) are equal. Then, d1 (x, y) < d1 (x, z) and d1 (x, y) < 25 d1 (y, z) ⇔ d2 (x, y) < d2 (x, z) and d2 (x, y) < d2 (y, z) for all x, y, z ∈ X. By (2), there exist a γ ∈ ]0, diam (X)[ such that the similarity relation Sd2 ,γ on X is an equivalence relation and generates the same indifference classes of d1 ,ε . By (2) yet, γ = max diam Sdj2 ,γ − group . 1≤j≤m Since Sd2 ,γ = Sd1 ,ε , Sd2 ,γ ⇐⇒ (not x y, not y x) for all x, y ∈ X. Thus, d2 ,γ = Sd2 ,γ ∪ =d1 ,ε . Proposition 2 tells that d2 ,γ is invariant for any perception accuracy threshold belonging to Td2 ,γ . Let C i = Sdi 1 ,ε − group = Sdi 2 ,γ − group for i = 1, ..., m. Td2 ,γ = max maxi d2 (x, y) , min min d2 (x, y) and 1≤j≤m x,y∈C i=j x∈C i ,y∈C j Td1 ,ε = max maxi d1 (x, y) , min min d1 (x, y) . Let I : (X, d1 ) → (X, d2 ) be the i j 1≤j≤m x,y∈C i=j x∈C ,y∈C identity mapping of (X, d1 ) onto (X, d2 ). Kubrusly (2000), in Corollary 3.19 (page 108), shows that d1 ˜ d2 ⇔ I is continuous. Therefore, I [Td1 ,ε ] = max maxi I [d1 (x, y)] , min min Id1 (x, y) = 1≤j≤m x,y∈C i=j x∈C i ,y∈C j max maxi d2 (x, y) , min min d2 (x, y) = Td2 ,γ . i j 1≤j≤m x,y∈C i=j x∈C ,y∈C Corollary 1 Consider a decision maker A who has a preference d1 ,ε on X. Consider another decision maker B who has a preference d2 ,γ on X . These two preferences are equal if d2 is equivalent to d1 , Td2 ,γ = I (Td1 ,ε ) and the weak order is the same. Proof: Immediate application of Proposition 3. Proposition 4 Let x and y be two vectors in X which are similar regarding all their n attributes, xi Sρi i y i for i = 1, . . . , n, and d a metric function induced by a norm on X. Then, n x and y are similar, xSd,ε y, only if ε ≥ ρi . i=1 n Proof: Let x, y ∈ X. Since X is a linear space and B = {B i}i=1 its orthogonal n n i i i i i i basis, (x − y) = (x − y ) B . Since X is a normed space, x − y = (x − y ) B ≤ n i=1 i i i=1 i (x − y ) B . As x and y are similar in terms of all their n attributes, B is orthonormal i=1 and the metric d is induced by the norm .in X, d (x, y) ≤ n ρi . Therefore, in order to x i=1 26 and y be similar in the multidimensional space, according to Definition 1, ε ≥ n ρi . i=1 Proposition 5 Let x and y be two vectors in X which are similar, xSd,ε y, and d a metric function induced by a real-valued inner product on X. Then, x and y are similar regarding all their n attributes, xi Sρi i y i for i = 1, . . . , n, only if ρi ≤ ε, for i = 1, . . . , n. Proof: Let x; y be an inner product on X ⊂ Rn and d (x, y) ≡ x − y ≡ x − y; x − y2 . By definition, xSd,ε y ⇐⇒ d (x, y) ≤ ε. As the distance d is induced by the norm ., d (x, y) ≡ x − y ≤ ε. Define the absolute value of the difference between x and y in (x−y),Bi each dimension as the projection of x − yon it: projBi x − y = x−yBi x − y = |cos ((x − y) , B i )| x − y, for all i = 1, . . . , n. Then, xi Sρi i y i ⇐⇒ |xi − y i | = |cos ((x − y) , B i )| x − y ≤ ρi . Since x − y ≤ ε and |cos ((x − y) , B i )| ≤ 1, if ρi ≤ ε, then xi Sρi i y i for all i = 1, . . . , n. 27
© Copyright 2026 Paperzz