Information Fusion in a Cooperative Multi-agent System for Web Information Retrieval K. B. Shaban O. A. Basir K. Hassanein M. Kamel School of Engineering University of Guelph Guelph, Ontario, Canada N1G 2W1 [email protected] School of Engineering University of Guelph Guelph, Ontario, Canada N1G 2W1 [email protected] School Of Business McMaster University Hamilton, Ontario, Canada L8S 4L8 [email protected] Systems Design Engineering University of Waterloo Waterloo, Ontario, Canada N2L 3G1 [email protected] Abstract – As an attempt to solve some contemporary web information retrieval problems, a construction of a cooperative multiagent system is proposed. This paper introduces the system and presents the use of a unique aggregation and fusion technique that is employed to attain reliable delivery performance. The intelligent methodology to fuse agents’ decisions, the team consensus approach, models the interaction and bring a society into a consensus. After each agent in the group gathers information relevant to a user’s query, the group engages in an uncertainty estimation stage. This process allows each agent to assess its self-uncertainty and the conditional uncertainties of other agents. The procedure facilitates the computation of a weighting scheme that operates recursively on information collected by these agents until the group reaches a consensus. Whenever a new task is received, the uncertainty estimates of agents are updated and used to compute a new weighting scheme. Keywords: Information fusion, decision fusion, team consensus, cooperative multiagent systems, web information retrieval. 1 Introduction As the value and range of information available on the Internet has grown, so the need to manage this information effectively has grown, and has given a rise to what is known as the Internet information overload (or overkill) problem. Put simply, the volume of information available via the World Wide Web (WWW, or Web) represents a very real problem. The potential of this problem is immediately apparent to anyone with more than the most superficial experience of using the WWW. The information overload problem can be characterized in two management processes; Information filtering, and Information gathering. It is of a great benefit to take the burdens off users’ shoulders and automate these tasks [7]. There are many publicly available information retrieval tools (i.e., search engines, web directories, etc.), but users ISIF © 2002 are not necessarily satisfied with their different retrieval speeds, qualities of retrieved information, formats of inputting queries and presenting results, database sizes, and on-line availabilities [6]. These tools are even contributing to the information overload problem, as they require the end user to constantly direct the management process. Users have to look for the desired information, sort the wheat form the chaff, and focus on information they need. Moreover, the gathering, organization, and handling process in these systems is focused on the low information (or even data) representations instead of the higher knowledge levels, and that is considered as a strong factor leads to theses systems retrieval impreciseness [11] A construction of a cooperative multiagent system (MAS) for web information retrieval is being proposed. The system shall overcome pitfalls current Web-based information retrieval tools are facing. It consists of a distributed group of entities (Intelligent Agents1) that mimic everyday life of information seekers. Each member of this group has the ability to be goal and pattern driven, autonomous, and rational in a cooperative and interactive setting. Agents are responsible for perceiving users’ desires, setting goals, exploiting all efforts to achieve these goals, filtering and customizing results, and presenting them in appropriate formats. They may also have more interesting features such as tracking their perspective users’ browsing behaviours, and trying to anticipate what items may be of interest through building profiles of the accumulated knowledge (see figure 1). At a certain level of abstraction, many MAS applications share common features, e.g., collaboration. Individual agents are designed and built to enact particular roles. These agents are autonomous, goal directed entities, which are responsive to their environment. They must 1 Definition of multiagent systems, intelligent agents, and details of everevolving technology are beyond the scope of this article. However, the reader is referred to the literature to gain familiarity to the subject [3, 4]. There are several definitions of agents and multiagent systems, and one can also describe rather than define agents in terms of their task, autonomy, communication capabilities, etc. 1256 typically interact in order to carry out their roles. Such interactions are a natural consequence of the inevitable interdependencies, which exist between the agents, their environment, and their design objectives. One important aspect of the cooperation property is the ability to reach a consensus in decisions made by the different agents at different levels of information abstraction. There is always a certain amount of uncertainty associated with decisions made by these agents. This uncertainty problem can prevent information retrieval agents from answering complex queries, or preclude personal agents from figuring out ambiguous user actions. Generally speaking, uncertainty in agent decisions can be attributed to two main aspects, namely, deficiency in agent decision-making capability, and application information ambiguity. The first aspect is generally due to agent design/implementation defects. The second aspect is due to information overload. For example, in an environment such as the Internet, properties like inaccessibility, non-deterministic, and dynamic nature of the information space, are sources of agent imprecise decisions. Each one of these sources may induce a certain amount of uncertainty to the agents’ assessments and may cause them to acquire the wrong knowledge, which may lead to unfavourable results. Thus, procedures to reduce uncertainty are crucial for improving the capability, increasing the reliability, and widening the applicability of such systems. agents. The procedure facilitates the computation of a weighting scheme that operates recursively on decisions made by these agents until the group reaches a consensus. Whenever a new task is received, the uncertainty estimates of agents are updated and used to compute a new weighting scheme. The approach is computationally tractable, and analytical as well as comparative studies have demonstrated that the approach can be applied to many practical information fusion and integration applications [8]. The breakdown of the paper as follows; section 2 formally state the decision fusion problem; section 3 illustrates types of cooperative agents’ team settings and suggest the use of the team consensus arrangement; section 4 discuss the team consensus approach and its hypothetical assumptions in assessing agents importance weights; section 5 introduce the concept of how weights can be used to reflect uncertainties; section 6 provides how uncertainty estimations of agent decisions can be acquired and used to assign weights coefficients; section 7 gives an illustrative case of the whole fusion procedure where a consensus decision can be noticed; section 7 concludes the paper. User In order to reduce both aspects of uncertainty, one may intuitively suggest building ingenious agents and structuring their environment. However, the use of agents capable of digesting the gigantic knowledge on the Internet and produce accurate answers to all users’ enquiries may not be achievable, or at least may result in very costly systems. Moreover, practically speaking, it is not always possible to structure the agent-working environment in such a way that reduces uncertainty. As in the Internet, where the majority of information resources are dynamically changing, it is unfeasible to construct an inclusive knowledge space. A more practical approach is to view these agents as a collaborative society of agents that communicate their assessments and engage in a discourse so as to reach a consensus with respect to the best decision possible. By combining decisions made by different agents, more effective decision-making can be achieved. There are several attractive decision fusion methodologies, from different disciplines, based on statistical methods, probability techniques, artificial intelligence approaches, and others [1]. This article presents the use of an intelligent approach, the team consensus approach, which has been applied in multisensory data fusion [9]. The approach tries to bring team into a consensus by having agents engage in an uncertainty estimation stage whenever a group decision is required. This process allows each agent to assess its selfuncertainty and the conditional uncertainties of other Personal Agent Resource “Information Retrieval” Agent User Personal Agent Intermediate “Fusion” Agent Environment “The Web” Figure 1. Envisioned View of the System 2 Fusion Problem Formulation Consider a group of N agents, indexed by the set A = { A1, A2, …, AN}, gathering information from their perspective environment (e.g., the Web). Here, a decision theory formulation is utilized where each agent Ai observes a subset θi over a universal information space given by Θ. 1257 An information structure ηi is used to relate θi to a belief zi. Thus, zi = ηi (θi ) 7: display (1) where zi ∈ ℑ , the knowledge space. 1 Upon receiving a query q, Ai chooses an answer ai from a set of possible answers Γ i= (γ1, ..., γM). This answer is related to the belief zi by a decision function δi as ai = δi (zi) 6: get final answer 1 / Personal Agent * 1: enquire 1 / Fusion Agent 2: request 1 User 3: query (2) 5: get answer * Each agent processes its own beliefs, which might be different from the beliefs of other agents, and uses them to rank its answers. Collectively, the n-tuple pair η = (η1, …, ηn), and δ = (δ1, …, δn), respectively, are the information structure and the decision rule of the group. / Retrieval Agent 1 4: answer The decision integration problem can be stated as finding the group answer a ∈ Γ that constitutes the group preference. To achieve that, a ranking function that places a preference ordering on the answers of each agent is defined as Ri(δi (zi ), q): Γ × Θ → ℜ for each Ai ∈ A, and k∈Γi * :ExternalResource Figure 2 (a) Collaboration Diagram of Centralized Team Ri (γk) = 1. The actual form of each ranking function can vary considerably depending on the nature of the decision problem and the tactic agents follow. For instance, information retrieval agents might be equipped with different computational methods (e.g., LSI1, VSR2, etc.) for automated indexing, representations, and then ranking their findings. 10: display 1 9: get final answer 1 / Personal Agent * 1: enquire 2: request 1 User A global ranking function RG, i.e., the group ranking function, is then defined to aggregate the expected rankings of all members, RG = f(R1, …, Rn). The performance of the agents as a group is influenced by this function. The actual form of this function will be shown in section 4. * / Retrieval Agent * 1 1 2 Latent Semantic Indexing [2]. Vector Space Representation [13]. * * / Retrieval Agent 5: query 6: get answer 7: answer * :ExternalResource 1 3: query 8: get answer 3 System Collaboration Model Agents’ groups can be categorized in terms of relationships and interactions between members into two types, namely, non-recursive and recursive arrangements. In the non-recursive organizations, information flow between members of the group is always permitted in only one direction. The standard UML collaboration diagram [10] in figure 2a is a snapshot of one variation of this type of setting, the centralized team. Another example of the non-recursive type is also modeled and showed in figure 2b. This setting can be realized by assuming a natural precedence order in which the members of the group communicate their answers. In this group, the decision of the ith member depends only on its internal beliefs and the decision of the (i – 1)th member. 1 / Fusion Agent 1 4: answer * :ExternalResource Figure 2 (b) Collaboration Diagram of Sequential Team In the recursive category, however, each member of the group can engage in a discourse with other members of the group. They are suggesting and verifying each other’s answers. This provides a powerful means to resolve environment ambiguities, a proficiency that makes the recursive groups the best choice for integrating information provided by multiple agents. This type of network is called the group consensus and was suggested by DeGroot [5] to aggregate multiple opinions into one decision that constitutes the group consensus (Figure 2c). This consensus is achieved by allowing all group members 1258 to linearly pool their assessments in a recursive manner. This model is simple and intuitively appealing, and it will be used here to aggregate agents’ knowledge. 18: display 1 User limit theorems of Markovian chains to determine whether the group will converge to a common ranking, which represents the group consensus, and if so what is the value of this ranking. It has been shown and proven in [5, 12] that the group will converge to a common ranking if and only if there exists a vector π such that. 17: get final answer 1 / Personal Agent * 1: enquire 2: request 1 4: query subject to 1 πi = 1 (5) N RG(γk) = * * / Retrieval Agent * 6: answer 10: get answer 14: answer 12: get answer 1 πi × Ri(γk) (6) As the weights, wij, of this model are estimated subjectively by each member of the group, they are meant to reflect experience, honesty, etc. Nonetheless, they can also be used to reflect the uncertainty of agents about their decisions. It is fully obvious that this uncertainty is different in various situations, differs from agent to agent, and it will manifest in the ranking assessments of each agent. In this way, weights will be adaptive and have some dynamic behaviour, in the sense that they portray the changes in the state of knowledge in the group. Weights are assigned accordingly as to exhibit their self and peers confidence degree. 5: answer 13: answer * :ExternalResource * :ExternalResource i =1 * / Retrieval Agent 9: give answer 11: give answer And the common group ranking, for each γk∈ Γ denoted by RG(γk), k = 1, …, M, is given by 7: get answer 15: get answer 1 (4) i∈A 3: query 8: get answer 16: get answer π × w=π 1 / Fusion Agent Figure 2 (c) Collaboration Diagram of Consensus Team 4 Team Consensus for Fusion 5 Uncertainty Estimation In this model, each individual agent must first assess its The objective here is to seek a function that, by processing the decisions made by a group of agents, can estimate their uncertainties. This function should reflect the contrasts of agents’ decisions. Given the uncertainty levels of all agents, one only needs to adjust the weights of the agents such that the more uncertain an agent is lesser weight it receives. ∀ γk ∈ Γ i. It is then own expected rankings confronted with the decisions of the other group members and revises its own by making an assessment of each group member’s relative importance, expertise, honesty, etc. Specifically, each revised expected ranking is deemed to be of the form * R i (γk), N * R i (γk) = wij Rj(γk) (3) j =1 where wij is a positive importance weight assigned by the ith member to the jth member and N j =1 wij = 1, ∀ i, j ≤ N. Each agent revises its opinion in this manner where it should update its own decision in light of the revisions made by others. The process continues until further revision no longer changes the expected ranking of any member. Since w is an N × N stochastic matrix, it can be viewed as the one-step transition probability matrix of Markovian chain with N states and stationary transition probability. This interpretation enables one to use the In a communicative group of agents, two types of uncertainties seem to emerge. The first uncertainty is local to the agent and reflects the quality of decisions it makes. The second uncertainty is global and manifests the cooperation of agents in terms of knowledge exchange. Given their answers as well as of other agents, they will be able to adjust their uncertainty level. The latter mechanism mimics how agents improve their state of knowledge by learning from each other. There are two types of uncertainties that can be used to model this estimation process: the self-uncertainty and the conditional-uncertainty. The self-uncertainty measures how uncertain the agent about its decisions or how random are the choices of the agent. The more certain is an agent the higher contrast are its choices. Let Ui|i indicate the 1259 self-uncertainty of Ai . Ui|i is computed based on the local knowledge of the agent as Similarly, taking the partial derivative of Vi with respect to the Lagrange multiplier ρ and equating with zero yields M Ui|i = - Ri (γk) logM Ri(γk) (7) Combining eqs. (13) and (14) yields The conditional-uncertainty, however, is a measure of the state of uncertainty of an agent given the answers of other agent. This measure can be used to capture the essence of knowledge relevancy between agents. M Ri (γk| Γ j) logM Ri(γk| Γ j) j∈A ρ =1 2× U 2j|i (15) 2 U −j|i2 j∈ A (16) It then follows that (8) k =1 ρ= In general, for a team of N agents, these uncertainties are arranged in a matrix form as Substituting eqs. (16) and (13) gives the agent weighting coefficient, wij, as follows: U1 | 1 U2 | 1 . . . UN | 1 U1 | 2 U2 | 2 . . . UN | 2 ... ... ... ... U= U1 | N U2 | N . . . UN | N (14) j∈A k =1 Ui|j = - wij = 1 (9) wij = 1 U 2 j |i (17) U −2 k ∈A k |i 7 Illustrative Scenario 6 Uncertainty Based Weightings Now, given the uncertainty matrix U, each agent of the group can determine appropriate weights for itself and other agents. This can be achieved by minimizing the sum of squares of its self-uncertainty and conditional uncertainties associated with other agents. This implies that each agent will assign high weights to agents with low conditional-uncertainties and low weights to those with high conditional-uncertainties. The minimization problem may be stated as follows: Minimize Ti = w ij × U j |i , 2 2 Consider a multiagent system consists of three information retrieval agents each observing a limited portion of the Web. For simplicity, a maximum of four answers from each agent will be considered (Table 1). (10) j∈A N subject to and wij ≥ 0 wij=1, (11) j =1 Vi = w ij × U j|i - ρ 2 2 j∈A wij − 1 j∈A (12) where ρ is the Lagrange multiplier. Taking the partial derivative of Vi with respect to wij and equating it to zero yields wij = ρ 2× U 2j|i γ1 γ2 γ3 γ4 R1(γk) 0.8 0.17 0.03 N/A R2(γk) 0.333 0.333 N/A 0.333 R3(γk) N/A 0.6 0.2 0.2 R1(γk| Γ2) 0.75 0.1 0.1 0.05 R1(γk| Γ3) 0.9 0.04 0.04 0.02 R2(γk| Γ1) 0.6 0.2 0.05 0.15 R2(γk| Γ3) 0.2 0.4 0.1 0.3 R3(γk| Γ1) 0.3 0.4 0.1 0.2 R3(γk| Γ2) 0.1 0.5 0.1 0.3 Table 1. Illustrative Scenario The above minimization problem subject to the above constraints is equivalent to minimization of R\Γ Γ After receiving a query, every agent ranks its findings based on local information. These assessments need not to be the same as agents’ knowledge and reasoning capabilities various. After the first information exchange course finishes, each agent will provide new ranked answers. The uncertainty matrix for this group as computed using equations (7) and (8) is: (13) 1260 agent 1 would benefit the most (in terms of uncertainty) from these results. Similarly, one can point to other interesting facts about the relational behaviour of the group by considering the rest of the conditionaluncertainties in the uncertainty matrix. 0.53 0.77 0.92 U = 0.6 1.0 0.84 0.31 0.92 0.86 Then each agent of the group uses this matrix to determine appropriate weights for itself as well as for other agent in the group. Whenever new decision task is required, uncertainties are recalculated and used to update the weight matrix accordingly. The computed weight matrix, computed using equation (17) is: 0.55 0.26 0.18 w = 0.53 0.2 0.27 0.8 0.09 0.11 The vector π computed from the above weighting matrix, using equation (4) is: π = [0.6 0.2 0.2] By linearly combining the assessments of the three agents, using equation (6) the group will converge to the following group ranking: R\Γ Γ γ1 γ2 γ3 γ4 RG(γk) 0.55 0.29 0.06 0.1 Clearly in this case, the group will favor γ1 as its ranking is greater than all other answers, and the answers will be ordered according to the global ranking as γ1, γ2, γ4, and γ3 respectively. From the above case the following observations can be noticed. First, the self-uncertainties of the three agents (the diagonal of the matrix U) are different, and the second agent has the highest self-uncertainty level. This is in agreement with its ranking assessment in Table 1. It is clear from the table that the three agents have different assessments as to what is the answer to the query. The information gathered by agent 2 did not enable it to have any preference, which might rationalize why it has the highest self-uncertainty level. Other reasons may also be blamed for getting such answers (e.g., design/implantation defects). On the other hand, the other two agents seem to have more preference; Agent 1 is more toward the first answer γ1, while Agent 3 has no clue of the first answer, and giving a high rank to the second answer γ2. 8 Conclusion The development of a multiagent system is deemed to be the key to solve some of contemporary Web information retrieval problems. Justifications and a brief introduction to the system were offered in this article. Following that, a presentation of an intelligent information fusion approach, the team consensus approach, to integrate agents’ decisions was given. The approach treats the agents as a group of experts cooperating with each other to perform a common goal. After each agent in the group makes its own decisions concerning the given task, the group engages in an uncertainty estimation stage. This process allows each agent to assess its self-uncertainty and the conditional uncertainties of other agent. The process facilitates the computation of a weighting scheme that operates recursively on agent beliefs until the group reaches a consensus. Whenever new group decision is required, the uncertainty estimates of agents are updated and used to compute a new weighting scheme. An illustrative scenario to demonstrate the whole procedure is also given. As a future extension of the work, other factors than uncertainties could be considered. For instance, an experience factor reflecting how successful an agent has been during a considered period of time is worth including. Sometimes an agent should not be penalized based just on its decisions uncertainty. Moreover, uncertainty could even be neglected if qualities of all or most decisions are good. The other suggestion is to extend the fusion technique so that can be applied in different levels and shapes of information abstractions. Integrating knowledge in forms other than decisions will be definitely needed in most of multiagent application processes. References Second, conditional-uncertainties (off diagonal entries) in the U matrix tell us the change of the uncertainty state of agents after receiving their peers’ findings. Take as an example the third row of the matrix, which depicts the conditional uncertainty of the first and second agents when they are given the answers the third agent. It indicates that when agent 3 passed its results to agent 1 and agent 2, 1261 1. 2. 3. 4. Belur V. Dasarathy, Decision Fusion, IEEE Computer Society Press, 1994. Deerwester, S., Dumais, S. T., Landauer, T. K., Furnas, G. W. and Harshman, R. A., "Indexing by latent semantic analysis." Journal of the Society for Information Science, 41(6), 391-407, (1990). Gerhard Weiss, Multi-agent Systems A Modern Approach to Distributed Artificial Intelligence, The MIT Press, 1999. J. M. Bradshaw, Software Agents, American Association for Artificial Intelligence and MIT Press, 1997. 5. 6. 7. 8. 9. 10. 11. 12. 13. M. H. DeGroot, “Reaching a consensus,” Journal of the American Statistical Association, 69, 118121 (1974). M. Kobayashi and K. Takeda, “Information Retrieval on the Web,” ACM Computing Surveys, 32(2), 2000. Nicholas R. Jennings, and Michael J. Wooldridge, Agent Technology: Foundations, Applications, and Markets, Springer, 1998. O. A. Basir and H. C. Shen, “New Approach for Aggregating Multi-Sensory Data,” Journal of Robotic Systems 10(8), 1075-1093 (1993). O. A. Basir and H. C. Shen, “Sensory data fusion: A team consensus approach,” IEEE International Conference on Man, Systems, and Cybernetics, 1992, pp. 290-296. Object Management Group (OMG). Unified Modeling Language (UML) Resource Page. http://www.omg.org/uml/ P. Martin and P. W. Eklund, “Knowledge Retrieval and the World Wide Web,” IEEE Intelligent Systems, 2000. R. L. Berger, “A necessary and sufficient condition for reaching a consensus using DeGroolt’s method,” Journal of Statistical Association, 76(374), 415-418 (1981). Salton, G., McGill, M. J., Introduction to Modern Information Retrieval, McGraw-Hill, 1983. 1262
© Copyright 2026 Paperzz