Toward principles for the representation of hierarchical knowledge in formal ontologies Dean M. Jones, Ray C. Paton Department of Computer Science, University of Liverpool, Liverpool, UK, L69 7ZF Abstract Early ontological engineering methodologies have necessarily focussed on the management of the whole ontology development process. There is a corresponding need to provide advice to the ontological engineer on the finer details of ontology construction. Here, we specifically address the representation of hierarchical relationships in an ontology. We identify five types of problem that may be encountered in moving from an informal description of a domain to a formal representation of hierarchical knowledge. Each problem type is discussed from the perspective of knowledge sharing and examples from biological ontologies are used to illustrate each type. Keywords: Ontological engineering; Hierarchical relationships; Knowledge sharing 1. Introduction It is largely accepted that one of the most important aspects of a domain ontology is the organisation of class definitions into inheritance hierarchies. Methodologies that describe processes for developing domain ontologies commonly include recommendations to define hierarchical relationships. For example, in the METHONTOLOGY approach to ontology development described in [5], it is suggested that the representation of concepts as ‘classification trees’ is one of the main steps in ontology development. Taxonomic analysis, during which hierarchical relationships in a domain should be identified and represented, is defined as a distinct stage in the latest version of the ONIONS approach to ontology development [19]. These methodologies are mainly concerned with the organisation of the various activities that an ontological engineer engages in and providing a means of managing the many products of the ontology development process. There has been a necessary focus on what must be done in the development of the formal ontologies. However, there is a parallel need to provide advice to the ontological engineer on how to achieve each stage in the development of a formal ontology. Here, we focus on the development of hierarchical representations. Although the representation of hierarchical knowledge is widely seen as an important aspect in the design of a formal ontology, very little advice is available on the problems that may be encountered and how these problems might be addressed. Furthermore, the advice that is available (e.g. [4]) is often based on the classical model of category membership. The classical model states that an object is an instance of a category if and only if it exhibits certain necessary features. It has long been recognised that this is an inadequate model of the way in which category membership is decided (see [11] for a good summary of the arguments). As the frame-based paradigm was developed largely in response to non-classical theories of category membership and concept definition [12], the problems involved in adopting the classical model of categorisation have been recognised for almost as long by the knowledge representation community. There has, however, been little work on how the complexity of category structure can be addressed in practice. The aim of this article is to describe some common problems that are encountered in the formal representation of hierarchical knowledge and discuss ways in which these problems can be addressed. The problems described are illustrated with examples taken from cellular and molecular biology. This domain was selected because: (a) hierarchical conceptual structures have traditionally been important in the organisation of knowledge in the biological sciences. This is exemplified by the most familiar representation of biological knowledge, the classification of living entities into kingdoms, phyla, classes, orders, families, genera, species, etc. (b) biological systems are generally very complex, especially at the cellular and molecular levels [15,10]. The hierarchical relations in these domains are correspondingly complex and are therefore likely to exhibit any problems. (c) scientific domains are generally more formal than non-technical domains. It is hoped that in analysing biological theories, it will be seen that a fortiori the same problems will exist in non-technical, less formal domains. In the following sections, we analyse some hierarchical relationships in biological domains and discuss some of the issues that are relevant to the representation of this knowledge in a formal ontology. The source for this analysis was [2], a standard text used in the domain. 2. Classification Problems Firstly, as there is often confusion regarding the terms that are used to discuss hierarchical representations of knowledge, definitions of some terms will be useful. entity: a member of the Universe of Discourse. class: a class c is a subset of the Universe of Discourse to which some predicate applies. The set of classes c1 to cn form a partial order. If c1>c2 then c2 is a subclass of c1. instance: a member of a class. We now describe five types of classification problem. A classification problem is informally defined as a situation in which there might be some confusion about the hierarchical relationships that should be included in a formal ontology. Each type of problem will be defined more fully in the following subsections. The types that we have so far identified are: (i) atypical instances: an instance is not a typical example of a class to which it belongs. This may lead to difficulties with identification of the instance as a member of the class and with inheritance of properties from the class to the instance. (ii) overlapping siblings: a member of a class is also a member of a sibling of that class. (iii) context-sensitive subclasses: whether a hierarchical relationship is said to exist between two classes or not varies between contexts. (iv) excluded instances: an instance of a class cannot be included as an instance of any of the immediate subclasses of that class. (v) non-instance similarity: an entity satisfies the criteria for membership of a class which it is not a member of. Instances of classification problems can hinder the development of a formal ontology from an informal description of a domain. They may also hinder the sharing of knowledge across resources where different decisions have been made in the resolution of these problems. It is hoped that a classification of these problems will aid the identification of the issues involved in developing a formal representation. It is commonly the case that a formal ontology is intended to be reusable. It has been suggested that in such situations, the definitions which comprise a formal ontology should be objective [7]. Although desirable, this is a difficult ideal to satisfy and evaluate in practice. We prefer to use the term verisimilitude to describe the purpose of this goal. Verisimilitude is as a property of scientific theories that “have stood up to severe criticism, including tests” ([17], p.57). Ontologies should be assessed in relation to the requirements of the ontology. Thus verisimilitude can only be evaluated with respect to the intended use of the ontology. We will introduce some tests that can be applied during the ontology design phase in order that the verisimilitude of an ontology can be maximised prior to a final assessment. Each of the five types outlined above will now be described in more detail and illustrated with examples. Some of the issues raised for their formal representation will be discussed in relation to these examples. 2.1 Atypical Instances We begin the discussion of the various kinds of classification problems with one that will be familiar to knowledge engineers. The existence of atypical instances has long been recognised. One of the most important analyses was that of Rosch and her colleagues (e.g. [18]), which focussed on two implications of the classical view of categories: (a) that no members of a category can be better examples than others. (b) that category membership is independent of the person doing the categorisation. Experimental evidence showed the existence of typicality ratings, where some members of a class are judged to be more typical examples of that class than other members. The explanation that was given for this effect was that categories exhibit gradience i.e. that some instances are more ‘central’ to the categories than others. Membership of a class is therefore a matter of degree, based on the similarity of an entity to the prototype of the category. Typicality ratings are explained as resulting from differing degrees of similarity to this prototype. Although the prototype view is no longer widely held as an explanation for human categorisation, the existence of typicality ratings is not disputed. As an example, consider one of the most basic distinctions made in cell biology, the classification of cells into one of two kinds, procaryotic or eucaryotic. The distinguishing features of these cells are given in Table 1 (reproduced from [2]). This table is introduced by the sentence: “the major existing eucaryotes have in common both mitochondria and a whole constellation of other features that distinguish them from procaryotes (Table 1-1)” ([2], p.19). Table 1 is taken to describe the distinguishing features of the two child classes of the class ‘cell’ and constitutes an informal representation of the classes ‘eucaryotic cell’ and ‘procaryotic cell’. According to the classical theory, the properties described here could be taken to be necessary and sufficient features of these classes. However, many of the listed properties are generalisations that do not apply to every member of each class. For example, it is clear that the property ‘contains a nucleus’ is one of the defining characteristics of eucaryotic cells. This view is reinforced by the statement “Eucaryotic cells, by definition and in contrast to procaryotic cells, have a nucleus” ([2], p.15). Although this is presented as a defining feature of eucaryotic cells, there are exceptions, as shown by the eucaryotic red blood cells: “erythrocytes (or red blood cells) are very small cells, usually with no nucleus” ([2], p.25). Another example is that the location of genetic material and associated processes is presented as a means of distinguishing between the two subclasses of ‘cell’. In procaryotic cells, the two stages of gene expression, Table 1: Comparison of Procaryotic and Eucaryotic Organisms ([2], Table 1-1, p.19) transcription and translation, occur in the same location: “in procaryotic cells, there is no compartmentalization - the translation of RNA sequences into protein begins as soon as they are transcribed” ([2], p.26). In eucaryotic cells, these processes are generally located in physically distinct regions of the cell: “in eucaryotes, however (except in mitochondria and chloroplasts, which in this respect as in others are closer to bacteria), the two steps in the path from gene to protein are kept strictly separate” ([2], p.26). Indeed, exceptions have been identified for most of the ‘defining’ characteristics listed in Table 1. For classes that have many atypical instances, we would typically give multiple partial definitions of the class rather than a single complete definition. A complete definition should only be given where we can identify a set of necessary and sufficient features that define a class. For many concepts this is not possible as no such set of properties exists. A generalised example is given below for a class c defined in terms of the properties p1,…, p6: (~x) p1(x) Y p2(x) Y p4(x) c(x) (~x) p1(x) Y p3(x) Y p5(x) c(x) (~x) p2(x) Y p5(x) Y p6(x) c(x) Note this is only a partial definition as we have only defined the sufficient cases. Adequate definitions in terms of necessary features cannot always be given for categories exhibiting gradience. The purpose of each sufficient case is to cover a set of instances of c and with all of the partial definitions we hope to cover all possible instances of the class. Although it is usually straightforward to give some partial definitions that cover a set of instances, it is more difficult to coordinate these definitions in order that they collectively cover all possible instances. For example, we may have the instance i about which we know the following: c(i) Y p1(i) Y p2(i) Y p3(i) Y p6(i) Although i is a member of the class c it would not, according to our definition of c, be included as such (although this additional information does not contradict the earlier definition since no necessary definition was given). The problem here of course lies with the definition but this is not always the case. If the generalised example seems to be an unrealistic strawman, consider giving a definition for the class ‘eurcaryotic cell’. Having a nucleus gives us the basis for one partial definition but as we know, this does not completely cover the instances of the class. It is unlikely that we would be able to give a set of meaningful partial definitions for the class that completely covered the instances of the class. For purely practical reasons, we would probably never have access to a full set of instances formally specified with their known properties. It is impossible to know a priori whether we have managed to exhaustively define the class. It seems that this uncertainty is something we must live with. We can identify two distinct issues involved in the representation of classes with many atypical instances: 1) there is no set of necessary properties that will enable us to determine that entities are definitely not members of a class e.g. knowing that a cell has a nucleus implies that the cell is eucaryotic. However, many eucaryotic cells do not have a nucleus. The property of not having a nucleus does not allow us to say that the cell is not eucaryotic. 2) many of the properties are generalisations and are therefore difficult to formalise, e.g. knowing that a cell have size between 10 and 100 )m gives a reason for believing that the cell is eucaryotic but does imply that the cell is eucaryotic. Cases of type (1) require the use of partial rather than complete definitions. Type (2) situations restrict the properties that we can include in the partial definitions. We cannot use the size of a cell as an implication of eucaryotic-hood since this only implies a probability of a cell being eucaryotic. Unless our chosen representation is expressive enough to allow us to represent probabilities (and most languages chosen to represent ontologies do not permit this), such properties can only be utilised when conjoined with other properties. For example, it if were the case the all cells that have a size between 10 and 100 )m and which exist naturally in a multicellular organisation are eucaryotic, we can utilise these two generalisations to give another¸partial definition. How then, should we represent such classes with due regard to the maximisation of verisimilitude? If there is no set of properties that are necessary features that all instances of the class exhibit, we must use one or more sufficient definitions. Given such a representation of a class, we then need to assess whether or not the set of the partial definitions covers all instances of the class. Since we may not have access to all instances of the class, domain knowledge must be used, in the form of a domain expert, to provide us with atypical instances to test against our definitions. 2.2 Overlapping Siblings It is a common problem in knowledge representation for an instance of a class to also be an instance of two or more of the immediate children of that class. In the classical view of category membership, this situation should not be possible as sibling classes are distinguished from their immediate parent and each other by some differentia and therefore will not overlap in this way. Here, it is not always possible to identify the differentia that distinguish the sibling classes. An entity that is an instance of two or more overlapping sibling classes will commonly be judged atypical of at least one of those classes. Descriptions of sibling classes that overlap commonly occur in biological literature. The problem for the ontology developer is to identify those subclasses to include and those to exclude. This kind of problem can perhaps be explained more clearly using an example. In [2], the subclasses of the class ‘remote signalling cell’ are described as follows: Three Strategies of Chemical Signaling: Endocrine, Paracrine, and Synaptic Figure 1: Possible Classifications of ‘neuroendocrine cell’ Chemical signaling mechanisms vary in the distances over which they operate: (1) In endocrine signaling, specialized endocrine cells secrete hormones, which travel through the bloodstream to influence target cells that are distributed widely throughout the body. (2) In paracrine signaling, cells secrete local chemical mediators, which are so rapidly taken up, destroyed, or immobilized that the mediators act only on cells in the immediate environment, perhaps within a millimeter or so. (3) In synaptic signaling, which is confined to the nervous system, cells secrete neurotransmitters at specialized junctions called chemical synapses; the neurotransmitter diffuses across the synaptic cleft, typically a distance of about 50 nm, and acts only on the adjacent postsynaptic target cell. [2], p.682) Suppose that on the basis of this description we make the (seemingly reasonable) decision to distinguish three subclasses of the class ‘remote signalling cell’ - i.e. ‘endocrine cell’, ‘paracrine cell’ and ‘nerve cell’. Subsequently, another type of remote signalling cell is identified: The linking function of the hypothalamus is mediated by cells that have properties of both nerve cells and endocrine cells; for this reason they are called neuroendocrine cells. ([2], p.683) The class ‘neuroendocrine cell’ can, according to the existing hierarchy, be included as both a subclass of‘endocrine cell’ and a subclass of ‘nerve cell’. Neuroendocrine cells secrete hormones into the bloodstream like endocrine cells and respond to synaptic stimulation like nerve cells, although they are typical of neither of the superclasses. On the surface, it is not at all clear how we should represent the further subclass of ‘neuroendocrine cell’. For the above example, four possible representations of the class ‘neuroendocrine cell’ can be identified, as shown in Fig. 1. In more detail, these are: (i) as a subclass of the class ‘endocrine cell’ i.e. as an atypical instance of an endocrine cell. (ii) as a subclass of the class ‘nerve cell’ i.e. as an atypical instance of a nerve cell. (iii) as a subclass of both the ‘endocrine cell’ and ‘nerve cell’ classes. (iv) as a new subclass of the ‘remote signalling cell’ class. Solutions (i) to (iii) share the notion that ‘neuroendocrine cell’ is a subclass of one or more of the three existing classes, whereas solution (iv) suggests that the relationship of ‘neuroendocrine cell’ to the existing classes is at most one of similarity. More generally, if we have the following definitions of the class c and its subclasses c1 and c2 in terms of properties p1,…, p4: (~x) c(x) c1(x) Z c2(x) Z … cn(x) (~x) c(x) Y p1(x) Y p2(x) c1(x) (~x) c(x) Y p3(x) Y p4(x) c2(x) It can be seen that we have given partial definitions for the classes c1 and c2. Now suppose that we have another subclass cn+1: c(x) Y p1(x) Y p2(x) Y p3(x) Y p4(x) cn+1(x)¸ That is, instances of the class cn+1 are also instances of both c1 and c2 and cn+1 can unproblematically be included as a subclass of both c1 and c2. This is the same situation that we encountered in the signalling cell example above and suggests that solution (iii) is the most accurate representation in that situation. However, such a representation would give us more problems than it solved. For example, there are various different kinds of neuroendocrine cell, some of which are more similar to endocrine cells than nerve cells and others for which the opposite applies. In such circumstances, solution (iii) will involve much overwriting of inherited properties. There is a fundamental problem with our initial representation of the three subclasses of ‘remote signalling cell’. Two of the subclasses (‘endocrine’ and ‘paracrine’) are functional descriptions in that they describe the way in which some cells act. These are therefore more properly represented as roles that certain cells play rather than classes of cell. Tests in these situations that help maximise verisimilitude would include the use of principles that help us to identify those predicates that are sortal i.e. that identify an entity as a kind of thing (examples of such principles can be found [8] and [20]). The use of such principles would help to avoid the development of ontologies in which concepts such as ‘endocrine cell’ that represent functional roles are defined as classes. Not all classificatory knowledge is ontological knowledge. 2.3 Context-sensitive Subclasses To some degree all classifications are dependent on the context in which they are made. In [3] it is suggested that categories are often formed on the basis of temporary goals. It is often difficult to know which concepts such be defined as classes in an ontology and how to define those classes that we choose to include in our ontology. The context-sensitivity of an ontology is discussed in [19], where it is suggested that the meaning of terms depends on the context in which a term appears and the context in which the term is used. They suggest that an ontology should only be specified as far as the context of use requires. As an example of context-sensitive membership from the biological domain, consider the classification of the kinds of chemical bonds that exist between biological molecules. There are two classes of chemical bond, covalent and non-covalent. The subclasses of the non-covalent bond class are described in the sentence: “The non-covalent bonds encountered in biological molecules are usually classified into three types: ionic bonds, hydrogen bonds, and van der Waals attractions” ([2], p.88). Sometimes, however, hydrophobic forces are included as another kind of non-covalent bond: “Another important weak force is created by the three-dimensional structure of water, which tends to force hydrophobic groups together” ([2], p.88). Although this force has features in common with the non-covalent bonds, it is not strictly a member of this category: “This expulsion from the aqueous solution generates what is sometimes thought of as a fourth kind of weak non-covalent bond.” ([2], p.88). This is a somewhat weaker assertion than stating that hydrophobic forces are atypical instances of non-covalent bonds as it is suggested that only under certain conditions are hydrophobic forces considered to be non-covalent bonds. Although it is impossible to exhaustively specify all relevant contexts, it is usually possible to identify those subclasses that should or should not be included as part of the definition of the superclass. Where the existence of a hierarchical relationship is dependent on the context, it is unlikely that it should be included in an ontology. It is more likely that a similarity is being drawn between two different ontological kinds as a didactic aid rather than as a description of the meaning of terms. It is important to be clearly distinguish between these two different kinds of explanation. The phrase “biological molecules are usually classified into three types” ([2], p.88) suggests that in the majority of contexts, hydrophobic forces are not members of the class ‘non-covalent bond’ and that the hierarchical relationship between non-covalent bond and hydrophobic force should not be included in a formal description of the domain. The description of hydrophobic forces as a kind of non-covalent bond seems to clearly be a didactic explanation, where an analogy is being drawn between hydrophobic groups and non-covalent bonds in order to highlight the similarities between the ways in which the different forces act. This distinction forms the basis of a test for maximising verisimilitude here: the context of the description of the subclasses is assessed in order to determine the purpose of the description. If it seems that the description is made in the context of a didactic explanation, we cannot assume that any statements made in the description will be relevant to an ontological representation of the domain. Generally, verisimilitude will be maximised by including those relationships that go towards defining the meaning of terms across the majority of contexts in which the ontology will be used. Although it is impossible to exhaustively specify all potential contexts, it is often possible to determine from a knowledge source which representation will maximise verisimilitude (and thereby also maximise the potential for reuse of the ontology). One caveat is that not all examples of context-sensitive subclasses will be as clear as the above description. This will be especially evident when considering non-scientific domains and each case needs to be judged individually. 2.4 Excluded Instances We have a case of an excluded instance when an instance of a class is not an instance of any of the subclasses of that class. For example, consider the class of small organic molecules, the subdivision of which is described in [2] as follows: “cells contain just four major families of small organic molecules: the simple sugars, the fatty acids, the amino acids, and the nucleotides” (p.43). It appears that the class ‘small organic molecule’ has four immediate¸ subclasses. However, this is not strictly the case as “some cellular compounds do not fit into these categories” ([2], p.43). In an ontology that includes only the above four subclasses of ‘small organic molecule’, any of these cellular compounds will be an excluded instance. As a generalised example of this problem, consider the following definitions of the classes c, c1 and c2 and the instance i in terms of the properties p1, …, p6: (~x) c(x) c1(x) Z c2(x) (~x) p1(x) Y p2(x) c1(x) (~x) p3(x) Y p4(x) c2(x) c(i) Y p5(i) Y p6(i) The instance i is excluded from the definitions of the subclasses. A simple solution to this problem is to define an additional class c3 along the lines of the following: (~x) p5(x) Y p6(x) c3(x) However, we have to question how general this solution is. It is not always feasible to extend the number of subclasses to cover for each instance of the superclass. For example, although their number is small compared to the four subclasses outlined above, there are very many other kinds of small organic molecule. Should the number of subclasses of ‘small organic molecule’ be extended to include each type or should we only include the four subclasses already identified? The solution that has been adopted for the above case is to specify a miscellaneous subclass of small organic molecule called ‘other small organic molecules’ and state in the ontology that the five subclasses exhaustively cover the instances of the class ‘small organic molecule’. This seems to be acceptable solution in this situation since it reflects how the domain is conceptualised by biologists. The class ‘other small organic molecules’ can straightforwardly be defined as those molecules that are an instance of the superclass ‘small organic molecule’ but are not an instance of any of the other four subclasses. The use of such ‘miscellaneous others’ subclasses is common in biological resources, e.g. the PDB [1] and SCOP [13] databases. Here it seems that there is no single general approach that will always produce an optimal solution. We can assess the description of the domain to determine whether the use of a ‘miscellaneous others’ subclass is appropriate or not. The test for verisimilitude here is whether the domain supports this notion or not. If not, then we cannot always expect to exhaustively specify the subclasses. Practical considerations may require that such a subclass is included -¸this is subject to the requirements of the ontology. If we consider that extensibility of our ontology is an important requirement, we may want to include a ‘miscellaneous other’ subclass in such situations as this will allow the representation of molecules that could not be included otherwise. This may facilitate the identification of further subclasses as more instances are represented in this way and it becomes possible to identify commonalities among them that may be generalised to classes. It is perhaps also worth noting that the ability to express the fact that a set of subclasses completely cover a common superclass depends upon the language that is used to represent the ontology. 2.5 Non-instance Similarity There is no completely objective way of deciding the relative importance of the infinitely many properties exhibited by any entity [11]. Two entities may be judged to be more or less similar to each other, depending on which properties are taken to be relevant to the comparison, the relative weighting given to the selected properties, and so forth. For example, following on from the information given in §2.1 regarding the way in which mitochondria and chloroplasts are similar to bacteria, it is clear that the former pair exhibit many features that are similar to procaryotic cells. Table 2 details the similarities between procaryotic¸cells and the organelles mitochondria and chloroplasts, based on some of the distinguishing features of procaryotic cells outlined in Table 1. These are the features that would, according to the prototype theory of categorisation, be the most important in deciding on membership of the category ‘procaryotic cells’. Although “mitochondria and chloroplasts show important differences from … present day aerobic bacteria and cyanobacteria” ([2], p.18) and would probably not be judged to be typical instances of the class ‘procaryotic cell’, based on the similarities described in Table 2 it is possible they would be included as atypical instances. However, mitochondria and chloroplasts are organelles and as such do not belong to the class ‘cell’. They should not be considered even as atypical members of the class of procaryotic cells. The similarities outlined in Table 2 can be explained in terms of biological theory, as “chloroplasts share a common ancestry with cyanobacteria and evolved from procaryotes that made their home inside eucaryotic cells” ([2], p.18). Mitochondria and chloroplasts are descended from procaryotes but are not classified as such: “Although they seem to have originated as symbiotic bacteria, they have undergone large evolutionary changes” ([2], p.18-19). This explanation of the classification of chloroplasts and mitochondria as organelles also explains the degree of similarity between these organelles and procaryotic cells. The identification of such cases will help prevent the inclusion of erroneous instances of classes in the definition of formal ontologies. We do not consider this to be a theoretical problem as the situation described may only arise as a result of mistakes made in the development of an ontology. It would be useful, however, to include solutions to common kinds of mistakes in a methodology for ontological engineering. 3. Knowledge Sharing Example We have stated that the solution to some of the problems will often be based on the requirements we have of the ontology. We now consider an example of the development of a fragment of an ontology where Table 2: Comparison of Procaryotic Cells with Mitochondria and Chloroplasts (after [2], Table 1-1, p.19) the requirement is to facilitate knowledge sharing between various data and knowledge sources. Given this, we would like to design our ontology such we can maximise the potential for accurate communication and minimise the communication of erroneous messages, regardless of the representation at the individual resources. The options available in the representation of the problem cases in the shared ontology are identified and assessed. The example we describe is based on the domain of remote cellular signalling, a brief introduction to which was given in §2.2, where the representation of the concept ‘neuroendocrine cell’ was discussed. The discussion in this section is based on solution (ii), where ‘neuroendocrine cell’ was defined as a subclass ‘nerve cell’. As described in the extract from [2] given in §2.2, there are three kinds of signalling molecules: hormones, neurotransmitters and local chemical mediators. Each of these molecules is associated with a particular type of remote signalling cell. Based on this, the following Ontolingua-style definitions can be given (we assume an existing definition of ‘signalling-molecule’): (Define-Class Remote-Signalling-Molecule (?RSM) “A molecule that is emitted by a remote signalling cell and acts at a distance on some other cell” :Def (Signalling-Molecule ?RSM) :Template-Slots (Emitted-By ?RSM ?RSC) :Template-Facets (Value-Type Emitted-By Remote-Signalling-Cell)) (Define-Class Hormone (?H) “A remote signalling molecule that is emitted by an endocrine cell” :Def (Remote-Signalling-Molecule ?H) :Axioms (<=> (Emitted-By ?H ?RSC) ¸ (Endocrine-Cell ?RSC))) (Define-Class Local-Chemical-Mediator (?LCM) “A remote signalling molecule that is emitted by a cell acting in the paracrine mode” :Def (Remote-Signalling-Molecule ?LCM) :Axioms (<=> (Emitted-By ?LCM ?RSC) ¸ (Paracrine-Cell ?RSC))) (Define-Class Neurotransmitter (?NT) “A remote signalling molecule that is emitted by a nerve cell” :Def (Remote-Signalling-Molecule ?NT) :Axioms (<=> (Emitted-By ?NT ?RSC) ¸ (Nerve-Cell ?RSC))) The class ‘neurotransmitter’ describes the signalling molecules released by nerve cells during synaptic signalling. However, nerve cells can also act in paracrine signalling mode: “In the paracrine mode the neurotransmitter functions as a local chemical mediator” ([2], p.682). Here we appear to have overlapping siblings, as there are molecules which are classified as both a kind of neurotransmitter and a kind of local chemical mediator. Although it is possible that local resources might share this representation (which makes it easier to define mappings between the shared ontology and the local ontologies), it is generally problematic to include such representations in the shared ontology. At the most basic level, some resources may have defined nerve cells as acting in neurotransmitter mode only (since this is the functionality most associated with nerve cells). Where local resources have different kinds of representation, we encounter problems in mapping between the shared ontology and some local resources. We need a shared ontology that allows us to define mapping functions to both kinds of representation. Testing the verisimilitude of our representation, we see that the definitions of subclasses of ‘remote signalling molecule’ are based on functional descriptions of molecules. We should therefore describe these as subclasses of a class such as ‘molecular signalling function’ which describes how such a signalling molecule might act and define this as a role of the class ‘molecule’ (we assume existing definitions of the classes ‘molecular function’ and ‘chemical-thing’): (Define-Class Molecule (?M) “A molecule is a set of atoms that are chemically bonded to each other; typically, by means of covalent bonds.” :Def (Chemical-thing ?M) :Template-Slots (Has-Function ?M ?F) :Template-Facets (Value-Type Has-Function Molecular-Signalling-Function)) (Define-Class Molecular-Signalling-Function (?MF) “A molecular signalling function describes one way in which a molecule can act as a signal to a cell.” :Def (Molecule-Function ?M)) (Define-Class Hormone (?H) “A remote signalling molecule that is emitted by an endocrine cell” :Def (Molecular-Signalling-Function ?H)) (Define-Class Local-Chemical-Mediator (?LCM) “A remote signalling molecule that is emitted by a cell acting in the paracrine mode” :Def (Molecular-Signalling-Function ?NT)) (Define-Class Neurotransmitter (?NT) “A remote signalling molecule that is emitted by a nerve cell” :Def (Molecular-Signalling-Function ?NT)) Here we include mainly the subclass relations. A fuller representation would involve defining classes for each method of cellular signalling and defining axioms for the subclasses of ‘molecular signalling function’ based on these. We can now map any representation of signalling molecule based on the original definition to our shared ontology by defining a mapping function from the local ontology classes ‘hormone’, ‘neurotransmitter’ and ‘local chemical mediator’ to the shared ontology class ‘molecule’ and giving the ‘HasFunction’ slot of the class ‘molecule’ the required value. A further complication is described in the sentence: “Many peptide hormones, for example, also act as neurotransmitters (in the paracrine mode) in the vertebrate brain” ([2], p.684). The obvious temptation here is to classify ‘peptide hormone’ as a subclass of ‘hormone’. However, since ‘hormone’ is a functional description, we instead choose to classify ‘peptide hormone’ as a subclass of ‘molecule’ and constrain the allowable values of ‘Has-Function’ to ‘hormone’ and ‘paracrine’. This does not however, express the notion that the molecule is acting as a neurotransmitter. Therefore we need to further elaborate our description of the domain to include (i) an attribute of the ‘molecular function’ class to indicate the locations where the molecule functions can occur (e.g. in nerve cells). (ii) add an axiom the defines a neurotransmitter as any molecule that has a functionality that can occur nerve cells. We now begin to see how careful consideration of the kinds of classes we can define allows us to develop more shareable ontologies. The problems that we encounter are caused by the temptation to give simplistic definitions of complex situations. Some of these complications are: (A) a metonymic relationship between the biological concepts ‘nerve-cell’ and ‘synaptic signalling cell’ which leads to the assumption that these are synonymous terms. Nerve cells can signal in both the synaptic and paracrine modes. (b) molecules having dual roles acting as, for example, hormones in some situations and as neurotransmitter. This knowledge is not supported by a definition of these two types of signalling molecules as disjoint classes. Complications such as these are usually evident in any reasonably complex domain. 4. Conclusions In this paper, we have shown that the formal representation of complex biological concepts is far from straightforward as they can rarely be specified in terms of necessary and sufficient features. Careful consideration of various alternative representations is required. We have identified several types of problem that can arise in the formal representation of hierarchical representations and have analysed the issues involved in selecting one representation above another. We believe that this analysis may prove to be of practical use in the definition of formal ontologies across different domains. We should note that some of the problem types identified (such as atypical instances) are familiar and have been the focus of substantial research. Other types (such as context-sensitive classification) are not so well documented, perhaps because they are less common. It is also worth pointing out that we do not believe that this list of problems and potential solutions is by any means exhaustive and also that they will be subject to further refinement and additional problem types and examples. The acquisition and representation of ontological knowledge need to take into account the complexity that is often present in domains. Principled techniques that allow the ontological engineer to deal with the problems caused by such complexity need to be developed. Fortunately, there has recently been much interest in the principled development of formal ontologies (summarised in [9]). However, the initial emphasis has necessarily been on the management of the overall process rather than the fine grain details. As more ontologies are developed, we will become aware of more of the general problems that lie in the details and we have argued elsewhere a detailed analysis of the domain is necessary in order to overcome these [16]. It is also worth remembering that knowledge in scientific domains is continually evolving. Under such circumstances, it is impossible to define standard ‘objective’ representations of a domain as there does not exist a consensus opinion amongst the practitioners in the domain. It has been argued [14] that the lack of ontology stationarity is an impediment in the use of formal ontologies for knowledge sharing. If this practice is to become widespread in evolving domains, it will be necessary to develop methods of evaluating different representations of the same domain [6, 21]. This will allow user of an ontology to select the most suitable for their needs from those available. References [1] E.E. Abola, J.L. Sussman, J. Prilusky and N.O. Manning, Protein data bank archives of three-dimensional macromolecular structures, in: C.W. Carter Jr and R.M. Sweet, eds., Methods in Enzymology, Vol. 277,¸ (Academic Press, San Diego, 1997) 556-571. [2] B. Alberts, D. Bray, J. Lewis, M. Raff, K. Roberts and J.D. Watson, Molecular Biology of the Cell (Garland, New York, 1989). [3] L.W. Barsalou, Ad hoc categories, Memory and Cognition 11 (1983) 211-227. [4] J. Bouaud, B. Bachimont, J. Charlet and P. Zweigenbaum, Acquisition and structuring of an ontology within conceptual graphs, in: Proc. ICCS'94 Workshop on Knowledge Acquisition using Conceptual Graph Theory (University of Maryland, 1994) 1-25. [5] M. Fernández, A. Gómez-Pérez and N. Juristo, METHONTOLOGY: From ontological art towards ontological engineering, AAAI-97 Spring Symposium on Ontological Engineering, Stanford University, 1997. [6] N. Fridman Noy and C.D. Hafner, The state of the art in ontology design: A survey and comparative review, AI Magazine 18(3) (1997) 53-74. [7] T.R. Gruber, Towards Principles for the design of ontologies used for knowledge sharing, Int. J. Human-Computer Studies 43(5/6) (1995) 907-928. [8] N. Guarino, M. Carrara and P. Giaretta, An ontology of meta-level categories, in: D.J.E. Sandewall and P. Torasso, eds., Proc. KR’94 (Morgan Kaufmann, San Mateo, CA, 1994) 270-280. [9] D.M. Jones, T.J.M. Bench-Capon and P.R.S. Visser, Methodologies for ontology development, in: Proc. IT&KNOWS Conference of the 15th IFIP World Computer Congress (Chapman-Hall, 1998) 20-35. [10] D.M. Jones and R.C Paton, Acquisition of conceptual structure in scientific theories, in: Proc. 10th EKAW-97 - European Knowledge Acquisition Workshop (Springer-Verlag, Berlin, 1997) 145-158. [11] D.L. Medin, Concepts and conceptual structure, Am. Psych. 44(12) (1989) 1469-1481. [12] M. Minsky, A framework for representing knowledge, in: P.H. Winston, ed., The Psychology of Computer Vision (McGraw-Hill, New York, 1975) 211-277. [13] A.G. Murzin, S.E. Brenner, T. Hubbard and C. Chothia, SCOP: A structural classification of proteins database for the investigation of sequences and structures, J. Mol. Biol. 247 (1995)¸ 536-540. [14] D.E. O’Leary, Impediments in the use of explicit ontologies for KBS development, Int. J. HumanComputer Studies 46 (1997) 327-337. [15] R.C. Paton, Understanding biosystem organisation I: Verbal relations”, Int. J. Sci.Ed. 15(4) (1993) 395410. [16] R.C. Paton, S.C. Lynch, D.M. Jones, H.S. Nwana, T.J.M. Bench-Capon and M.J.R. Shave, Domain characterisation for knowledge based systems, in: Proc. A.I. 94 - Fourteenth International Avignon Conference Vol. 1 (1994) 41-54. [17] K.R. Popper, Realism and the Aim of Science (Hutchinson, London., 1983). [18] E. Rosch, Cognitive representation of semantic categories, J. Experimental Psychology 104 (1975) 192-233. [19] G. Steve, A. Gangemi and D.M. Pisanelli, Integrating medical terminologies with ONIONS methodology, in: H. Kangassalo and J.P. Charrel, eds., Information Modelling and Knowledge Bases VIII (IOS Press, Amsterdam, 1997). [20] P.F. Strawson, Individuals: An essay in descriptive metaphysics (Routledge, London, 1959). [21] P.R.S Visser and T.J.M. Bench-Capon, A comparison of four ontologies for the design of legal knowledge systems, AI and Law 6 (1999) 27-57.
© Copyright 2026 Paperzz