Plausible Inferencing Using Extended Composition - cse.sc.edu

6 downloads 0 Views 100KB Size Report
SpareTire. Car. T. E. Fig. 1. Three examples of the composition of two semantic relations. A composite relation results from applying the binary operation of ...
Plausible Inferencing Using Extended Composition Michael N. Huhns and Larry M. Stephens Department of Computer Science and Engineering University of South Carolina Columbia, SC 29208 USA {huhns,stephens}@sc.edu

Abstract. This paper considers the composition of tuples from two relations in order to derive additional tuples of one of these relations. Our purpose is to determine when the composition is plausible and for which relation the new tuples are derived. We first present a formal definition of composition and our extension to it. We next define conditions on the domains and ranges of the relations that are necessary for extended composition to occur. We then show how a set of underlying attributes, independently specified for each relation, is sufficient for determining plausible composition, when the primitives are combined according to an algebra. Finally, we apply our method for extended composition to a representative group of semantic relations and evaluate the results.

1 Introduction The construction of a large knowledge base is difficult and requires techniques that can facilitate knowledge acquisition. Rather than requiring that all knowledge in the base be entered explicitly, a system could be provided with a basic set of facts and an inference mechanism for inferring additional facts from these [Baker et al., 1987]. An ideal system would be able to generate all valid and no invalid inferences. One way to approach this ideal is to provide a set of specialized inference procedures that collectively generate a valid set of inferences. In this paper we develop one such procedure, based on an extended composition of semantic relations from a knowledge base. Figure 1 contains examples of this type of composition. The procedure has the effect of constructing new inference rules, which, when executed, generate extensions to the knowledge base.

2 Extended Composition A binary relation R consists of a set A (the domain), a set B (the range), and a mapping that specifies the set of tuples ha, bi belonging to R, where a ∈ A and b ∈ B. The mapping may be explicit by listing all the tuples in R or implicit by providing rules for selecting the tuples. In a large frame-based knowledge system, such as CYC [Lenat et al., 1986,Lenat and Guha, 1988], the mapping for a relation is only partially specified; other tuples for the relation are added as knowledge is entered. The procedure for composing relations outlined in this paper provides a means of inferring additional tuples belonging to an implicitly defined relation.

ownedBy

Car

6

partOf

¡

- Grover ¡ µ ¡

¡ownedBy

Car containedIn

¡ Wheel

ownedBy

- Grover ¡ µ 6 ¡ ¡neither ¡ ¡

Ernie

Trunk

partOf

containedIn

¡

¡ SpareTire

Fig. 1. Three examples of the composition of two semantic relations

A composite relation results from applying the binary operation of composition to two binary relations. This operation has the following definition [Stanat and McAllister, 1977]: Definition 1. Let Ri be a relation from set A to set B and Rj be a relation from set B to set C. The composite relation from A to C, denoted Ri · Rj , is Ri · Rj = {ha, ci | a ∈ A ∧ c ∈ C ∧ ∃b[b ∈ B ∧ ha, bi ∈ Ri ∧ hb, ci ∈ Rj ]} We define extended composition as follows: Definition 2. Let Ri be a relation from set A to set B and Rj be a relation from set C to set D. The extended composite relation from A to D, denoted Ri ¯ Rj , is Ri ¯ Rj = {ha, di | a ∈ A ∧ d ∈ D ∧ ∃b[b ∈ B ∧ b ∈ C ∧ ha, bi ∈ Ri ∧ hb, di ∈ Rj ]} If we denote the converse relation of R by Rc , then it can be shown that (Ri ¯ Rj ⊂ Rk ) ⇔ (Rjc ¯ Ric ⊂ Rkc ) Extended composition can also be shown to be associative and not commutative. We would like to have an algorithmic way of determining when Ri ¯Rj is nonempty and whether it is a subset of Ri or Rj or neither. Our method for making this determination is based on two premises: – the domains and ranges of the two relations must be type-compatible, and – the primitives (defined below) of the relations must combine compatibly. If the first premise is satisfied by relations Ri and Rj , then the primitives of the two relations can be combined to yield the primitives of the composed relation, Ri ¯ Rj . The primitives of Ri ¯ Rj can then be compared to those of Ri and Rj to determine if Ri ¯ Rj is a subset of Ri , Rj , both, or neither. The type compatibility specified by the first premise results in the following necessary conditions for the extended composition of relations: 1. The intersection of sets B and C must be nonempty; otherwise, the relation Ri ¯ Rj will be empty.

- Car ¡ µ ¡ ¡containedIn

6

'$ '$ '$ '$ C B - D '$ 6Rj ¢¸ ¢&% B &% &% '$ Rj - D ¢ * &% ¢ ©© 6 © Ri ¢ Ri C Ri © &% '$ '$ Rj ¢ ©© &% A

A

&%

&%

Fig. 2. Type requirements on the domains and ranges of Ri and Rj

2. For the derived tuples to be elements of Ri , the intersection of sets B and D must be nonempty. 3. For the derived tuples to be elements of Rj , the intersection of sets A and C must be nonempty. These conditions, represented using Venn diagrams in Figure 2, eliminate many of the possibilities for extended composition. An algebra based on primitives of the relations eliminates additional implausible compositions.

3 Primitives for Semantic Relations The second premise above requires a set of primitives that describe each relation and a set of rules for combining primitives. We have postulated a group of ten primitives, based on a literature survey [Chaffin and Herrmann, 1987, Cohen and Loiselle, 1988, Wierzbicka, 1984, Winston et al., 1987] and an analysis of numerous semantic relations in the CYC knowledge base [Lenat and Guha, 1988]. These primitives are independently determinable for each relation and relatively self-explanatory. They specify a relationship between an element of the domain and an element of the range of the semantic relation being described. The primitives, described next, have values from the set X = {+, 0, −}, where + indicates that the relationship holds, − that it does not, and 0 that it is not applicable. Composable: Some semantic relations can never be meaningfully composed with other relations due to their fundamental characteristics. For example, attributes are not generally transferable through other relations. Functional: The domain of a Functional relation is in a specific spatial or temporal position with respect to the range of the relation. For example, in an instance of the componentOf relation, such as Wheel.componentOf.Car, the Wheel is in a specific spatial position with respect to the Car. This property does not hold for Juror.memberOf.Jury. Homeomerous: In each instance of a Homeomerous relation, the element of the domain must be the same kind of thing as the element of the range, e.g., in PieSlice.pieceOf.Pie, the slice is the same stuff as the pie.

Separable: The domain of a Separable relation can be temporally or spatially separated from the range, and can thus exist independently of the range. For the above componentOf example, the Wheel can be separated from the Car and can exist independently. For Wheel.madeOf.Aluminum, the Aluminum cannot be separated from the Wheel if the Wheel is still to exist. Structural: The domain and range of a Structural relation have a hierarchical relationship in terms of a physical structure. For example, in Wheel.componentOf.Car, the hierarchical structure is from part to whole and the Structural property of componentOf has a − value. Temporal: The domain and range of a Temporal relation are ordered in regard to a temporal structure. For example, there is no notion of time in the relation pieceOf, indicated by a value of 0 for Temporal; in causedBy, a value of − indicates that the range element precedes the domain element. Intangible: The domain and range of an Intangible relation have a hierarchical relationship in terms of ownership or mental inclusion. As an example, the relation ownedBy has a value of − for Intangible, because the element owned is intangibly included in the owner’s sphere of influence. (Note: values of the last three primitives for the converse of a relation are opposite to those for the forward relation.) Near: The domain of a relation with property Near is physically or temporally close to the range. Connected: The domain of a relation with property Connected is physically or temporally connected to the range. A connection, which may be indirect, is indicated by +; no connection is denoted by −. Intrinsic: A semantic relation has the property Intrinsic if the relation is an attribute of the stufflike nature of its domain or range. For example, the relation hasDensity is an intrinsic property of its domain, so that if Aluminum.hasDensity.5, then every piece of Aluminum inherits this value for its density. To test our hypotheses, we have selected a representative set of relations (Table 1), including part-whole, subclass, ownership, causal, and attribution relations. For each of these relations, Table 2 shows the values we have assigned to the above primitives. The domains and ranges of the relations, shown in Table 1, are also needed to determine plausibility.

4 Algebra of Relation Primitives We assume that the results of composing two semantic relations can be determined from the results of combining their ten relation primitives (the accuracy of this assumption is evaluated below) as follows: Ri ¯ Rj ≡ VRi ◦ VRj

(1)

where VR ∈ X 10 , X = {+, 0, −}, and ◦ is the combination operator. That is, for the purposes of relation composition, each relation can be represented solely by a vector of

Table 1. Domains and Ranges for Semantic Relations Relation Name a. componentOf b. memberOf c. pieceOf d. constituentOf e. subeventOf f. subregionOf g. subprocessOf h. subsequenceOf i. purposeOf j. causedBy k. producedBy l. ownedBy m. focusOf n. connectionOf o. attributeOf p. containedIn r. subfieldOf s. hasMechanisms t. isA u. weightOf v. densityOf

Domain IndividualObj1 Thing Stuff Stuff Event SpatialObject Process Sequence Event Event IndividualObj IndividualObj IntangibleObj IndividualObj Attribute IndividualObj IntangibleStuff Event Collection Number Number

Range IndividualObj Collection Stuff IndividualObj Event SpatialObject Process Sequence Agent Event 1 Object. 2 Tangible&IntangibleObject. Process Tng&IntngObj2 IntangibleObj IndividualObj Thing SpatialObject IntangibleStuff Event Collection TangibleObject TangibleObject

Table 2. Primitives for Semantic Relations Relation Name

Relation Primitives Compos. Func. Homeo. Sep. Struct. Temp. Intang. Near Conn. Intrin. a. componentOf + + − + − 0 − + + − b. memberOf − − − + − 0 − 0 − − c. pieceOf + − + + − 0 − + + + d. constituentOf + − − − − 0 − + + + e. subeventOf + + − − 0 − − + + + f. subregionOf + − + − − 0 − + + + g. subprocessOf + + + − 0 − − + + + h. subsequenceOf + + + + − − − + + + i. purposeOf + − − + 0 − 0 0 − − j. causedBy + + − + 0 − 0 0 +/− + k. producedBy + + − + 0 − 0 0 − − l. ownedBy + − − + 0 0 − 0 +/− + m. focusOf + + − − 0 0 − + + − n. connectionOf + + − + − 0 − + + − o. attributeOf − − − 0 0 0 − 0 − − p. containedIn + − − + − 0 0 + − + r. subfieldOf + − + − 0 0 − + + − s. hasMechanisms + + − + 0 − 0 0 +/− − t. isA + − + + − 0 − 0 − + u. weightOf − − − 0 0 0 − 0 − − v. densityOf − − − 0 0 0 − 0 − +

Table 3. Operation Tables for Combining Relation Primitives Composable Rj Ri − 0 + − P 0 P 0 0 0 0 + P 0 +

Functional Rj Ri − 0 + − +/− 0 − 0 0 0 0 + − 0 +

Homeomerous Rj Ri − 0 + − − 0 − 0 0 0 0 + − 0 +

Separable Rj Ri − 0 + − − 0 + 0 0 0 0 + + 0 +

Structural Rj Ri − 0 + − − 0 P 0 0 0 0 + P 0 +

Temporal Rj Ri − 0 + − − 0 P 0 0 0 0 + P 0 +

Intangible Rj Ri − 0 + − − 0 P 0 0 0 0 + P 0 +

Near, Connected Rj Ri − 0 + − +/− 0 − 0 0 0 0 + − 0 +

Ri − 0 +

Intrinsic Rj − 0 + − 0 +/− 0 0 0 +/− 0 +

Note: +/− indicates that the relations compose, but that this primitive does not constrain the composition. P denotes prohibited, indicating that the relations do not compose.

values for its ten relation primitives. It thus becomes necessary to define precisely how two of these vectors combine. We assume that the primitives are orthogonal and form a linear basis for the set of relations. The combination operator ◦ can thus be defined in terms of a separate operation table for each primitive, as shown in Table 3. Each operation table is symmetric and has been derived from empirically determined rules for relation composition, such as the following: – In order to compose, two relations must have the same hierarchical direction for their Structural, Temporal, and Intangible primitives. – If Ri has the property Connected and Rj does not, then Ri ¯ Rj (and Rj ¯ Ri ) cannot have the property Connected. Therefore, Ri ¯ Rj (and Rj ¯ Ri ) is not a subset of Ri . – If Ri has the property Separable and Rj does not, then Ri ¯ Rj (and Rj ¯ Ri ) has the property Separable. Therefore, Ri ¯ Rj (and Rj ¯ Ri ) may be a subset of Ri . The resultant algebra enables the primitives of the composed relation to be derived. If these derived primitives match the primitives of one (or both) of the composing relations, then a tuple of one (or both) of these can be instantiated; else, the knowledge base can be searched to find all relations that match the resultant primitives, and, if not already instantiated, these can be presented to a user as potential new tuples for the knowledge base.

As an example of this inference procedure, assume that a user has entered the assertions Wheel.componentOf.Car and Car.ownedBy.Grover. Combining the primitives from Table 1 for componentOf and ownedBy according to the combining rules in Table 3 yields the following vector of primitives for the resultant relation: VR = (+ − − + 0 0 − 0 + / − +/−). Because this vector matches the primitives of ownedBy and does not match those of componentOf, the inference is that Wheel.ownedBy.Grover. The plausibility of this result is checked by comparing the types of the domain and range of this relation instance with the types specified for ownedBy in Table 2. To do this, a taxonomy of types is needed that enables the intersection of domains and ranges to be determined. Such a taxonomy is typically part of frame-based knowledgerepresentation systems. The types used for our examples are from the CYC ontology [Lenat and Guha, 1988]. Using this ontology and Table 2, we find that Wheel is an instance of IndividualObject, Grover is an instance of Tangible&IntangibleObject, and these match the domain and range of ownedBy. The resultant inference is thus deemed plausible.

5 Results The above inference procedure was applied to the set of relations shown in Tables 1 and 2. The results, in the form of a composition matrix, are shown in Table 4. Each entry in Table 4 is equivalent to a rule of the form ∀x ∈ domain(Ri ) ∀y ∈ [range(Ri ) ∩ domain(Rj )] ∀z ∈ range(Rj ) [x.Ri .y ∧ y.Rj .z → x.(Ri ¯ Rj ).z]

(2)

The results reflect the order of composition, e.g., Rj ¯Ri as well as Ri ¯Rj , which was not addressed in either [Cohen and Loiselle, 1988] or [Winston et al., 1987]. Because each of the operators for combining primitives is symmetric, the composition matrix is nearly symmetric. The only exceptions result from type compatibility, which sometimes excludes a composition from occurring. For example, f ¯ l ⊂ l, but l ¯ f = ∅, because the intersection of the range of l with the domain of f is empty. The following are specific examples of plausible inferences predicted by the extended composition of relations (where → denotes logical implication): – a¯p⊂p Tire.componentOf.Car ∧ Car.containedIn.Garage → Tire.containedIn.Garage – p¯a⊂p Refrigerator.containedIn.Kitchen ∧ Kitchen.componentOf.House → Refrigerator.containedIn.House – i¯e⊂i Thunder.causedBy.Lightning ∧ Lightning.subeventOf.ThunderStorm → Thunder.causedBy.ThunderStorm

Table 4. Composition Matrix for Ri ¯ Rj Rj Ri a b c d e f g h i j k l m n o p r s t u v a a - - - - - - a - - - l - an - p - - - - b - - - - - - - -- - - - - - - -- - -- c - - c - - c - c- - - l - - -p- - -- d - - -d - d - -- - - l - - -p- - -- e - - - - e - e - i j k lm - - -- s -- f - - cd - f - -- - - l - - -p- - -- g - - - - e - g - i j k lm - - -r s -- h a - c - - - - hi j k l - n -p- - -- i - - - - i - i i i i i - - - - -- i -- j - - - - j - j j i j jk - - - - - - js - - k - - - - k - k k i jk k - - - - - - ks - - l l - l l l - l l - - - l - l - -- - -- m - - - -m-m-- - - lm - - -- - -- n an - - - - - - n - - - l - n - p - - - - o - - - - - - - -- - - - - - - -- - -- p p -pp - p - p- - - - - p -p- - -- r - - - - - - r -- - - l - - - -r - -- s - - - - s - s - i js ks - - - - - - s - - t - - - - - - - -- - - - - - - -- - t - u - - - - - - - -- - - - - - - -- - -- v - - - - - - - -- - - - - - - -- - -- Note: the letters in this matrix refer to the relations listed in Tables 1 and 2.

The technique for relation composition also correctly predicts when neither of the composed relations can be inferred. For example – p¯l =∅ Grover.containedIn.Car ∧ Car.ownedBy.Ernie 6→ (Grover.containedIn.Ernie ∨ Grover.ownedBy.Ernie).

6 Discussion and Conclusions The inference procedure and results presented in this paper extend the work of previous researchers. Chaffin and Herrmann [1987] identify a set of relation elements (relation primitives) that can be used to describe and classify relations. Each relation element is a fundamental property that holds between the domain and range of the relation. Winston et al. [1987] define three independent relation elements, inclusion, connection, and similarity; these are used to describe spatial inclusion, meronymic inclusion, and class inclusion. When any inclusion relation is combined with another, they find that a valid inference can be made and that the resultant relation is the one having the fewest relation elements. In addition, Winston et al. identify three dependent elements of connection that explain the transitivity, but not the composability, of six meronymic relations. Cohen and Loiselle [1988] identify two deep structures for relations: hierarchical and temporal, each having a direction. Each relation is hierarchical, temporal, or both. When two relations are composed, the resultant relation may have any of several possible deep structures, depending on the properties of the composing relations. They found that inferences are most plausible when either the hierarchical or temporal directions of the two composing relations are the same as that in the composed relation. Like Winston et al., they do not consider type consistency in composing relations. We extend the research efforts cited above by basing relation composition on set theory. On this basis, we conclude that typing of the domain and range elements may restrict composition, independently of any relation attribute restrictions. In addition, we extend the work of [Winston et al., 1987] by explicitly considering the hierarchical nature of the inclusion relations, as suggested by [Cohen and Loiselle, 1988]. This leads to a means of defining the primitive attributes of the converse of a relation and, consequently, of composing a converse with other relations. We provide a vector of ten primitives for each of 21 typical relations. This vector representation provides a more powerful basis for ranking and classifying relations than does the linear ordering in [Winston et al., 1987]. Since there are three possible values for each of the ten primitives, our representation provides for 310 = 59, 049 different basis vectors that can be used to represent relations. The number of relations that could be represented is actually much greater because of the large number of types that could be chosen for the domains and ranges. The inference procedure we developed for relation composition is based on several assumptions. The foremost of these is that relation composition is equivalent to a combination of the corresponding vector of primitives. The correctness of this assumption is borne out by the plausibility of the predicted inferences, shown in Table 4. A second

assumption is that each relation primitive is orthogonal to the others. This simplifying assumption greatly increases the efficiency of the inference procedure by yielding operation tables (see Table 3) that are independent of each other. Although the validity of the results supports this assumption also, there is some evidence that the chosen primitives are NOT orthogonal. For example, the primitives Connected, Homeomerous, and Intrinsic combine dependently according to the following rule to yield compositions with attribute relations not predicted by our algebra: (attributeOf .Intrinsic.+) ∧ (Rj .Connected.+) ∧ (Rj .Homeomerous.+) → (attributeOf ¯ Rj ⊂ attributeOf ) Such a rule would yield the valid inference densityOf ¯ pieceOf ⊂ densityOf, which does not result from our relation algebra. It could be applied after extended composition and viewed as an additional inference mechanism. Other valid inferences are missing from Table 4, including memberOf ¯ isA ⊂ memberOf and componentOf ¯ attributeOf ⊂ attributeOf. However, we feel that these omissions do not diminish the utility of our results, in that our procedure is designed for correctness instead of completeness. In addition, many knowledge-based systems have other inference mechanisms that could generate these missing inferences. For example, an automatic classifier [Lipkis, 1981] would generate the inference memberOf ¯ isA ⊂ memberOf. The potential for generating new inferences in a large knowledge base, such as the one in CYC, is enormous. CYC, currently with >4000 relations, could have approximately sixteen million possible compositions. Of these, 20% are predicted to be plausible, based on the percentage of valid entries in Table 4. For all possible values of relation primitives, no more than 31% could be composed validly due to prohibited entries in the operation tables for combining primitives. The one million assertions now in the CYC knowledge base can be combined using the predicted compositions to yield many new inferences. However, there are two major problems with extended composition. First, reason maintenance for the resultant inferences is computationally problematic, because the inferences depend not only on the relations being composed, but also on the relation primitives for all of the relations involved. Second, assigning values for the relation primitives is conceptually problematic. The values are subjective and must be entered manually for each relation in a knowledge base. The validity of the inferences generated by extended composition are directly dependent on these values. Nevertheless, we expect that the relation primitives can be used for classifying relations, as well as generating new inferences, and for suggesting plausible analogies. The procedure for extended composition appears to be a viable technique for increasing the information in an existing knowledge base. Because the procedure has the effect of generating new inference rules and then applying them, it yields plausible inferences that are not within the deductive closure of the original knowledge base.

References [Baker et al., 1987] M. Baker, M. H. Burstein, and A. M. Collins. Implementing a Model of Human Plausible Reasoning. In Proceedings of the Tenth IJCAI, Milan, Italy, August 1987, pp. 185–188. [Chaffin and Herrmann, 1987] R. Chaffin and D. Herrmann. Relation Element Theory: A New Account of the Representation and Processing of Semantic Relations. In D. Gorfein and R. Hoffman, eds., Memory and Learning: The Ebbinghaus Centennial Conference, Lawrence Erlbaum, Hillsdale, NJ, 1987. [Cohen and Loiselle, 1988] P. R. Cohen and C. L. Loiselle. Beyond ISA: Structures for Plausible Inference in Semantic Networks. In Proceedings AAAI88, St. Paul, MN, August 1988, pp. 415–420. [Lenat et al., 1986] D. B. Lenat, M. Prakash, and M. Shepherd. CYC: Using Common Sense Knowledge to Overcome Brittleness and Knowledge Acquisition Bottlenecks. AI Magazine, vol. 6, no. 4, Winter 1986, pp. 65–85. [Lenat and Guha, 1988] D. Lenat and R. V. Guha. The World According to CYC. MCC Technical Report No. ACA-AI-300-88, MCC, Austin, TX, September 1988. [Lipkis, 1981] T. Lipkis. A KL-ONE Classifier. In Proceedings of the 1981 KL-ONE Workshop, J. G. Schmolze and R. J. Brachman, eds., BBN Report No. 4842, Bolt Beranek and Newman Inc., June 1982, pp. 128–145. [Stanat and McAllister, 1977] D. F. Stanat and D. F. McAllister. Discrete Mathematics in Computer Science, Prentice-Hall, Inc., Englewood Cliffs, NJ, 1977. [Wierzbicka, 1984] A. Wierzbicka. apples are not a ‘kind of fruit’: the semantics of human categorization. American Ethnologist, vol. 11, 1984, pp. 313–328. [Winston et al., 1987] M. E. Winston, R. Chaffin, and D. Herrmann. A Taxonomy of Part-Whole Relations. Cognitive Science, vol. 11, 1987, pp. 417–444.

Suggest Documents