Symbolic objects: order structure and pyramidal clustering. P. Brito. ENST, Dept. lnlormatique, 46 rue Barrault, F-75634 Paris Cedex 13, France and. INRIA ...
r.~nnals of Operations Research 55( 1995)277-297
277
Symbolic objects: order structure and pyramidal clustering P. Brito
ENST, Dept. lnlormatique, 46 rue Barrault, F-75634 Paris Cedex 13, France and INRIA, Projet CLOREC, Rocquencourt. F-78153 Le Chesnay Cedex, France
We recall a formalism based on the notion of symbohc object IDiday [15], Brito and Diday [811, which allows to generalize the classical tabular model of Data Analysis. We study assertion objects, a particular class of symbolic objects which is endowed with a partial order and a quasi-order. Operations are then defined on symbohc objects. We study the property of completeness, already considered in Bnto and Diday [8], which expresses the duality extension intension. We formalize this notion in the framework of the theory of Galois connections and stud) the order structure of complete assertion objects. We introduce the notion of c-connection, as being a pair of mappings (f, g) between two partially ordered sets which should fulfil given conditions. A complete assertion object is then defined as a fixed point of the composed/-o g: this mapping is called a "'completeness operator" for it "'completes" a given assertion object. The set of complete assertion objects forms a lamce and we state how suprema and infima are obtained. The lattice structure being too complex to allow a clustering study of a data set. we have proposed a pyramidal clustering approach [8]. The symbolic pyramidal clustering method builds a pyramid bottomup, each cluster being de~ribed by a complete assertion object whose extension is the cluster itself. We thus obtain an inheritance structure on the data set. The inheritance structure then leads to the generation of rules.
!.
Introduction
T h e f o r m a l i s m o f s y m b o l i c objects [8, 15] has been i n t r o d u c e d to extend data analysis to d a t a d e s c r i b e d by intension, in the f o r m o f c o n j u n c t i o n s o f properties. This f o r m a l i s m generalizes the t a b u l a r m o d e l o f D a t a Analysis, allowing to represent m u l t i - v a l u e d d a t a as well as c o m p l e x a n d s t r u c t u r e d data. M o r e o v e r , it allows to take into a c c o u n t the b a c k g r o u n d k n o w l e d g e which expresses d o m a i n semantics. F o c u s is m a d e o n the n o t i o n o f extension. Several a u t h o r s have addressed the p r o b l e m o f k n o w l e d g e r e p r e s e n t a t i o n in this context, see for instance, Michalski [30, 31, 33], Sidi [41], Stoffel [43]. T h e analysis o f s y m b o l i c d a t a led to a new a p p r o a c h o f D a t a Analysis, which has been called S y m b o l i c D a t a Analysis: new objects are to be processed, which can no longer be r e p r e s e n t e d by vectors o f Re; new criteria are considered, mainly based
' J.C. Baltzer AG. Science Publishers
278
P. Brito/Symbolic objects
on the duality intension/extension of a symbolic object and on generalization, specialization techniques. On the other hand, output results should be expressed in the same formalism used to represent input data. Several works have been developed in Symbolic Data Analysis, see for instance: Diday [18], Brito and Diday [8], De Carvalho [9], Jacq [28], Sebag and Schoenauer [40], Lebbe [29] and Vignes [44] for applications in biology and medicine. In clustering, the approach that aims at obtaining interpreted clusters is known as conceptual clustering. The problem of looking simultaneously for the clusters and their representation has been set out in Diday [11]: several kinds of representation have been used (see Diday and Simon [21]): in Diday [12] a logical representation of clusters is proposed. In the eighties, several methods of con. ceptual clustering have been developed, we may cite Michalski [32], Diday et al, [19], Sidi [41], Michalski and Stepp [35--37], Michalski et al. [34], Stepp and Michalski [42], Fisher [25, 26], and Diday and Roy [20]. More recently, the work of Decaestecker [10] and Bisson [6] is also following in this direction. We start by recalling the formalism of knowledge representation, based on thc notion of symbolic object and we focus on assertion objects, which may be delined as conjunctions of disjunctions on the values taken by the variables. The notion of extension is defined, as the set of objects verifying a given symbolic object. We define a partial order relation (generalization/:specialization order) and a quasi-order on the set of assertion objects. Operations of union and intersection, which will be needed in the sequel, are also defined. We then focus on the property of completeness, which expresses the duality intension,cxtcnsion, this is the criterium that will be considered in the presented conceptual clustering method, An assertion object is said to be complete if it describes exhaustively its extension, and if it is minimal, for the symbolic order, to fultil this condition. This notion is formalized by introducing two m a p p i n g s / and g, which should fulfil certain conditions, f being the mapping that, given an assertion object, determines its extension, and g a mapping that, given a set of obscrvcd objects, determines an assertion object that generalizes it (that is, whose extension contains the initial set). We come close to the theory of Galois connections (see Birkhoff [5], Barbut and Monjardet [2]). The set of complete assertion objects presents a lattice structure, and we state how suprema and infima are obtained. This approach is related to the work of Wille [45], for the binary case, already proposed in Barbut and Monjardet [2]. The lattice structure being too complex to allow for a clustering analysis of a data set, we have developed an approach based on the pyramidal model (Brito and Diday [8]), Generalizing the hierarchical model, for it allows the presence of nondisjoint clusters, pyramidal clustering yields a structure which is much simpler than lattices, since each cluster is an interval of an order 0 (as a consequence, pyramids present no crossing on their graphical representation). We present an algorithm of symbolic pyramidal clustering. It allows to cluster elementary objects, representing the lines of a classical data array, but also assertion objects.
P. Brito/ Svmbolic objects
279
Completeness is the main criterium guiding the algorithm: we build a pyramid whose clusters are described by a complete assertion object, the extension of which should be the cluster itself. The method yields an inheritance pyramid, the clusters being automatically interpreted by conjunctions of properties on the variables. We briefly compare the pyramidal symbolic clustering method to other conceptual clustering methods, namely, to C L U S T E R 2 (Michalski [331), COBWEB l Fisher [25, 26]) and U N I N E M (Lebowitz [46]). The inheritance structure between clusters allows the generation of rules. Two methods of rule generation are presented. Finally, an application is presented. 2.
The symbolic objects
In general terms, a symbolic object is defined as a description that is expressed by means of a conjunction of statements on the values taken by the variables. Let f+ be the set of observed objects, f~ = {+'l . . . . . ~,} C_ II. where I] is the entire population under study. Each object is characterized by variables .v,: II ~ O,, i = ! . . . . . p. Let 1," = (.vl . . . . . y?). We then have I::
H~Ol ,,-' ~
x -..
x 0,,,,
( Yl (,.,-9 .....
yp(~))
The point ()',(~,) . . . . . Yv(")) E ( = Ot x . . . x Op is called description ~ 1 ~ , and (' = Oj x ... x O v the description space. 2 I.
THE E L E M E N T A R Y EVENTS
An elementary event, denoted e = [Y, = V,], where 1 < i < p. I/t c O,, expresses the condition "'variable )'t takes its values in I.',". We then define the mapping associated to e: e: ( --. {true,false},
such that e(x I . . . . . . ~v) = true iff x, E Vi. Let us also consider the composed mapping e" = e o Y!:I, where Ylt denotes the restriction of Y to ~L e" : ~ - - ,
{true,false}.
,; --, e(Y(~,')). e'(~.,) = true iffy,(~) is an element of V,.
P. Britoj£:vmbolic objects
280
I)EFINITION
The extension o r e on ~} is defined as exts~e = {~,.' ¢ ~}/e'{w} = true} = e" I(true).
I)fit-INITION
The virtual extension o t e , on ( . is defined as extce=
{ x 6 C/e(x} = true} = e
I(true).
The following p r o p o s i t i o n results immediately from the definitions o f extension and virtual extension. PR(}t}(}SI'II{}N 1
The image b~ Y o f the extension on ~} of an e l e m e n t a r y event e is contained. but not necessarily equal, to the virtual extension o f e, I~XA.M I'Ltl 1
Let ~} {~'l,wz.w~.,,:4} be a set of o b s e r v e d objects, described by two variables, .vl - cohmr. ()l = {t}Im'. red green, w h i t e , . . . }: v: = size, O, = {hig, medium. .,'mall }:
~'1 ~'. .'~ -'a
{_'onsider the elementary event e =
t" I
V-
him' ~lute hlue ,k're¢'n
medium htg htg ~mall
[size = {hig, ,,wdhml}]. Then
ext{~e = {,,.'
[co~our = {blue, red}].
[colour = {h/iw. red}] A 1-':1,,,'.:3} ~ [colour = {Alue}] A [,,'i..-{, = {/~ig. medium}]. e is not complete, g ( . l { e ) ) = [{'ohmr = {blue}] A [size = {hig. medium}] is complete.
P. Brito, Symbolic objects
287
Let a be a complete assertion object (i.e. h ( a ) = a) and E its extension, E = l ( a ) . Then, g(E) = a and E = h'(E). Conversely, if E = h'(E) and a = g(E) then a is complete and E = f(a). IHEOREM I
The set o f all complete assertion objects constitutes a lattice for the symbolic order defined above, where inf and sup are given by: inf(al, a2) = h(al N a2) = g o l ( a l N a2), s u p ( a l . a 2) = al Ua2.
DEFINITION
Given a set o f observed objects ~L a concept o f ~ is defined as a couple (E, a) such that E C_ ~L a C ,4, a is complete and E = l ( a ) . A concept is hence defined as a couple extension intension, it consists on a class together with its description. Here, we generalize the definition presented by Wille [45] for the binary, case, based on the philosophical notion of concept: "'Traditional philosophy considers a concept to be determined by its extent and its intent: the extent consists o f all objects belonging to the concept, while the intent is the multitude of all attributes (or properties) valid for all those objects" [45]. In Machine Learning, however, the notion of concept generally refers to the intension of the class (Rendell [39]L Another point which should be stressed is the fact that we only consider conjunctive concepts, in other words, a couple (E. al is a concept if E can be described by a (discriminant) conjunction, a.
THEOREM 2
The set o f concepts o f ~L with the order defined by (Ei,al) respectively, the assertion objects representing them. We can then write: S ::~ S I VS-,.
4.3.2, Merging method
Let Pl, P2, PI2 E P be such that P~2 = Pl C~P2 ¢ O, and sl:, s~. s 2, respectively, the assertion objects representing them. Then: S 1 /~S'~ ~
5.
512.
Application
In this example, we cluster a set of 30 cat species, which are described by 14 qualitative variables, the 15th variable, that has not been taken into account in the clustering process, indicates the zoological genera. This data is originally from Dorst and Dandelot [22]. In figure 1 we can see the obtained pyramid. The symbolic pyramidal clustering method in this case produced an incomplete pyramid (that is, the cluster corresponding to the whole set is not formed); five terminal clusters can be observed: cluster cluster cluster cluster cluster
1: cat 2: cat 3: cat 4: cat 5: cat
species species species species species
13, 15, 17, 18, 2t, 22, 24, 28, 29, 30; 13, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29; 9, 12, 20, 25; 5, 7, 9, 10, 11, 12, 14; 1, 2, 3, 4, 5, 6, 7, 8, 9, I0, I1, 14.
The descriptions by assertion objects allow us to characterize the clusters: .
Cluster 1 consists of cat species whose fur is not striped nor mottled, that have non-retractile claws, that present a diurnal or mixed predatory behaviour, have the hyoid bone, whose shoulder height is less than 50cm, weight less than 80 kg and length less than 150 cm, tail is of medium size, have well developed canines, preys are small or mixed and hunt.
•
The cat species that constitute cluster 2 have non-retractile claws, nocturnal or mixed predatory behaviour, hyoid bone, shoulder of height less than 50cm, weight less than 10kg and length less than 150cm, well-developed canines and small preys.
P. Brito/Symbolic objects
292
30151813 ~ 21 172B24 2':3 16232527 19~L::K) I2 9 1014 7 11 5 8 3 4 2 l 6
Figure 1. Pyramid obtained on the 30 cat species.
•
Cluster 3 consists of cat species whose fur is not striped nor mottled and have short hair. non-retractile claws, diurnal or mixed predatory behaviour, hyoid bone, shoulder height less than 70cm. weight less than 80 kg, length less than 150cm, short tail, well-developed canines, small preys and do not climb trees.
•
Cluster 4 consists of cat species whose fur is not striped nor mottled, have non-retractile claws, whose shoulder is shorter than 70cm, weight over 80kg, length less than 150cm, have small or mixed preys and do not climb l Fees.
•
Finally, cluster 5 is composed of cat species of length greater than 80cm and weight over 10kg.
Clusters I, 2 and 3 only contain cat species of genus felts. The cheetah, the only cat of genus acinonyx, appears in cluster 5 but rather isolated from the other members of the cluster. The group lion-tiger-jaguar-leopard (cats 1-2-3-4) is characterized by the presence of short hair. non-retractile claws, diurnal or nocturnal predatory behaviour (but not mixed), round ears, absence of the hyoid bone, shoulder of length over 70cm, weight over 80kg and length over 80cm, short or medium tail and underdeveloped canines. These cats all belong to the genus pantera; the fifth element of
P. Brito/Symbolic objects
293
Table I Data 1. 2. 3. 4. 5. 6, 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30.
Lion Tiger Jaguar Leopard Ounce Cheetah Puma Neofi..nebulosa Serval Ocelot Lynx Caracal Viverrin cat Jaguarundi Chaus cat Golden cat Merguay cat Marguerit cat Cafer cat Bieti cat Bengal cat Rouilleux cat Malaysian cat Borneo cat Nigripes cat Manul cat Marble cat Tigrin cat Temminck cat Andean cat
121 123 123 123 321 II1 122 123
2 2 2 2 2 1 1 2
1212 2 2 1 3 1 1 1 2 1 3 1 2 2 1 1 2 1 4 3 1
1 2 1 1 1 2 1 1 2 1 1 1 1 2 1 1 2 1 1 1
3
3
2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 3
2 2 2 2 2 3 3 3 2 3 2 3 2 3 3 2 3 3 3 3 3
1 2 2 1 1 2 1 1 1 1 2
3 3 3 3 2 3 2 2
3 3 3 3 2 2 3 2
3
3
3 3
3 3 2 2 2 2 2 2
2 2 1 2 3 3 3 3
~
1
3 3
~
3
3 I
2 2 1 1 122 223 3
3
1 1 3 3
1 3 3
1 3
I I 3 3 3 3
1 1 I I 1 2 2 2 2 2 2 1 2 2 3 2 3 2 3 2 2 2 3 2 31 32 32 32 32 31 3 2 3 2 3 2 3 2 3 2 3 2 3 2 3 2 3 2 3 2 3 2 2 2
2 I 1 1 1 2 1 1 2 1 1 2
1 I 1 1 1 4 3 2 3 3 3 3 3 3 3 3 3 3
2 1 1 1 1 1 2 1 1 1 1 1
3 3 3 3 3 3 3 3 3 3 3 3
this genus, the ounce (cat number 5) is a little apart. This can be explained by the fact that on several variables on which cat species I-2-3-4 agree cat 5 takes a different value. We can also observe that the cheetah is the only cat in cluster 5 having retractile claws. 6.
Conclusion and perspectives
The formalism of symbolic objects presented in this paper aims at formalizing knowledge representation. By combining a set theoretic approach with a logical representation, it allows to bridge the gap between Data Analysis and Machine Learning, thus opening the way to the combination of techniques from both
P. Brito/Symbolic objects
294
Table 2 Variables ~'l
fur: 1 2 3 4
~',
hair: I short hair 2 long hair
without spots with spots striped mottled
~,~
claws: I retractile 2 non-retractile
r'4
predatory behaviour: I diurnal 2 diurnal and nocturnal 3 nocturnal
~,~
shape of ears: I round 2 pointed
,.~
hyoid bone: 1 present 2 absent
,'-
shoulder height: I height < 50cm 2 50cm ,v height < 70cm 3 height ;, 70cm
,,~
weight: I weight < 10kg "~ 10kg < weight < 80kg 3 weight > 80kg
~',~
length: I length ~ 80cm 2 80cm ,~ length < 150cm 3 length > 150cm
w,l~ )
tail-length (% body length) I short 2 medium 3 long
r'll
canines: I well-developed 2 underdeveloped
~'lz
prey: 1 big "~ big or small
~'la
tree-climbing: 1 climbs trees 2 doesn't climb trees
r,14
hunting: 1 yes "~ no
I'lr,
zoological classification: 1 genus panthera 2 genus neofilis 3 genus fells 4 genus acinonyx
domains. Moreover, it allows to make a step further in the processing of higher level data, since symbolic objects generally represent aggregates of objects which become the units to be subjected to analysis. The pyramidal clustering method we present allows to cluster the defined objects (thus generalizing the usual Data Analysis data) and, by considering settheoretic criteria, to obtain an inheritance structure whose clusters have an
P. Brito/Symbolic objects
295
immediate conceptual interpretation. We have hence a structure allowing to represent and learn knowledge. Perspectives of future development include extending the method to imprecise or uncertain data, on the one hand, and to take advantage of Data Analysis techniques (such as factorial analysis) to simplify the method, on the other hand. References Ill
A. Arnault and P. Nicole. La Logique ou l'Art de Penser (Flammarion, Paris. 1662, 1970).
121 M Barbut and B. Monjardet, Ordre et Classification, Algbhre et Combinatoire, Vols. 1 and 2 [31 141 I5] [~1
I~1 [gl
[t)] t l~1] II I1 I121 [13] [14] [I 51
[16] [17]
[I 8] [It)]
(Hachette, Paris, 1970). P. Bertrand, Etude de la repr6sentation pyramidale, These de 34me cycle, Univ. Paris-IX Dauphine (1986). P. Bertrand and E. Diday. A visual representation of the compatibility between an order and a dissimilarity index: the pyramids, in: Comp. Statist. Quarterly 2 (1985) 31 42. G. Birkhoff, Lattice Theory, American Mathematical Society Colloquium Publications, Vol. 25, 3rd ed. (1967). G. Bisson, Learning of rule systems by combining clustering and generalization, in: SymbolicNumeric Data Analysis and l_x,arning, eds. E. Diday and Y. Lechevallie {Nova Science, New York, 1991 ), P. Brito, Analyse de donnees symboliques. Pyramides d'heritage, Th4se, Universit/r Paris IXDauphine, Paris (1991). P. Brito and E. Diday, Pyramidal representation of symbolic objects, in: Knowledge, Data and Computer-Assisted Decisions. Proc. NATO Advanced Workshop on Data. Expert Knowh,dge and Decision, Hamburg, eds. M. Schader and W. Gaul (Springer, 1990). F.A.T. De Carvalho, J. Lebbe, R. Vignes and E. Diday, Dissimilarity in symbolic data analysis, in: Proc. COMPSTA T. 9th Syrup, on Computational Statistics, Dubrovnik, Yugoslavia (1990). C. Decaestecker. Apprentissage en classification conceptuelle incrementale, These, Univ. Libre de Bruxelles (1991). E. Diday, Nouveaux concepts et nouvelles methodes en classification automatique, These d'Etat, Univ. Paris VI (1972). E. Diday, Selection typologique de parametres, Rapport de Recherche IN RIA No. 188, IN RIA, Rocquencourt, Le Chesnay (1976). E. Diday. Une representation visuelle des classes empietantes: les pyramides, Rapport de Recherche INRIA No. 291. INRIA, Rocquencourt, Le Chesnay (1984). E. Diday, Orders and overlapping clusters by pyramids, in: Multidimensional Data Analysis Proc., eds. J. De Leeuw et al. (DSWO Press, Leiden, 1986). E. Diday, The symbolic approach in clustering and related methods of data analysis: the basic choices, in: Proc. 1st Conf. ,~f the International Federation of Classilication Societies, Technical Univ. of Aachen, ed. H. Book (1987). E Diday, Introduction ~i l'approche symbolique en Analyse des Donnees, RAIRO 23 (1989). E. Diday, Knowledge representation and symbolic data analysis, in: Knowledge, Data and Computer-Assisted Decisions, Proc. NATO Advanced Workshop on Data, Expert Knowledge and Decision, Hamburg, eds. M. Schader and W. Gaul (Springer, 1989), E. Diday, Des objets de l'Analyse des Donnees ~i ceux de I'Analyse des Connaissances, in: Induction Swnholique Num~rique h Partir de Donn~es, eds. Y. Kodratoffand E. Diday (Cepadues, 1991 ). E. Diday, G. Govaert. Y. Lechevallier and J. Sidi, Clustering in pattern recognition, in: Proc. NA TO Advam'ed Stud)" Institute on Digital Image Processing and Analysis, Bonas, ed. J,C. Simon (1980).
296
P. Brito/Symbolic objects
[20] E. Diday and L. Roy, Generating rules by symbolic data analysis and application to soil feature recognition, in: Acres des Seines Journees lnternationales "Les s.vstemes experts et leurs applica. tions". Avignon (1988) pp. 533 549. [21] E. Diday and J.C. Simon, Clustering analysis, in: Communication and Qvhernetics 10, Digital Pat. tern Recognition, ed. K.S. Fu (Springer, 1976) pp. 47 94. [22] J. Dorst and P. Dandelot, Guide des Grands Mammiferes d'Afrique (Delachaux et Niestl~, Neuchfitel. 1972}. I23] V. Duquenne, Contextual implications between attributes and some representation properties for finite lattices, Rapport C A M S . P.023, Centre d'Analyse et de Mathematiques Sociales, Maison des Sciences de L'Homme. Paris (1986). [24] C. Durand. Ordres et graphes pseudo-hierarchiques: theories et optimisation algorithmique, These. Univ. de Provence Saint Charles(1989). [25] D.H. Fisher, Conceptual clustering, learning from examples and inference, in: Proc. 4th Int. tt'orkshop on Machine Learning, Ir~'ine. CA (1987}. [26] D H . Fisher, Improving inference through conceptual clustering. Proc. AAAI-87. 6th Natianaf Con/: on Art~[icial Intelligence, Seattle, WA (1987). [27] O. Gascuel. Inductive learning, numerical criteria and combinatorial optimization, some results, in: Proc. ConL on Data Analvsi.~. Learning Symbolic and Numerical Knowledge. Antibes, ed. E. Dtday (Nova Science, New York. 1989). [28] C. Jacq. Combining a decision tree with kernel estimates, in: Symbolic Numeric Data Analysis and Learning, eds. E Diday and Y. Lechevallier (Nova Science, New York, 1991). [29] J. Lebbe, Representation des concepts en biologie et en medecine, These, Universite Paris-V1. Paris ( 1991 }. [3(I] R. Michalski, Aqual I Computer implementation of a variable-valued logic system VLI and examples of its application to pattern recognition, in: Proc. 1st Int. Joint Con]. on Pattern Recognition. Washington, DC (1973) pp. 3 17. [31] R Michalski. Pattern recognition as rule-guided inductive inference, IEEE Trans. Pattern Anal. Machine int. PAMI-2 (1980). I321 R. Michalski. Knowledge acquisition through conceptual clustering: a theoretical framework and an algorithm for partitioning data into conjunctive concepts, Int. J. Policy Anal. Inf. Syst.
(1980). [33] R Michalski. A theo~' and methodology of inductive learning, in: Machine Learning I, eds. R.S. Michalsky. J.G. CarbonellandT.M. Mitchell (Springer, 1984) pp. 83 134. [34] R Michalski. E Diday and R. Stepp. A recent advance in Data Analysis: Clustering objects into classes characterized by conjunctive concepts, in: Progress in Pattern Recognition, Vol. 1. eds. L Kanal and A. Rosenfeld (1982). [35] R. Michalski and R Stepp. An application of AI techiques to structuring objects into an optimal conceptual hierarchy, in: Proc. 7~th Int. Joint ConL on Arti/icial lntelligeme, Vancouver. Canada
11981), [36] R. Michalski and R. Stepp, Revealing conceptual structure in data by inductive inference, in: Machine Intelligence. Vol. I 0 ( 1981 ). [37] R. Michalski and R. Stepp, Learning from observation: conceptual clustering, in: Machine lwarnmg L eds. RS. Michalsky. J.G. Carbonell and T.M. Mitchell (Springer, 1984) pp. 331 363. [38] T. Mitchell, Generalization as search. Art. Int. 18 (1982) 203 226. [39] L Rendell, A general framework for induction and a study of selective induction, Machine Learning 1 (1986) 177 226. [40] M. Sebag and M. Sehoenauer, Incremental learning of rules and meta rules, in: Proc. 7th Int. ¢'onL on Machine Learning, eds. B. Porter and R. Mooney (Morgan Kaufmann, 1990). [41 ] J. Sidi, L'approche logique en classification automatique et reconnaissance des formes, These de 3~me cycle, Univ. Pierre et Marie Curie (1980).
P. Brito/Symbolic objects
297
[42] R.E. Stepp and R.S. Michalski. Conceptual clustenng: inventing goal-oriented classifications of structured objects, in: Machine Learning II, eds. R.S. Michalsky. J.G, Carbonell and T,M. Mitchell (Morgan Kaufmann, Los Altos. CA. 1986) pp. 471-498. [431 J,C Stoffel, The theory of prime events, Data Analysis for sample vectors with inherently discrete variables, in: Inlbrmation Processing 74 (1974). [44] R, Vignes, Caracterisation automatique de groupes biologlques, These. Universite Pans-Vl. Paris ( 1991 ), [45] R. Wille. Restructunng lattice theory: an approach based on hierarchies of concepts, in: Proc. Syrup, on Ordered Sets, ed. l, Rival (Reidel, DordrechtBoston, 1982) pp. 445 470. [461 M, Lebowitz, Experiments with incremental concept formation: UNINEM, in: Machine Learning 2 (1987) pp. 103--138. [47] R. Michalsky, Automated constructions of classifications: conceptual clustering versus numerical taxonomy, IEEE Trans. Pattern Anal. Machine Int. PAMI-5 (1983),