Development of Ontologies for the Semantic Web

2 downloads 0 Views 120KB Size Report
Free University of Bozen-Bolzano ... mains calls for the design and the development of the basic infrastructure for ... Web, are developed in a distributed and autonomous way by independent parties. .... This work can be seen as a first step towards ontology interoperation .... SWWS/program/full/SWWSProceedings.pdf.
In Proc. of the 40th Annual Conference of the Italian Computing Association (AICA 2002)

Development of Ontologies for the Semantic Web: An Overview Diego Calvanese1 , Giuseppe De Giacomo1 , Enrico Franconi2 1

Dipartimento di Informatica e Sistemistica Universit`a di Roma “La Sapienza” Via Salaria 113, 00198 Roma, Italy, {calvanese,degiacomo}@dis.uniroma1.it 2

Faculty of Computer Science Free University of Bozen-Bolzano Dominikanerplatz 3, I-39100 Bozen, Italy [email protected] http://www.inf.unibz.it/~franconi/

Abstract In this paper we overview the main research areas that, in our view, will have a profound impact on the future development of ontologies for the Semantic Web. Specifically, apart from the research on ontologies itself, we analyse the following fieldswhose technologies may contribute to ontology development: knowledge representation and reasoning techniques and systems, information integration and interoperation in databases and information systems, and quality analysis and quality based design.

1

Introduction

Currently computers are changing from single isolated devices to entry points into a worldwide network of information. As a result, support in the exchange of data, information, and knowledge is becoming the key issue in current computer technology. Ontologies play a major role in supporting information exchange across various networks. Defined as “specifications of a shared conceptualisation of a particular domain”, they provide a shared and common understanding of a domain that can be communicated across people and application systems, and thus facilitate knowledge sharing and reuse. Ontologies will play a key role in growth areas such as intelligent information integration, cooperative information systems, information retrieval, digital libraries, knowledge management and electronic commerce, and bioinformatics (especially medical terminologies). Also, Ontologies are widely regarded as one of the foundational technologies for the Semantic Web. Indeed, recently we observe the development of large numbers

1

of ontologies (see, e.g., WordNet1 , CYC2 , and the DAML ontology library3 ). These ontologies have, however, usually been developed in an ad hoc manner by domain experts, often with only a limited understanding of the semantics of ontology languages. The result is that many ontologies are of low quality—they make poor use of the languages in which they are written and do not accurately capture the author’s rich knowledge of the domain. This problem becomes even more acute as ontologies are maintained and extended over time, often by multiple authors. Poor quality ontologies usually require localised “tuning” in order to achieve the desired results within applications. This leads to further degradation in their overall quality, increases the brittleness of the applications that use them, and makes interoperability and reuse difficult or impossible. In order to overcome these problems, ontologists need clear and measurable quality criteria, and tools that support them in the task of engineering ontologies that meet these criteria. Indeed, the growing importance of ontologies in various application domains calls for the design and the development of the basic infrastructure for building, merging, and maintaining ontologies. However, there is still a fundamental lack of basic technical and methodological means for the development of high quality ontologies, and for preserving the quality and reusability of ontologies over time. In this paper we overview the main research areas that, in our view, will have a profound impact on the future development of ontologies for the Semantic Web. Specifically, apart from the research on ontologies itself, the following areas can deeply contribute to ontology development: • Automated reasoning techniques and systems. Ontologies can be formalised in logic, and the use of automated reasoning tools allows one to extract and exploit knowledge that is only available implicitly. This makes research in knowledge representation and reasoning of great interest for ontologies. • Information integration and interoperation. Ontologies, especially in the Semantic Web, are developed in a distributed and autonomous way by independent parties. Integration and interoperation of such ontologies is a major challenge. Research on schema and data integration in databases and information systems is the starting point for studying integration and interoperation of ontologies. • Quality analysis and quality based design. Ontologies, as software, are human artifacts that will be widely deployed. Hence both designers and users of ontologies need objective and measurable quality criteria for evaluating several different aspects concerning them. Research on quality analysis, typical of software engineering, has to be adapted to the issues specific for ontology development. The rest of the paper is organised as follows. In Section 2 we overview recent research on ontology development. In Section 3 we discuss the work on knowledge representation and reasoning relevant for ontologies. In Section 4 we analyse the research in integration and interoperation in databases and information systems, which can be the base for 1

http://www.cogsci.princeton.edu/~wn/ http://www.cyc.com/ 3 http://www.daml.org/ontologies/ 2

2

integration and interoperation of ontologies. In Section 5 we look at recent developments in quality analysis that can be relevant for ontologies. Finally, in Section 6 we discuss the limitations of the current approaches for ontology development and preliminary ideas to overcome them.

2

Ontologies

Even though ontologies have a long history in Artificial Intelligence (AI), the meaning of this concept still generates a lot of controversy in discussions, both within and outside of AI. We follow Gruber’s definition [46]: an ontology is a specification of a conceptualisation, that is, an abstract and simplified view of the world that we wish to represent, described in a language that is equipped with a formal semantics. In knowledge representation, an ontology is a description of the concepts and relationships in an application domain. Depending on the users of this ontology, such a description must be understandable by humans and/or by software agents. Researchers in AI were the first to develop ontologies with the purpose of facilitating automated knowledge sharing (see, e.g., [82, 52, 77]). Since the beginning of the 90’s, ontologies have become a popular research topic, and several AI research communities, including knowledge engineering, knowledge acquisition, natural language processing, and knowledge representation, have investigated them. More recently, the notion of an ontology is becoming widespread in fields such as intelligent information integration, cooperative information systems, information retrieval, digital libraries, e-commerce, and knowledge management. Ontologies are widely regarded as one of the foundational technologies for the Semantic Web [18, 68, 53]: when annotating web documents with machine-interpretable information concerning their content, the meaning of the terms used in such an annotation should be fixed in a (shared) ontology. Recent research has focused on the development of formalisms—with a logic-based semantics—for representing ontologies (e.g., OIL [38] and DAML+OIL [34]) and tools for building, maintaining and integrating ontologies (e.g., OilEd, OntoEdit, Chimaera, and Icom [16, 79, 69, 39]). An important area of research is concerned with philosophical issues related to the structure of knowledge and with the specification of general-purpose “upper level” ontologies that reflect this structure (see, e.g., CYC and the IEEE Standard Upper Ontology effort). Progress in these endeavours has, however, been very slow—it has proved extremely difficult to reach a consensus, and many now doubt if it is possible to define ontologies that accurately represent the structure of knowledge across widely varying domains of discourse. In contrast to such general ontologies, a variety of specialised ontologies have now been built for all kinds of applications (see, e.g., [45, 74, 71, 72, 54]). These efforts have been more successful and some of these ontologies have turned out to be very useful in applications. Various methodologies are being developed on how to build a “good” ontology. These methodologies are mainly developed in the field of knowledge acquisition (KA) (see, e.g., [49, 78]), but also by researchers following a more interdisciplinary approach [47, 15, 6]. These approaches differ in many aspects, e.g., in the underlying representation 3

formalism, and whether they are equipped with an explicit notion of quality.

3

Reasoning Techniques and Systems

Interestingly, state-of-the-art ontology languages like OIL and DAML+OIL can be viewed as dialects (see, e.g., [59]) of the most popular logic-based knowledge representation family of languages, namely description logics (DLs) [31, 32]. Hence, DLs form a good starting point for the development of a logic-based formalism for representing ontologies. This relation was already clear in early days of DL research (i.e., 80ies and early 90ies), but at that time there was a fundamental mismatch between the expressive power and the efficiency of reasoning that DL systems provided, and the expressivity and the large ontologies that ontologists needed. In the last decade, the expressive power provided by DL-based systems has increased dramatically: starting from tight worst-case complexity bounds for highly expressive DLs [22, 36, 25, 80, 28] (these DLs are mostly EXPTIME-complete), practical reasoning algorithms for these logics were developed [57, 59, 58]. These algorithms turned out to be amenable to sophisticated optimisation techniques, and they gave rise to the current state-of-the-art reasoning systems, which behave surprisingly well in practice [55, 56, 48]. In addition to the research on standard reasoning problems like subsumption, nonstandard reasoning problems for description logics have been investigated in the last few years. This was motivated by the observation that an optimal support for building and maintaining large knowledge bases also requires system services that cannot be provided by the standard reasoning techniques. These non-standard reasoning problems encompass matching and unification of concepts (useful, e.g., for browsing ontologies and detecting redundancies) [11, 9, 8], least-common-subsumer and most-specific-concept computation (useful to support the definition of new concepts) [7, 10], and approximation of concepts (useful for approximate reasoning and for a comprehensible presentation of ontologies to non-expert users). Though DLs are good candidates for a logical basis of ontology languages, this research field is not the only source of results on reasoning in logics that are relevant for ontologies. In various other areas of computational logic, results relevant for this project can be found: e.g., in temporal, modal, dynamic, and hybrid logic, in automated deduction, and in the research on decidable fragments of first-order logic [51, 43, 44, 83, 23, 24, 4, 42, 5]. These results may also concern reasoning problems that are not prominent in DLs (e.g., model checking), but which may become relevant for the quality-oriented ontology management [21, 19]. And they may concern logics that differ in expressive power from DLs (e.g., some hybrid and temporal logics [3, 37]). Fortunately, there exist a well-established correspondences between the above mentioned fields, and thus there is a good chance that results can be transferred [76, 35, 20, 62, 2].

4

Integration and Interoperation

Ontology integration is the problem of combining ontologies from different sources, and providing the user with a unified view of these ontologies, called mediated ontology. 4

Ontology interoperation is the problem of allowing various ontologies to interoperate and exchange information, without necessarily providing a mediated ontology as the principle mean of accessing information in them. Research on ontology integration and interoperation can draw from the large amount of work carried out in data integration [27, 26]. For a survey on data integration, see e.g., [65]. Many organisations face the problem of integrating conceptual schemas residing in several sources. Companies that build a Data Warehouse, a Data Mining, or an Enterprise Resource Planning system must address this problem. Also, integrating ontologies in the semantic web is the subject of several investigations and projects nowadays. Indeed, the web is constituted by a variety of information sources, each expressed over a certain ontology, and in order to extract information from such sources, their semantic integration and reconciliation in terms of a mediated ontology is required. Many research works have addressed the fundamental problem of how to specify the mapping (also called articulation) between the mediated global schema (global ontology) and the information sources (local ontologies), see, e.g., [61, 33, 67, 29, 70, 41, 64]. For capturing such a mapping in an appropriate way, the notion of query is a crucial one, since it is very likely that a concept in one source corresponds to a (possibly simple) view—that is, a query—over the other sources. As a result the problem of information integration is strongly related to the problem of view-based query processing in data integration [17, 30]. Two basic approaches have been used to specify the relation between the sources and the mediated schema [81]. The first approach, called global-as-view, requires that the mediated schema is expressed in terms of the concepts in the sources. More precisely, to every concept of the mediated schema, a view over the concepts in the sources is associated, so that its meaning is specified in terms of those concepts. The second approach, called local-as-view, requires the mediated schema to be specified independently from the sources. The relationships between the mediated schema and the sources are established by defining every source as a view over the mediated schema. Thus, in the local-as-view approach, we specify the meaning of the sources in terms of the concepts in the mediated schema. It is clear that the latter approach favours the extensibility of the integration system, and provides a more appropriate setting for its maintenance. For example, adding a new source to the system requires only to provide the definition of the source, and does not necessarily involve changes in the mediated schema. On the contrary, in the global-as-view approach, adding a new source typically requires changing the definition of the concepts in the mediated schema. Interestingly, also approaches that mix the global-as-view and local-as-view perspective have been proposed [40, 26]. This work can be seen as a first step towards ontology interoperation that does without a global reconciled view of the information.

5

Quality Analysis

Models and tools for quality oriented ontology development can build on substantial previous work in the fields of data and software quality. Quality assurance includes all the planned and systematic actions necessary to provide adequate confidence that a product will satisfy a given set of quality requirements. There is a significant amount 5

of work on quality assurance on data, from which the project can draw input [88]. Research regarding the design of data manufacturing systems that incorporate data quality aspects is classified into two approaches: (i) the development of analytical models and (ii) the design of system technologies to ensure the quality of data. In the former group, one can find models for investigating the quality in existing systems [12, 13] that produce, e.g., expressions for the magnitude of certain types of errors; it is also possible to predict the impact of quality control efforts on such rates. In the latter group, data tracking techniques have been proposed, which use a combination of statistical control and manual identification of errors and their sources [60, 73, 75]. The models in [84, 86, 87, 85] assume that the quality design of an information system can be incorporated in the overall design of the system. Further work on data quality can be found in [14, 63, 50, 66, 1]. Of particular interest for quality-based ontology development is also the work carried out in the ESPRIT-IV “Foundations of Data Warehouse Quality (DWQ)” project [65], where a global framework linking quality factors with conceptual data models equipped with formal semantics was proposed.

6

Limitations of the Current Approaches

Today, the key role of ontologies in information management in general and the Semantic Web in particular, has led to the rapid development of a large number of ontologies. These ontologies have, however, usually been developed in an ad-hoc manner by domain experts, often with only a limited understanding of the semantics of ontology languages. The result is that many ontologies are of low quality—they make poor use of the languages in which they are written and do not accurately capture the author’s rich knowledge of the domain. To the best of our knowledge, an ontology building methodology which uses the support provided by state-of-the-art logic based ontology languages and also helps the user to take advantage of the full expressive power of such a language is still missing. Such a methodology is indeed needed to build high-quality ontologies since (1) domain experts are, in general, not experts in the ontology language they are using, (2) formalising one’s knowledge in any kind of formalism is, in general, a highly complicated task which requires a lot of discipline, perseverance, and/or support. These problems are often aggravated by the fact that one ontology is built by several domain experts, and that ontologies need to be maintained and extended over the time. Other limitations of the current methodologies arise from the current status of the reasoning techniques employed by the state-of-the-art ontology systems. First of all, the expressive power required for representing high-quality ontologies is still not fully provided. Secondly, the sheer size of realistic ontologies pose problems that may require new optimisation techniques for reasoning. Thirdly, the fact that ontologies need to be integrated and interoperated requires the investigation of novel paradigms (such as a mixed global-as-view and local-as-view approach) and of reasoning problems tailored to supporting these tasks. Poor quality ontologies usually require localised “tuning” in order to achieve the desired results within applications. This leads to further degradation in their overall 6

quality (e.g., their interpretability), increases the brittleness of the applications that use them, and makes interoperability and re-use difficult or impossible. These considerations will be of crucial importance to the Semantic Web, where it is expected that (very large numbers of) web pages will be marked up using multiple ontologies, without the authors of the ontologies having any knowledge or control over the use to which they are being put. The research and the technologies we have overviewed in this paper will contribute in overcoming these problems. In particular, ontologists need clear and measurable quality criteria, and tools that take them into account when building, maintaining, and interoperating ontologies. To “take into account” means both to measure the quality of an ontology w.r.t. specific quality criteria as well as to support the user when operating an ontology. The latter point requires that the user, when operating an ontology, can state her quality requirements, which are then taken into account by the system supporting this operation. Logics with suitable computational properties are natural candidates to provide formal foundations for solid and measurable quality criteria on which to base methodologies and advanced tools for ontology development.

7

Conclusions

In this paper we have overviewed some of the research areas the promise to have a big impact on ontology development for the Semantic Web, namely ontologies, automated reasoning, integration and interoperation, and quality analysis. An aspect that we have not touched is the development and composition of e-services that make use of ontologies, which are of fundamental interest for the Semantic Web and currently actively studied. To tackle such an issue, the research mentioned in this paper is still of importance. However, an understanding of the basic issues regarding the semantics of e-services is also needed and still under investigation.

Acknowledgements The ideas presented here are the result of various discussions with Franz Baader, Jeen Broekstra, Carole Goble, Frank van Harmelen, Ian Horrocks, Maurizio Lenzerini, Daniele Nardi, Ulrike Sattler, and Heiner Stuckenschmidt. We would like to thank all of them.

References [1] N. Agmon and N. Ahituv. Assessing data reliability in an information system. J. Management Information Systems, 4(4), 1987. [2] Carlos Areces. Logic Engineering. The Case of Description and Hybrid Logics. PhD thesis, ILLC, University of Amsterdam, 2000. ILLC Dissertation Series 2000–5. [3] Carlos Areces, Patrick Blackburn, and Maarten Marx. A road-map on complexity for hybrid logics. In Proc. of the Annual Conf. of the Eur. Assoc. for Computer 7

Science Logic (CSL’99), volume 1683 of Lecture Notes in Computer Science, pages 307–321. Springer, 1999. [4] Alessandro Artale and Enrico Franconi. A temporal description logic for reasoning about actions and plans. J. of Artificial Intelligence Research, 9:463–506, 1998. [5] Alessandro Artale and Enrico Franconi. A survey of temporal extensions of description logics. Annals of Mathematics and Artificial Intelligence, 30(1-4), 2001. [6] Alessandro Artale, Nicola Guarino, Enrico Franconi, and Luca Pazzi. Part-whole relations in object-centered systems: an overview. Data and Knowledge Engineering, 20:347–383, 1996. [7] Franz Baader and Ralf K¨ usters. Computing the least common subsumer and the most specific concept in the presence of cyclic ALN -concept descriptions. In Proc. of the 22nd German Annual Conf. on Artificial Intelligence (KI’98), volume 1504 of Lecture Notes in Computer Science, pages 129–140. Springer, 1998. [8] Franz Baader and Ralf K¨ usters. Matching in description logics with existential restrictions. In Proc. of the 7th Int. Conf. on Principles of Knowledge Representation and Reasoning (KR 2000), pages 261–272, 2000. [9] Franz Baader, Ralf K¨ usters, Alex Borgida, and Deborah L. McGuinness. Matching in description logics. J. of Logic and Computation, 9(3):411–447, 1999. [10] Franz Baader, Ralf K¨ usters, and Ralf Molitor. Computing least common subsumers in description logics with existential restrictions. In Proc. of the 16th Int. Joint Conf. on Artificial Intelligence (IJCAI’99), pages 96–101, 1999. [11] Franz Baader and Paliath Narendran. Unification of concept terms in description logics. In H. Prade, editor, Proc. of the 13th Eur. Conf. on Artificial Intelligence (ECAI’98), pages 331–335. John Wiley & Sons, 1998. [12] D. P. Ballou and H. L. Pazer. Modelling data and process quality multi-input, multi-output information systems. Management Science, 31(2), 1985. [13] D. P. Ballou and H. L. Pazer. Cost/quality tradeoffs for control procedures information systems. OMEGA: International J. Management Science, 15(6), 1987. [14] D.P. Ballou and K.G. Tayi. Methodology for allocating resources for data quality enhancement. Communications of the ACM (CACM), 32(3), 1989. [15] John A. Bateman. On the relationship between ontology construction and natural language: A socio-semiotic view. International Journal of Human-Computer Studies, 43(5/6):929–944, 1995. [16] Sean Bechhofer, Ian Horrocks, Carole Goble, and Robert Stevens. OilEd: A Reasonable ontology editor for the semantic web. In Proc. of the 2001 Description Logic Workshop (DL 2001), pages 1–9. CEUR Electronic Workshop Proceedings, http: //ceur-ws.org/Vol-49/, 2001. 8

[17] C. Beeri, A. Y. Levy, and M.-C. Rousset. Rewriting queries using views in description logics. In Proc. of the 16th ACM Symp. on Principles of Database Systems (PODS’97), pages 99–108, 1997. [18] Tim Berners-Lee, James A. Hendler, and Ora Lassila. The semantic Web. Scientific American, 284(5):34–43, 2001. [19] Orna Bernholtz, Moshe Y. Vardi, and Pierre Wolper. An automata-theoretic approach to branching-time model checking. In Proc. of the 6th Int. Conf. on Computer Aided Verification (CAV’94), volume 818 of Lecture Notes in Computer Science, pages 142–155. Springer, 1994. [20] Alexander Borgida. On the relative expressiveness of description logics and predicate logics. Artificial Intelligence, 82(1–2):353–367, 1996. [21] J. R. Burch, E. M. Clarke, K. L. McMillan, D. L. Dill, and L. J. Hwang. Symbolic model checking: 1020 states and beyond. Information and Computation, 98(2):142– 170, 1992. [22] Diego Calvanese. Finite model reasoning in description logics. In Luigia C. Aiello, John Doyle, and Stuart C. Shapiro, editors, Proc. of the 5th Int. Conf. on the Principles of Knowledge Representation and Reasoning (KR’96), pages 292–303. Morgan Kaufmann, Los Altos, 1996. [23] Diego Calvanese, Giuseppe De Giacomo, and Maurizio Lenzerini. On the decidability of query containment under constraints. In Proc. of the 17th ACM SIGACT SIGMOD SIGART Symp. on Principles of Database Systems (PODS’98), pages 149–158, 1998. [24] Diego Calvanese, Giuseppe De Giacomo, and Maurizio Lenzerini. What can knowledge representation do for semi-structured data? In Proc. of the 15th Nat. Conf. on Artificial Intelligence (AAAI’98), pages 205–210, 1998. [25] Diego Calvanese, Giuseppe De Giacomo, and Maurizio Lenzerini. Reasoning in expressive description logics with fixpoints based on automata on infinite trees. In Proc. of the 16th Int. Joint Conf. on Artificial Intelligence (IJCAI’99), pages 84–89, 1999. [26] Diego Calvanese, Giuseppe De Giacomo, and Maurizio Lenzerini. A framework for ontology integration. In Proc. of the 2001 Int. Semantic Web Working Symposium (SWWS 2001), pages 303–316, 2001. Available at http://www.semanticweb.org/ SWWS/program/full/SWWSProceedings.pdf. [27] Diego Calvanese, Giuseppe De Giacomo, and Maurizio Lenzerini. Ontology of integration and integration of ontologies. In Proc. of the 2001 Description Logic Workshop (DL 2001), pages 10–19. CEUR Electronic Workshop Proceedings, http: //ceur-ws.org/Vol-49/, 2001.

9

[28] Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini, and Daniele Nardi. Reasoning in expressive description logics. In Alan Robinson and Andrei Voronkov, editors, Handbook of Automated Reasoning, chapter 23, pages 1581–1634. Elsevier Science Publishers (North-Holland), Amsterdam, 2001. [29] Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini, Daniele Nardi, and Riccardo Rosati. Information integration: Conceptual modeling and reasoning support. In Proc. of the 6th Int. Conf. on Cooperative Information Systems (CoopIS’98), pages 280–291, 1998. [30] Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini, and Moshe Y. Vardi. View-based query processing and constraint satisfaction. In Proc. of the 15th IEEE Sym. on Logic in Computer Science (LICS 2000), 2000. [31] Diego Calvanese, Maurizio Lenzerini, and Daniele Nardi. Description logics for conceptual data modeling. In J. Chomicki and G. Saake, editors, Logics for Databases and Information Systems. Kluwer, 1998. [32] Diego Calvanese, Maurizio Lenzerini, and Daniele Nardi. Unifying class-based representation formalisms. J. of Artificial Intelligence Research, 11:199–240, 1999. [33] Tiziana Catarci and Maurizio Lenzerini. Representing and using interschema knowledge in cooperative information systems. J. of Intelligent and Cooperative Information Systems, 2(4):375–398, 1993. [34] The DAML+OIL Web Site. http://www.w3.org/TR/daml+oil-reference. [35] Giuseppe De Giacomo and Maurizio Lenzerini. Boosting the correspondence between description logics and propositional dynamic logics. In Proc. of the 12th Nat. Conf. on Artificial Intelligence (AAAI’94), pages 205–212. AAAI Press/The MIT Press, 1994. [36] Francesco M. Donini, Maurizio Lenzerini, Daniele Nardi, and Werner Nutt. The complexity of concept languages. Information and Computation, 134:1–58, 1997. [37] E. Allen Emerson. Temporal and modal logic. In Handbook of Theoretical Computer Science, pages 997–1072. Elsevier Science Publishers (North-Holland), Amsterdam, 1990. [38] D. Fensel, F. van Harmelen, I. Horrocks, D. McGuinness, and P. F. Patel-Schneider. OIL: An ontology infrastructure for the semantic web. IEIS, 16(2):38–45, 2001. [39] Enrico Franconi and Gary Ng. The ICOM tool for intelligent conceptual modelling. In Proc. of the 7th International Workshop on Knowledge Representation meets Databases (KRDB’2000), 2000. [40] Marc Friedman, Alon Levy, and Todd Millstein. Navigational plans for data integration. In Proc. of the 16th Nat. Conf. on Artificial Intelligence (AAAI’99), pages 67–73. AAAI Press/The MIT Press, 1999. 10

[41] Francois Goasdoue, Veronique Lattes, and Marie-Christine Rousset. The use of CARIN language and algorithms for information integration: the picsel system. International Journal on Cooperative Information Systems, 2000. [42] Erich Gr¨adel. Decision procedures for guarded logics. In Proc. of the 16th Int. Conf. on Automated Deduction (CADE’99), volume 1632 of Lecture Notes in Computer Science. Springer, 1999. [43] Erich Gr¨adel, Phokion G. Kolaitis, and Moshe Y. Vardi. On the decision problem for two-variable first-order logic. Bulletin of Symbolic Logic, 3(1):53–69, 1997. [44] Erich Gr¨adel, Martin Otto, and Eric Rosen. Two-variable logic with counting is decidable. In Proc. of the 12th IEEE Symp. on Logic in Computer Science (LICS’97), pages 306–317. IEEE Computer Society Press, 1997. [45] Thomas R. Gruber and Gregory R. Olsen. An ontology for engineering mathematics. In Pietro Torasso, Jon Doyle, and Erik Sandewall, editors, Proc. of the 4th Int. Conf. on the Principles of Knowledge Representation and Reasoning (KR’94), pages 258–269. Morgan Kaufmann, Los Altos, 1994. [46] Tom R. Gruber. Towards Principles for the Design of Ontologies Used for Knowledge Sharing. In N. Guarino and R. Poli, editors, Formal Ontology in Conceptual Analysis and Knowledge Representation, Deventer, The Netherlands, 1993. Kluwer Academic Publishers. [47] Nicola Guarino. Formal ontology, conceptual analysis and knowledge representation. Int. Journal of Human-Computer Studies, 43(5/6):625–640, 1995. [48] Volker Haarslev and Ralf M¨oller. RACER system description. In Proc. of the Int. Joint Conf. on Automated Reasoning (IJCAR 2001), volume 2083 of Lecture Notes in Artificial Intelligence, pages 701–705. Springer, 2001. [49] Udo Hahn, Manfred Klenner, and Klemens Schnattinger. A quality-based terminological reasoning model for text knowledge acquisition. In Nigel Shadbolt, Kieron O’Hara, and Schreiber Guus, editors, Proc. of the 9th European Workshop on Knowledge Acquisition (EKAW’96), volume 1076 of Lecture Notes in Artificial Intelligence, pages 131–146. Springer, 1996. [50] H. Haloran. Systems development quality control. MIS Quarterly, 2(4), 1978. [51] Joseph Y. Halpern and Moshe Y. Vardi. Model checking vs. theorem proving: A manifesto. In V. Lifschitz, editor, AI and Mathematical Theory of Computation — Papers in Honor of John McCarthy, pages 151– 176. Academic Press, 1991. [52] Patrick J. Hayes. Naive physics I: ontology for liquids. In Jerry R. Hobbs and Robert C. Moore, editors, Formal Theories of the Commonsense World, chapter 3, pages 71–107. Ablex, Norwood, New Jersey, 1985.

11

[53] Jeff Heflin and James Hendler. Dynamic ontologies on the web. In Proc. of the 7th National Conf. on Artificial Intelligence (AAAI’00) and of the 12th Conf. on Innovative Applications of Artificial Intelligence (IAAI’00), pages 443–449, Menlo Park, CA, 2000. AAAI Press. [54] Bj¨orn H¨ofling, Thorsten Liebig, Dietmar R¨osner, and Lars Webel. Towards an ontology for substances and related actions. In Dieter Fensel and Rudi Studer, editors, Proc. of the 11th European Workshop on Knowledge Acquisition, Modeling and Management (EKAW’99), volume 1621 of Lecture Notes in Artificial Intelligence, pages 191–206. Springer, 1999. [55] Ian Horrocks. Using an expressive description logic: FaCT or fiction? In Proc. of the 6th Int. Conf. on Principles of Knowledge Representation and Reasoning (KR’98), pages 636–647, 1998. [56] Ian Horrocks and Peter F. Patel-Schneider. Optimizing description logic subsumption. J. of Logic and Computation, 9(3):267–293, 1999. [57] Ian Horrocks and Ulrike Sattler. A description logic with transitive and inverse roles and role hierarchies. J. of Logic and Computation, 9(3):385–410, 1999. [58] Ian Horrocks and Ulrike Sattler. Ontology reasoning in the SHOQ(D) description logic. In Proc. of the 17th Int. Joint Conf. on Artificial Intelligence (IJCAI 2001), pages 199–204, 2001. [59] Ian Horrocks, Ulrike Sattler, and Stephan Tobies. Practical reasoning for very expressive description logics. J. of the Interest Group in Pure and Applied Logic, 8(3):239–264, 2000. [60] Y. U. Huh, F. R. Keller, T. C. Redman, and A. R. Watkins. Data quality. Information and Software Technology, 32(8), 1990. [61] Michael N. Huhns, Nigel Jacobs, Tomasz Ksiezyk, Wei-Min Shen an Munindar P. Singh, and Philip E. Cannata. Integrating enterprise information models in Carnot. In Proc. of the Int. Conf. on Cooperative Information Systems (CoopIS’93), pages 32–42, 1993. [62] Ulrich Hustadt and Renate A. Schmidt. On evaluating decision procedures for modal logic. In Proc. of the 15th Int. Joint Conf. on Artificial Intelligence (IJCAI’97), pages 202–207, 1997. [63] M. Janson. Data quality: The achilles heel of end-user computing. Omega J. Management Science, 16(5), 1988. [64] Mathias Jarke, V. Quix, D. Calvanese, Maurizio Lenzerini, Enrico Franconi, S. Ligoudistiano, P. Vassiliadis, and Yannis Vassiliou. Concept based design of data warehouses: The DWQ demonstrators. In 2000 ACM SIGMOD International Conference on Management of Data, May 2000. 12

[65] Matthias Jarke, Maurizio Lenzerini, Yannis Vassiliou, and Panos Vassiliadis, editors. Fundamentals of Data Warehouses. Springer, 1999. [66] K. Kriebel. Evaluating the quality of information system. In N. Szysperski and E. Grochla, editors, Design and Implementation of Computer Based Information Systems. Germantown: Sijthoff and Noordhoff, 1979. [67] Alon Y. Levy, Divesh Srivastava, and Thomas Kirk. Data model and query evaluation in global information systems. J. of Intelligent Information Systems, 5:121–143, 1995. [68] Sean Luke, Lee Spector, David Rager, and James Hendler. Ontology-based web agents. In W. Lewis Johnson and Barbara Hayes-Roth, editors, Proc. of the 1st Int. Conf. on Autonomous Agents, pages 59–66. ACM Press, 1997. [69] Deborah L. McGuinness, Richard Fikes, James Rice, and Steve Wilder. The Chimaera ontology environment. In Proc. of the 17th Nat. Conf. on Artificial Intelligence (AAAI 2000), pages 1123–1124, 2000. [70] Eduardo Mena, Arantza Illarramendi, Vipul Kashyap, and Amit P. Sheth. OBSERVER: An approach for query processing in global information systems based on interoperation across pre-existing ontologies. Distributed and Parallel Databases, 8(2):223–271, 2000. [71] Pieter J. Mosterman, Feng Zhao, and Gautam Biswas. An ontology for transitions in physical dynamic systems. In Proc. of the 15th Nat. Conf. on Artificial Intelligence (AAAI’98), pages 219–224. AAAI Press, 1998. [72] Tomohiro Nakatani and Hiroshi G. Okuno. Sound ontology for computational auditory scence analysis. In Proc. of the 15th Nat. Conf. on Artificial Intelligence (AAAI’98), pages 1004–1012. AAAI Press, 1998. [73] R. W. Pautke and T. C. Redman. Techniques to control and improve quality of data large databases. In Proceedings of Statistics Canada Symp. 90, 1990. [74] A. Rector and I. Horrocks. Experience building a large, re-usable medical ontology using a description logic with transitivity and concept inclusions. In Proc. of the 14th Nat. Conf. on Artificial Intelligence (AAAI’97). AAAI Press, Menlo Park, California, 1997. [75] T. C. Redman. Data Quality: Management and Technology. N.Y. Bantam Books, 1992. [76] Klaus Schild. A correspondence theory for terminological logics: Preliminary report. In Proc. of the 12th Int. Joint Conf. on Artificial Intelligence (IJCAI’91), pages 466–471, 1991.

13

[77] Y. Shoham. Reified temporal logics: Semantical and ontological considerations. In Proc. of the 7th Eur. Conf. on Artificial Intelligence (ECAI’86), pages 390–397, 1986. [78] Edgar Sommer. An approach to measuring theory quality. In Proc. of the 9th European Workshop on Knowledge Acquisition (EKAW’96), volume 1076 of Lecture Notes in Artificial Intelligence. Springer, 1996. [79] Steffen Staab, Michael Erdmann, and Alexander Maedche. An extensible approach for modeling ontologies in RDF(S). In Proc. of the 1st ECDL’2000 Semantic Web Workshop 2000, 2000. [80] Stephan Tobies. The complexity of reasoning with cardinality restrictions and nominals in expressive description logics. J. of Artificial Intelligence Research, 12:199–217, 2000. [81] Jeffrey D. Ullman. Information integration using logical views. In Proc. of the 6th Int. Conf. on Database Theory (ICDT’97), volume 1186 of Lecture Notes in Computer Science, pages 19–40. Springer, 1997. [82] Johan F. A. K. van Benthem. The logic of time: a model-theoretic investigation into the varieties of temporal ontology and temporal discourse, volume 156 of Synthese Library. Reidel, Dordrecht, 1983. [83] Moshe Y. Vardi. Reasoning about the past with two-way automata. In Proc. of the 25th Int. Coll. on Automata, Languages and Programming (ICALP’98), volume 1443 of Lecture Notes in Computer Science, pages 628–641. Springer, 1998. [84] R. Y. Wang, H. B. Kon, and S. E. Madnick. Data quality requirements analysis and modeling. In Proc. of the 8th IEEE Int. Conf. on Data Engineering (ICDE 1993), 1993. [85] R. Y. Wang and S. E. Madnick. A polygen model for heterogeneous database systems: The source tagging perspective. In Proc. of the 16th Int. Conf. on Very Large Data Bases (VLDB’90), 1990. [86] R. Y. Wang, M. P. Reddy, and A. Gupta. An object-oriented implementation of quality data products. In Proceedings of the 3rd Ann. Workshop on Information Technologies and Systems, 1993. [87] R. Y. Wang, M. P. Reddy, and H. B. Kon. Towards quality data: An attributebased approach. Decision Support Systems, 13, 1995. [88] R. Y. Wang, V. C. Storey, and C. P. Firth. A framework for analysis of data quality research. IEEE Transactions on Knowledge and Data Engineering, 7(4), 1995.

14