example layout

19 downloads 32780 Views 635KB Size Report
Preprocessor (PHP) was used to implement the NEFIS KB allowing ... The meta database itself is based on MySQL. ... The KB host in consultation with the ... of complex forest-related issues, to better decision-making and more informed public ...
Network for a European Forest Information Service (NEFIS) – Establishment of a Knowledge Base A. Schuck1, T. Green1, Binh T. Nguyen2, T. Richards3, J. Saarikko4, S. Varis1 1

European Forest Institute, Torikatu 34, 80100 Joensuu, Finland. e-mail: [email protected] 2 International Union of Forest Research Organizations, Hauptstrasse 7, A-1140 Vienna, Austria 3 Conservation Technology Ltd, 4 Old Tarnwell, Upper Stanton Drew, Somerset, BS39 4EA, UK 4 Finnish Forest Research Institute, Unioninkatu 40 A, 00170 Helsinki, Finland _______________________________________________________________________

Abstract One central component of the European Commission (EC) 5th Framework Programme ‘Accompanying Measure’ project ‘Network for a Forest Information Service’ (NEFIS - contract number QLK5-CT-2002-30638) is the establishment of a web-based KnowledgeBase with substantive structured information content. The KnowledgeBase of NEFIS includes a repository of metadata and controlled vocabulary standards, software design documents, comprehensive frequently asked questions, worked examples and software distributions. Its content is based on essential documentation of activities related to forest information systems. NEFIS originates from the Council Regulation (1989, (EC) No 1615/89 and its extension as of 1998; 1998-2002) aiming to elaborate a European Forest Information and Communication System. A first web-based demonstration tool of such a system was built within a preceding EC activity. NEFIS is taking the development of the tool further by seeking coordination of ongoing efforts to create forest information systems and services at the European level. Further it seeks options to support potential data providers at various spatial (international, national, sub-national) levels to develop harmonized standards and procedures for providing actual data and elaborate metadata. The project has close cooperation with the Global Forest Information Service initiative. This paper will concentrate on giving insight into the establishment of the KnowledgeBase. Keywords: forest information systems, forest information, knowledge, information, metadata _______________________________________________________________________

Introduction The European Commission (EC) 5th Framework Programme ‘Accompanying Measure’ project ‘Network for a Forest Information Service’, or short NEFIS (contract number QLK5CT-2002-30638) originates from the Council Regulation 1989, (EC) No 1615/89 and its extension as of 1998 (1998-2002) aiming to elaborate a European Forest Information and Communication System (EEC, 1989). A first web-based and fully operational demonstration tool of such a system was built within a preceding EC activity (Kennedy, 2003, Schuck et al., 2004). The functionalities of the developed European Forest Information System demonstration tool (EFIS) include the compilation, processing, analysis and dissemination of available forestry information from various data sources and of various data formats for different spatial levels (Schuck et al., 2005). In order to operationalize EFIS development at a broader scale a need was seen for a system to become more flexible in addressing (1)

diverse user needs, (2) data access policies and data rights, and (3) adequate and appropriate technological possibilities for the creation and presentation of value-added products (JRC, 2002a). The NEFIS project has taken the development of the EFIS demonstration tool further towards those goals by seeking coordination of ongoing efforts to create forest information systems and services at the European level. NEFIS aims at elaborating a framework for supporting potential data providers at various spatial levels to develop harmonized standards and procedures for providing actual data and elaborate metadata. The project has for this purpose built close cooperation with the Global Forest Information Service (GFIS) initiative whose coordinator is based at the International Union of Forest Research Organizations (IUFRO), which is one of the NEFIS consortium members. Further contacts have been established to the Canadian initiative of establishing a National Forest Information System initiative (CCFM, 2004). One of the NEFIS project’s specific targets is the establishment of a ‘KnowledgeBase’ (KB) which will allow hierarchical access to metadata and documentation related to forest information systems incorporating information from general descriptions to implementation details. The KB is built as an information and document handling system for recording in particular: • outputs from NEFIS activities; • freely available documentation of other forest information systems/services development initiatives (e.g. Global Forest Information Service – GFIS; European Forest Information Service – EFIS, National Forest Information System – NFIS) and further relevant projects which were of interest to the NEFIS project activities; • relevant software resources, relevant reports and documents. This paper will describe the development of the KB within the NEFIS project. The content and purpose of the NEFIS KB (information about information services and systems) should not be confused with the content and purpose of EFIS and GFIS (information/data about forests and forestry). However, there may be some resources for which it would be appropriate to use the NEFIS KB and EFIS or GFIS. What are data, information and knowledge? As a first step a set of terms used in this paper will be presented. They will include data, information, information system, knowledge, know-how and Knowledge Base (Figure 1). 1) Data are facts represented in a readable language (such as numbers, characters, images, or other methods of recording on a durable medium). Data on its own carries no meaning (ICHNET, 2005). 2) Information can be defined in a number of different ways. According to the Free On-line Dictionary of Computing, it can be a message, in the form of a document or an audible or visible communication, meant to change the way a receiver perceives something and to influence judgement or behaviour (ICHNET, 2005). It can also be defined as data that makes a difference (Davenport & Prusak, 2000) or it can represent patterns in data (O'Dell & Grayson, 1998). A further formulation presented on the website bases on Claude Shannon’s work of the 1940s on the subject of information theory defines information as “that which reduces uncertainty” (Lucent Technologies, 2005)

Figure 1. From data to knowledge and know-how.

3) An information system means an interconnected set of information resources under the same direct management control that shares common functionality. A system normally includes hardware, software, information, data, applications, communications, and people (SEHISC, 2005). 4) Knowledge is defined in the Free On-line Dictionary of Computing as “what is known by perceptual experience and reasoning” (ICHNET, 2005). Knowledge can either be explicit through, for example, the use of documents, databases and ‘Knowledge Bases’. It can, however, also be implicit, meaning that it may only be known to individuals. Such knowledge is regarded as experiential and therefore difficult to document (O'Dell & Grayson, 1998). 5) A Knowledge Base can be described as a store of knowledge about a domain represented in machine-processable form, which may be rules, facts, or other representations. It may also be referred to as a repository (ICHNET, 2005). A brief survey of how the term ‘Knowledge Base’ is used shows that there seem to be no harmonized approaches. There are a multitude of different types of on-line services which declare themselves to be a KB. On many websites KBs are Frequently Asked Questions (FAQs). Once FAQs and their answers are categorized, keywords added and all entered into a database, such a system develops in the direction of a ‘Knowledge Base’. Often KBs incorporate more extensive content, including also actual documents. 6) Know-how is a particular type of information or knowledge which enables the possessor to achieve a specific goal with an accuracy or precision that would not otherwise be possible (UT-SWMC, 2005). If we now present an example related to forestry we could think of 20.303 as data. The statement that “Finland’s forest land area has increased by 1.6 million hectares since the 1960s to 20.303 million hectares” can be regarded as information; “The increase in forest land area since the 1960’s is a result of afforestation of agricultural lands and peatlands as well as intensive forest improvement efforts” is knowledge. When now combining information and knowledge from different domains (e.g. economical and technological aspects, environment etc.) forest managers may come to the conclusion to further increase,

decrease or keep the forest area stable. In this way combined knowledge becomes knowhow and leads to a specific action. The increasing importance of meta information Embedded in the chain of turning data and information into knowledge and know-how is the organization, transparency and accessibility to the data and information sources (Päivinen & Schuck, 2003). The Internet as an access point for identifying data and information has become increasingly important. It is stated in ‘The Economist’ (2005) that “armed with the Internet the customer (or user) is seizing power”. As users have the opportunity to inform themselves, compare and read or share experiences with other users it is having both the effect of intensifying competition and of raising standards. This applies not only to commercial businesses, but also to research and the dissemination of research results. For example, the minutes of the Joint FAO/UNECE Working Party on Forest Economics and Statistics (2005) state that: “the FAO/UNECE website was now the primary method of communication and welcomed the secretariats intention to make it more structured and easy to navigate. As regards to databases, priority should be given to both data quality and availability of metadata”. There are numerous approaches and standards in organizing data and information resources using metadata. All these initiatives follow common principles of providing a set of standards which allow for discovery of information resources across domains based on a common metadata framework tailoring to particular needs and/or data and information types. Examples of such initiatives are the ISO 19115 (ISO 19115) which defines the schema required for describing geographic information and services. It provides information about the identification, the extent, the quality, the spatial and temporal schema, spatial reference, and distribution of digital geographic data. The European Commission adopted in 2004 a “proposal for a Directive of the European Parliament and of the Council establishing an infrastructure for spatial information in the Community, (INSPIRE)” (COM(2004) 516 final). It lays down general rules for the establishment of an infrastructure for spatial information in Europe, for the purposes of environmental policies and policies or activities which may have a direct or indirect impact on the environment (CEC, 2004). The United States Federal Geographic Data Committee (FGDC) uses the Content Standard for Digital Geospatial Metadata CSDGM which was designed to “provide a common set of terminology and definitions for the documentation of digital geospatial data” (FGDC, 1998). The Dublin Core Metadata Initiative (DCMI) is an organization dedicated to promoting the widespread adoption of interoperable metadata standards and developing specialized metadata vocabularies for describing resources that enable more intelligent information discovery systems (DCMI, 2005).

The Knowledge Base One central component of NEFIS has been the establishment of a web-based KnowledgeBase (KB) with substantive structured information content. The KB of NEFIS includes a repository of metadata and controlled vocabulary standards, software design documents, FAQs, worked examples and software distributions. Its content is based on essential documentation of activities related to forest information systems and services. The KB also gives the options to set up fora for discussions on selected topics and includes a help function introducing the various KB functionalities. The NEFIS KB metadata structure follows the Dublin Core Metadata Element Set. It consists of a set of 15 elements (DCMI, 2004). The set is designed to cater for sharing across

disciplines or within any type of organization needing to organize and classify information. For the NEFIS KB all 15 elements were made available for describing its expected contents (Figure 2). Not all elements were given a mandatory status meaning that content is not needed for all elements in order for a record to be accepted in the KB. Actually only five elements were labelled as mandatory. Those are: Title, Creator, Subject, Description and Identifier. These five elements allow the KB user to get a reasonable overview of the main information content of a resource and to be able to obtain the resource if desired. In particular the element ‘Identifier’ allows linking to an actual information source via a URL or posting a reference number to a publication (e.g. ISBN, ISSN). The KB further provides the option to attach a document to the metadata record which can then be downloaded by a user.

Figure 2. Metdata schema of the KnowledgeBase. Note: the marked elements are mandatory. In order to facilitate a more structured organization of the metadata entries the element ‘Subject’ incorporated a set of controlled vocabulary terms. The terms allow the allocation of information content to a matching or related topic under one or more main headings. The terms themselves were elaborated within the project in view of the intended content of the KB. The actual KB tree structure consists of eight main headings which are then further split up into a number of subheadings (Figure 3). When entering a record to the KB, the information provider has the option to choose from these main headings and subheadings. The information provider may also choose to enter their own keywords as free text to further specify the content of the resource.

Figure 3. Strands for cataloguing contributions to the KnowledgeBase. Technical specifications UML being a standard object-oriented design language (Unified Modelling Language) was used as a tool for elaborating the structural relationships of the NEFIS KB. Figure 4 shows UML use cases for the different functionalities of the KB. The UML class diagrams were then mapped to Extensible Markup Language or XML Schema. In a next step the Hypertext Preprocessor (PHP) was used to implement the NEFIS KB allowing hierarchical access to metadata and documentation related to forest information systems incorporating information from general descriptions to implementation details. PHP is an embedded, server-side scripting language. The meta database itself is based on MySQL. Currently the uploaded files which are attached to metadata records amount to more than 12MB.

Figure 4. UML use cases for the NEFIS KB.

KnowledgeBase contents As of 17 June 2005, the NEFIS KB has more than 130 entries. The display of the records on the main page is organized in terms of utilized controlled vocabulary. A particular metadata record may appear under more than one heading or subheading (Figure 5). The KB has been and will continue to be used to document the outputs from the NEFIS project. The project deliverables, other reports and minutes of the project meetings are available through the KB. Documents related to the NEFIS project include: the report on the NEFIS metadata schema and keyword lists; the NEFIS metadata guidelines; suggestion for a Forest Markup Language (FML); the keyword lists (for Silviculture; Inventory; Forest products and trade flows; Field experiments; Maps and geo-referenced data; Forest health; Vegetation (or forest or land) types; Non-wood goods and benefits; Forest institutions; Rural development; Fires; Forest operations); Data preparation guide for NEFIS data providers; use case storylines and use case diagrams for the UML (Unified Modelling Language); and data preparation guidelines for NEFIS data providers.

Figure 5. Web interface of the NEFIS KnowledgeBase. Note the number of entries is subject to change as new metadata is added continuously. A beneficial side-effect of the KB has been the adding, description and in parts the linking of documentation which has not been officially published (grey literature) but has shown to have been of considerable interest and use to the NEFIS project. The KB allows such documentation to become visible or even downloadable for other users who may find them interesting for their own work. Examples documents are: (1) the European Forest Information System - EFIS - Final Report including the published executive summary (JRC, 2002b); (2) Data Management and Database Issues for Integrated Forest Resource Information System of the Forest Inventory and Monitoring Project (Ibrahim, 1999); and (3) the Workshop Report of the World Forest Monitoring Workshop (Schuck et al., 1996). Figure 6 demonstrates a sample query to NEFIS KB. The information seeker is interested in information related to the Dublin Core Metadata Initiative which has played a central role in the NEFIS project. In directing the search to the main heading ‘Knowledge representation’ and within to subheading ‘Dublin Core’ 13 entries are available. The information seeker receives on display the titles of the 13 entries and is given the option to ask for full metadata details. In this case the metadata record ‘DCMI Terms’ was selected. When following the link under the element ‘Identifier’ the information seeker will be guided to the webpage displaying the Dublin Core Metadata Element Set.

Figure 6. Screenshot of a search of the NEFIS KB. A web-based KB entry interface has been elaborated for easy submission of metadata records. It allows further to edit existing records. In order to be able to enter or edit metadata records, users must login. During the NEFIS project lifetime a selected group of individuals was granted access rights to upload information to the KB. This approach was used as in the course of the project mainly documentation which was identified as useful within the project was uploaded. The overall management, maintenance and access rights control of the NEFIS KB was guaranteed by technical personnel (master user). Plans for maintaining the KB and guaranteeing further entries after the project lifetime will build on a more open structure. Individuals who would like to provide input related to the development of forest information systems and services will be able to register as a KB user and will be given the rights to enter metadata and edit their own metadata records. The KB host in consultation with the NEFIS expert network established within the NEFIS project will make a decision whether the information will be made visible through the KB.

Potential Use and Vision An additional use for the KB could be found in relation to the creation of a European Forest Information and Communication Platform (EFICP). The EFICP initiative, which was preceded by the Council regulations for a European Forest Information and Communication System, EFICS (EEC 1989; last amended regulation: 1998/1100/EC which expired in 2002) is dedicated to the development of a fully operational information system able to retrieve and analyse forest information from distributed national forest information databases. The EFICP objective is to help reporting and monitoring of forest resources at the EU level and to provide information on these resources to interested EU, national, and stakeholder communities. In this context, the NEFIS KB may serve as a useful tool in hosting important documentation when building the EFICP. A further option for the KB would be to seek options on interoperability with other related metadata services so that information can be accessed via other entry gates. One important example within the forestry domain is the Global Forest Information Service. GFIS aims to enhance access to all types of forest information, ensuring that it is accessible to governments and to all stakeholders, including researchers, forest managers, NGOs, community groups and the public at large. It should contribute to an improved understanding of complex forest-related issues, to better decision-making and more informed public

engagement in forest policy and forest management at all levels (GFIS, 2005). The NEFIS KB could therefore serve the GFIS as a potential thematic information provider. Such potential applications and the avenue of promoting the KB as a good reference point for forest service and system related information through popular channels such as popular articles, papers and newsletters may yield good prospects for the growth of the NEFIS KB towards a comprehensive information resource with regards to forest information systems and services.

Reference CEC (2004) Proposal for a Directive of the European Parliament and of the Council establishing an infrastructure for spatial information in the Community (INSPIRE). Brussels, 23.7.2004. COM(2004) 516 final. 2004/0175 (COD). 32 p. http://inspire.jrc.it/sdic_call/EN.pdf. CCFM (2004) National Forest Information System (CCFM-NFIS) - An Overview. Canadian Council of Forest Ministers. CCFM-NFIS Project Office. 7p. http://nfis.org/about/index_e.shtml#OVERVIEW. Davenport, T. and Prusak, L. (2000) Working Knowledge: How Organizations Manage What They Know. Harvard Business School Press. DCMI (2004) Dublin Core Metadata Element Set, Version 1.1. Reference Description 200412-20. http://dublincore.org/documents/2004/12/20/dces/. DCMI (2005) Dublin Core Metadata Initiative - Metadata Terms. Reference description 200501-10. http://dublincore.org/documents/2005/01/10/dcmi-terms/. EEC (1989) Council Regulation (EEC) No 1615/89 of 29 May 1989. Establishing a European Forestry Information and Communication System (EFICS). Official Journal L 165 , 15/06/1989 P. 0012 – 0013. FGDC (1998) Content Standard for Digital Geospatial Metadata Version 2, June 1998. Ibrahim, M. (1998) Data Management and Database Issues for Integrated Forest Resource Information System of the Forest Inventory and Monitoring Project. - An Exploratory Investigation into Data Modelling, Databases, DBMS, and Developing an Integrated Information System for Forest Resources Inventory and Monitoring. Draft final report and recommendations. Forest inventory & Monitoring Project - Indonesia Forest Sector Support Programme. FIMP/INTAG Technical Series. 1998/9. 104p. GFIS (2005) Development of a Global Forest Information Service (GFIS) as a CPF Initiative. GFIS Working Paper 1. 6 p. ICHNET (2005) Interoperability Clearinghouse Glossary of Terms. http://www.ichnet.org/glossary.htm. Reference date to ICHNET Internet glossary 2005-05-20. ISO 19115. Reference date 2005-05-11. http://www.iso.org/iso/en/ISOOnline.frontpage. JRC (2002a) European Forest Information System (EFIS) – Executive Summary. Prepared by: Päivinen, R.; Schuck, A.; Miina, S.; Zudin, S.; Varis, S.; Heikkinen E. (European Forest Institute); Köhl, M. Hehmke, M.; Römisch, K. (Tech. Univ. of Dresden, Chair of Forest Biometrics and Computer Sciences); Voss, H.; Andrienko, G.; Andrienko, N. (GMD); Richards, T.; Wood, R. (UNEP-WCMC). European Commission – Joint Research Centre. Project number: 17186-2000-12 F1ED ISP FI. 17 p. http://www.efi.fi/efidas/efis/documents/FinalReport.pdf. JRC (2002b) European Forest Information System (EFIS) – Final Report. Prepared by: Päivinen, R.; Schuck, A.; Miina, S.; Zudin, S.; Varis, S.; Heikkinen E. (European Forest Institute); Köhl, M. Hehmke, M.; Römisch, K. (Tech. Univ. of Dresden, Chair of Forest Biometrics and Computer Sciences); Voss, H.; Andrienko, G.; Andrienko, N. (GMD); Richards, T.; Wood, R. (UNEP-WCMC). European Commission – Joint Research Centre. Project number: 17186-2000-12 F1ED ISP FI. 146 p. Kennedy, P.; Folving, S.; Munro, A.; Päivinen, R.; Schuck, A.; Richards, T.; Köhl, M.; Voss, H. and Andrienko, G. (2004) European Forest Information System EFIS - a step towards better access to forest information. In: ‘Advances in forest inventory for sustainable

forest management and biodiversity monitoring’ (Eds. P. Corona, M. Köhl, M. Marchetti); IUFRO Conference ‘Collecting and Analyzing Information for Sustainable Forest Management and Biodiversity Monitoring With Special Reference to Mediterranean Ecosystems. Palermo, Sicily (Italy). 4-7 Dec. 2001. Kluwer Publishing. Pp. 295-310. Lucent Technologies (2005) The meaning of information. http://www.lucent.com/minds/infotheory/what1.html. Reference date to Lucent Technologies webpage 2005-05-20. O'Dell, C. and Jackson Grayson Jr., C. (1998) If We Only Knew What We Know: the Transfer of Internal Knowledge and Best Practice. Publisher: The Free Press. Päivinen, R. and A. Schuck (2003) Sharing the information at Global and European level. In: Proceedings of FAO Expert Consultation on Global Forest Resources Assessment 2002. Kotka, Finland, 1-5-July 2002 (Eds. A. Luhtala, J. Varjo). The Finnish Forest Research Institute, Research Papers 889. Pp. 340-346. Schuck, A.; Van Brusselen, J. and Andrienko, G. (2004) The European Forest Information Demonstrator – EFIS. GeoInformatics. March 2004, Vol. 7. 42-45. Schuck, A.; Andrienko, G.; Andrienko, N.; Folving, S.; Köhl, M.; Miina, S.; Päivinen, R.; Richards, T. and Voss, H. (2005) The European Forest Information System -an Internet based interface between information providers and the user community. Computers and Electronics in Agriculture 47(3): 185-206. Schuck, A.; Landis, E. and Palmer, C. (1996) “Meeting national management needs for forest monitoring information and enabling countries to respond to international requirements” World Forest Monitoring Workshop October 21-24, 1996. World Forest Institute. Portland, Oregon, USA. 29 p. SEHISC (2005) Security Related Definitions. Stanford Electronic Health Information Security Committee. http://ostinato.stanford.edu/hipaa-feedback/definitions.html. Reference date to SEHISC webpage 2005-05-20. The Economist (2005) Power at last. April 2nd 2005 Edition. UNECE (2005) Joint FAO/UNECE Working Party on Forest Economics and Statistics Report as approved by the Working Party. Twenty-seventh Session. 22-24 March 2005. Economic Commission for Europe/Timber Committee; Food and Agriculture Organization/European Forestry Commission. TIM/EFC/WP.2/2005/9. 6 April 2005. 11 p. UT-SWMC (2005) What is know how? UT Southwestern Medical Center, Office for Technology Development. http://www2.swmed.edu/technology_development/knowfaq.htm. Reference date to UT-SWMC webpage 2005-05-20.