Data and Metadata Representation

0 downloads 0 Views 230KB Size Report
should be open, it supports existing standards regarding the metadata scheme ... This concerns the creation of new collections and the modification of collection.
Organising and Maintaining Dynamic Digital Collections M. Nikolaidou1, D. Anastasiou2, D. Anagnostopoulos2, M. Hatzopoulos1 1

Department of Informatics, University of Athens Panepistimiopolis, 15771 Athens, Greece

{mara}, {mike}@di.uoa.gr

1.

2

Department of Geography Harokopion University of Athens 70 El. Venizelou Str, 17671, Athens, Greece {danasta}, {dimosthe}@hua.gr

Introduction

To provide access to research material to researchers and physicians of Athens Medical School (AMS), a Digital Library system (AMS DL) was developed. It maintains medical images produced by the laboratories of the Medical School. End-users access the system through the Web. Images are catalogued and processed partially by laboratory scientific staff and partially by cataloguers in the Central Library of Heath Sciences. AMS DL system is based on a multi-tiered client-server model and is implemented using Java and IBM Content Manager platform [1]. Since the system should be open, it supports existing standards regarding the metadata scheme and communication interfaces. Thus, the metadata scheme adopted is an extension of Dublin Core, while results are obtained as XML pages [2]. Laboratory requirements significantly differ, thus it was decided to develop a different collection for each laboratory. Since the number of collections needed is not static and predefined, we have identified two requirements: a. the need to easily create new collections and b. the need to extend or modify collection description (e.g. the metadata information used to characterize the images included in each collection). We introduced the term dynamic collection management to denote the support of automated collection definition and management within an integrated digital library environment.

2.

Dynamic Collection Management

This concerns the creation of new collections and the modification of collection description. For each collection added in digital library environment, the corresponding object structure and metadata model must be defined. Collection description can be derived from existing collections by extending the object structure and metadata model, e.g. a collection description can be defined as the descendant of an existing collection description, while additional object parts and metadata fields can also be defined. This feature allows flexibility during collection definition and facilitates collection description in a simplified manner. In the case of AMS DL, each collection consists of the medical images produced by a specific laboratory. Thus, they are characterized by common general metadata and domain specific metadata, useful for researchers in the specific domain [3]. The general metadata model and a basic object structure corresponding to medical image objects are used to describe the Medical Image Collection. The Medical Image Collection Description is used to easily define collections corresponding to each laboratory as its descendants, while the collection is practically empty. In order to efficiently support dynamic collection management, AMS DL facilitates dynamic interface creation both for processing and cataloguing and collection search. The same interface is used for all collection, while screens are dynamically formed based on collection description.

3.

Data and Metadata Representation

The following parts are included in the medical image objects belonging in the collections: Original Image, a high quality image strongly protected regarding copyright issues, Derivative Image produced from the original image to be accessed through the Web, Watermarked Image, Screen Size and Thumbnail Image and Image Description in Greek and English. The original image and the description are produced by the researcher, while all other formats are produced by the cataloguer. The Dublin Core metadata scheme is used to describe general metadata information [2]. Additional fields were used to represent domain-specific metadata. Since these fields are collection specific, we adopted a DC-like structure for their representation in XML, where the collection is depicted similar to a DC qualifier: collection_name/< AMSq:collection> local_field_value/ /< AMS:local_field_name>

Medical Image data and metadata internal representation using Content Manager constructs are presented in figure 1. As indicated in this figure, the digital object used to represent Medical Image Objects consists of all derived images and image descriptions. Since the system must support both exact and approximate search in combined multi-valued metadata fields, the searching capabilities of a relational database were too poor to ensure Collection Search application performance. Thus, database search is applied for exact numerical and date metadata field search, while string exact and approximate search is performed using free text search capabilities Metadata information is stored both in the underlying database and within a tagstructured text part in the corresponding Medical Image digital object (metadata part). Different tags are used to support Greek and English languages, while all properties of a specific metadata field, are included within one tag. D o c u m e n t: M e d ic a l Im a g e

D o c u m e n t: O rig in a l Im a g e

K e y F ie ld s (M e ta d a ta )

K e y F ie ld s (M e ta d a ta ) O r i g I m a g e _ ID : S trin g … … …

Im a g e _ ID

: S trin g

M a in T itle _ G R M a in T itle _ E N A lte rn a tiv e T itle _ G R A lte rn a tiv e T itle _ E N D e s c r ip tio n _ G R D e s c r ip tio n _ E N K e yw o rd s _ G R K e yw o rd s _ E N K e yw o rd s _ T yp e S u b je c t s S u b je c t s _ T y p e s C r e a to rs _ ID s P u b lis h e rs _ ID s C o n trib u to r s _ ID s R o le s _ ID s R ig h ts _ ID s R e la tio n s _ ID s R e la tio n _ T y p e s R e la tio n _ V a lu e s Im a g e Q u a lity C r e a tio n D a te In s e rtio n D a te L a s tU p D a te In s e rte d B y U s e r M o d ifie d B y U s e r C o m m e n ts O r ig in a lH e ig h t O r ig in a lW id th O r ig in a lF o rm a t O r ig in a lF ile N a m e D e riv a tiv e H e ig h t D e riv a tiv e W id th D e riv a tiv e F o rm a t D e riv a tiv e F ile N a m e S c re e n H e ig h t S c re e n W id th S c re e n F o rm a t S c re e n F ile N a m e T h u m b n a ilF ile N a m e

: : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : :

S trin g (1 5 0 ) S trin g (1 5 0 ) S trin g (1 5 0 ) S trin g (1 5 0 ) S trin g (3 0 0 ) S trin g (3 0 0 ) S trin g (1 0 0 ) S trin g (1 0 0 ) S trin g (2 0 ) S trin g (1 0 0 ) S trin g (1 0 0 ) S trin g (1 5 0 ) S trin g (1 5 0 ) S trin g (1 5 0 ) S trin g (1 5 0 ) S trin g (8 0 ) S trin g (1 5 0 ) S trin g (1 0 0 ) S trin g (1 5 0 ) … … D a te D a te D a te S trin g (2 0 ) S trin g (2 0 ) S trin g (3 0 0 ) S h o rt In te g e r S h o rt In te g e r S trin g (6 ) S trin g (3 0 ) S h o rt In te g e r S h o rt In te g e r S trin g (6 ) S trin g (3 0 ) S h o rt In te g e r S h o rt In te g e r S trin g (6 ) S trin g (3 0 ) S trin g (3 0 )

H is to lo g ic a l M e ta d a ta O rg an : S trin g (3 0 ) S o u rc e : S trin g (3 0 ) G ender : S trin g (3 0 ) E q u ip m e n t : S trin g (1 0 0 ) D is e a s e : S trin g (5 0 )

P a rts

O r ig in a l Im a g e

P a rts R D F T a g s (D u b lin C o r e ) < r d f: D e s c r ip tio n … ..> < D C :T itle > T h e U ltim a te … /< D C :T itle > … … … … … … … … … … … . W a te r m a rk e d Im a g e S c re e n S iz e Im a g e T h u m b n a il Im a g e D e riv a tiv e Im a g e

In d e x e d b y Im a g e S e a rc h S e rv e r

M e ta d a ta (S tru c tu re d T e x t) x x E N x x y y M a i n T i t l e y y T h e U lt im a t e … x x G R x x y y M a in T itle y y Τ ο Α π ό λ υ το … y y S u b j e c t s y y H is t o lo g ic a l… y y S u b je c ts T y p e y y M e S H … … … …

E n g lis h D e s c r ip tio n

G reek D e s c r ip tio n

In d e x e d b y T e x t S e a rc h S e rv e r

Figure 1. Medical Image Data and Metadata Representation

References 1. 2. 3.

Darmoni SJ et.al: CISMeF: A Structured Health Resource Guide. Methods Information Medicine 39 (2001) Weibel S., Hakala J.:DC-5: The Helsinki Metadata Workshop. D-Lib Magazine 4 (1998). Content Manager Documentation. IBM Corporation (2000)

Suggest Documents