NERC DataGrid: Googling for Secure Data - National e-Science ...

10 downloads 59 Views 4MB Size Report
Bryan Lawrence on behalf of the NDG,. BADC and BODC. Ray Cramer, Marta Gutierrez, Kerstin Kleese, Siva Kondapalli, Sue Latham,. Roy Lowry, Kevin O' Neill ...
NERC DataGrid: Googling for Secure Data

Bryan Lawrence on behalf of the NDG, BADC and BODC. Ray Cramer, Marta Gutierrez, Kerstin Kleese, Siva Kondapalli, Sue Latham, Roy Lowry, Kevin O’Neill, Ag Stephens, Andrew Woolf

British Atmospheric Data Centre http://badc.nerc.ac.uk

Outline • • • •

NDG Aims and Metadata Taxonomy Demonstration of NDG in action NDG Authorisation – the security bit! Status

British Atmospheric Data Centre http://badc.nerc.ac.uk

Timelines & Bottom Line • 2002: E-science arrives at NERC: – Legacy Systems with millions of files and terabytes of data and existing access and authorisation systems that cannot easily be replaced. – Complex existing DISCOVERY metadata systems. – Discovery (where it exists) based on Z39.50 – Utilisation based on file retrieval.

• 2004: NERC DataGrid ready to move forward – New metadata systems describe data as well as datasets. – OAI based harvesting supports scalable FAST data discovery. – Requirements capture for new authorisation systems complete, and coding underway for implementation. – New communities involved, and international discovery very close to operational reality.

• 2005: – Utilisation based on metadata, on demand server side behaviours, grid-based back end parallelisation etc British Atmospheric Data Centre http://badc.nerc.ac.uk

Complexity + Volume + Remote Access = Grid Challenge British Atmospheric Data Centre

Simulations

British Oceanographic Data Centre Assimilation British Atmospheric Data Centre http://badc.nerc.ac.uk

http://ndg.nerc.ac.uk

NDG Metadata Taxonomy

British Atmospheric Data Centre http://badc.nerc.ac.uk

NDG Metadata Architecture

Service based model: • clear separation between discovery and use • discovery service standards compliant and interoperable British Atmospheric Data Centre http://badc.nerc.ac.uk

(D) - Discovery Open Archives Initiative – Digital Library Protocol for harvesting metadata. OAI

OAI

NDG Supports Multiple Discovery Services – “build your own”

Multiple Protocol Support will be built into the “NDG Vanilla Discovery Service” Existing Metadata

Intermediate Schema Document(s) (XML)

XSLT Ingest Transformation

NDG Discovery Service Element

British Atmospheric Data Centre http://badc.nerc.ac.uk

XSLT Processor

Directory Interchange Format

XSLT Processor

Dublin Core

XSLT Processor

GEO Profile (Z39.50)

ISO 19115?

Catalogue Interoperabiltiy Protocol ?

Internet Link

tape robot

Online Data

XML database BADC NDG Wrapper

Online Data

Online Data

XML database

XML database

BODC NDG Wrapper

Group NDG Wrapper

Wider Internet NERC Grid Software Agent

Grid User

ESG (&other) Applications

Satellite

Research Group Data Sources

Wider Internet

NDG Web Portal

Internet User

Internet Link

XML database

British Atmospheric Data Centre http://badc.nerc.ac.uk

Supercomputer

Discovery

British Atmospheric Data Centre http://badc.nerc.ac.uk

Choose to go to A service or B service.

Can order responses by title or data centre (or default random)

Flexible Information Return

British Atmospheric Data Centre http://badc.nerc.ac.uk

Look at DIFs in either HTML or XML

Current Interface

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

Background activity being parallelised with GODIVA/CCLRC e-science collaboration (spectral -> gridpoint + CDMS + visualisation tools)

Download either plot or the data that went into the plot. British Atmospheric Data Centre http://badc.nerc.ac.uk

British Atmospheric Data Centre http://badc.nerc.ac.uk

International Dimension

British Atmospheric Data Centre http://badc.nerc.ac.uk

Southampton Oceanography Centre

British Atmospheric Data Centre http://badc.nerc.ac.uk

Authorisation • Role-based access:

Signed “conditions of use” form exists for this dataset

badc.nerc.ac.uk ukmo-obs researcher ukmo-obs nerc

• Key concept: Only hosts that trust each other share data, even within a larger virtual organisation: e.g. at BADC: ndg.bodc.nerc.ac.uk nerc ashoe nerc bodc British Atmospheric Data Centre http://badc.nerc.ac.uk

NDG Security

Certificate based, pass encrypted credentials between user and gatekeeper.

British Atmospheric Data Centre http://badc.nerc.ac.uk

Where are we? • Migration to web services underway for some components, new A services in design phase, implementation details not yet obvious (e.g. GT4 etc). • Major effort on defining feature types for observation types so we can build an OGC/ISO compatible data extractor for observations and numerical data. • Security Infrastructure Development – Collaboration with CCLRC e-science, ECOGrid

• Ongoing work on metadata definition and population: – Oceanographic data – Atmospheric Chemistry data • Major issues with (un)controlled vocabularies

– Numerical Modelling data • DIF numerical definition (moving to ISO), BADC and UK Community • Katherine Bouton’s work at NCAS/CGAM (“B” MODEL METADATA)

– Remote Sensing Data • Collaboration with NEODC and PML

• Ongoing work on databases and interfaces, DIF to ISO and “B” British Atmospheric Data Centre http://badc.nerc.ac.uk