The UK e-Science Program: Next Generation Grid Applications Tony Hey Director of UK e-Science Core Program
[email protected]
e-Science and the Grid ‘e-Science is about global collaboration in key areas of science, and the next generation of infrastructure that will enable it.’ John Taylor, DG of UK OST Ø Requirements of e-Science Grid Application Projects determine services required by Grid middleware Ø UK Projects focus more on Grid Data Services than Teraflop/s HPC systems
NASA’s IPG Bill Johnston’s vision for the Information Power Grid is to revolutionize how NASA addresses large-scale science and engineering problems by providing persistent infrastructure for – “highly capable” computing and data management services that, on-demand, locate and co-schedule multi-Center resources to address large-scale and/or widely distributed problems – the ancillary services needed to support the workflow management frameworks that coordinate the processes of distributed science and engineering
Multi-disciplinary Simulations Wing Models •Lift Capabilities •Drag Capabilities •Responsiveness
Airframe Models
Stabilizer Models
•Deflection capabilities •Responsiveness Crew Capabilities - accuracy - perception - stamina - re-action times - SOP’s
Human Models
Engine Models
•Braking performance •Steering capabilities •Traction •Dampening capabilities
Landing Gear Models
•Thrust performance •Reverse Thrust performance •Responsiveness •Fuel Consumption
Whole system simulations are produced by coupling all of the sub-system simulations
UK e-Science Initiative • $180M Programme over 3 years • $130M is for Grid Applications in all areas of science and engineering • $50M ‘Core Program’ to encourage development of generic ‘industrial strength’ Grid middleware Ø Require $30M additional ‘matching’ funds from industry
UK e-Science Projects • $130M for e-Science application ‘pilots’ - span all sciences and engineering • Particle Physics and Astronomy (PPARC) - $25M GridPP and $8M AstroGrid • Engineering and Physical Sciences (EPSRC) - funding 6 projects at around $5M each • Biology, Medical and Environmental Science - funding projects with total value of $35M
UK e-Science Pilot Projects (1) • • • • • • • •
GRIDPP (PPARC) ASTROGRID (PPARC) Comb-e-Chem (EPSRC) DAME (EPSRC) DiscoveryNet (EPSRC) GEODISE (EPSRC) myGrid (EPSRC) RealityGrid (EPSRC)
UK e-Science Pilot Projects (2) • • • • • • •
Climateprediction.com (NERC) Oceanographic Grid (NERC) Molecular Environmental Grid (NERC) NERC DataGrid (NERC + OST-CP) Biomolecular Grid (BBSRC) Proteome Annotation Pipeline (BBSRC) High-Throughput Structural Biology (BBSRC) • Global Biodiversity (BBSRC)
UK e-Science Pilot Projects (3) • • • • • •
Biology of Ageing (BBSRC + MRC) Sequence and Structure Data (MRC) Molecular Genetics (MRC) Cancer Management (MRC + PPARC) Clinical e-Science Framework (MRC) Neuroinformatics Modeling Tools (MRC)
Some UK Core Program Projects • • • • • • •
MIASGRID (OST-CP) AKTing (OST-CP) EquatorGrid (OST-CP) DIRCGrid (OST-CP) MB-NG (OST-CP/PPARC) UK EDG (OST-CP/PPARC) OGSA-DAI (OST-CP)
Ø Plus Centre Industrial Grid Projects
Powering the Virtual Universe http://www.astrogrid.ac.uk (Edinburgh, Belfast, Cambridge, Leicester, London, Manchester, RAL) Multi-wavelength showing the jet in M87: from top to bottom – Chandra X-ray, HST optical, Gemini mid-IR, VLA radio. AstroGrid will provide advanced, Grid based, federation and data mining tools to facilitate better and faster scientific output.
Picture credits: “NASA / Chandra X-ray Observatory / Herman Marshall (MIT)”, “NASA/HST/Eric Perlman (UMBC), “Gemini Observatory/OSCIR”, “VLA/NSF/Eric Perlman (UMBC)/Fang Zhou, Biretta (STScI)/F Owen (NRA)”
p12
Printed: 27/09/2002
Comb-e-Chem Project Video
Simulation
Diffractometer
Properties Analysis Structures Database
X-Ray e-Lab
Properties e-Lab Grid
myGrid Project • Imminent ‘deluge’ of genomics data • Highly heterogeneous • Highly complex and inter-related • Convergence of data and literature archives
myGrid Generic Technologies 1. 2. 3. 4. 5.
Database access from the Grid Process enactment on the Grid Personalisation services Metadata services Development of Agent Services
Grid Services + Ontologies Ø
Towards the ‘Semantic Grid’
DAME Project In flight data
Airline
Global Network eg: SITA
Ground Station
DS&S Engine Health Center Maintenance Centre
Internet, e-mail, pager
Data centre
Geodise Project
Engineer
GEODISE PORTAL Knowledge repository Ontology for Engineering, Computation, & Optimisation and Design Search
Visualization
Session database
Traceability OPTIMISATION OPTIONS System
APPLICATION SERVICE PROVIDER Intelligent Application Manager
Reliability Security QoS
CAD System CADDS IDEAS ProE CATIA, ICAD
Globus, Condor, SRB Optimisation archive COMPUTATION
Licenses and code
Analysis CFD FEM CEM
Parallel machines Clusters Internet Resource Providers Pay-per-use
Intelligent Resource Provider
Geodise will provide grid-based seamless access to an intelligent knowledge repository, a state-of-the-art collection of optimisation and search tools, Design industrial strength analysis codes, and distributed computing & data resources
archive
UK Grid ‘Core Program’ 1. Ø 2. 3. 4. 5. 6.
Network of e-Science Centres UK e-Science Grid Generic/Industrial Grid Middleware e-Health Grid ‘Grand Challenge’ Support for e-Science Applications Outreach/International Activities Grid Network Issues
UK e-Science Grid Edinburgh Glasgow DL Belfast
Newcastle Manchester Cambridge
Oxford Cardiff
RAL
London
Southampton
Hinxton
Access Grid
Access Grid
Industrial Grid Projects • Grid Application Projects have more than $15M industrial input - mostly major pharmaceutical and engineering companies • Around $24M allocated for collaborative industrial projects for middleware/tools - at present $20M allocated with matching industrial funding - more than 50 companies involved
Open Grid Services Architecture • Development of web services from W3C • OGSA will provide – Naming /Authorization / Security / Privacy – Higher level services: Workflow, Transactions, Data Mining, Knowledge Discovery,… Ø Exploit Synergy: Commercial Internet with Grid Services
E-Science Centre Projects Example Industrial Projects: • E-Science Portal (with Sun) • OSCAR-G (with Intel and Compusys) • Tele-Medicine (with Siemens, Cancer Trust and Regional Hospitals) • Molecular Informatics (with Unilever) • GRIDmist (with HP) • Grid Data Services (with IBM,Oracle)
Databases in the Grid Data Complexity
Computational Complexity
OGSA – Data Access and Integration Project - Key middleware area for UK Program - Develop high-quality data-centric middleware capability - Total Budget $5M (CP $2M) - Three Centres: Edinburgh, Manchester and Newcastle - Industrial partners: IBM US, IBM Hursley and Oracle UK.
OGSA – DAI Project Update • Design Specification completed – Papers for GGF5 WG on Database Access and Integration Services • Three Prototypes delivered: – Distributed Query Service – XML Database Interface – Relational Database Interface • Beta versions by November 2002 – Integrate with Globus GT3?
e-Health Grid Projects • Grid-Enabled Knowledge Services for Medical Informatics - Triple Assessment in Breast Cancer: Fusion of Clinical, Radiological and Cytological data • Grid-based Medical Devices for Everyday Health - Patient sensors, mobile wireless communication
e-Diamond
The challenge
Medical Image Analysis Technology
Clinical Validation
GRID
Clinical Application
e-Diamond
Mammograms have different appearances, depending on image settings and acquisition systems
SMF is a normalised representation independent of scanner settings
e-Diamond Training and Differential Diagnosis
Applications of SMF Teleradiology and QC VirtualMammo
“Find one like it”
? Advanced CAD SMF-CAD workstation
Epidemiology SMFcomputed breast density
Grid Network Team • Expert group to identify end-to-end network bottlenecks and other network issues - e.g. problems with multicast for Access Grid • Identify e-Science project requirements • Funding £0.5M traffic engineering/QoS project with PPARC, UKERNA and CISCO - investigating MPLS using SuperJANET network • Funding DataGrid extension project investigating bandwidth scheduling with PPARC • Proposal for ‘UKLight’ lambda connection to Chicago and Amsterdam
SuperJanet4, June 2002 Scotland via Edinburgh
Scotland via Glasgow
NNW
20Gbps 10Gbps 2.5Gbps 622Mbps 155Mbps
WorldCom Glasgow
WorldCom Edinburgh NorMAN
YHMAN Northern Ireland MidMAN
WorldCom Manchester
WorldCom Leeds EMMAN
WorldCom Reading
WorldCom London
EastNet
TVN
South Wales MAN SWAN& BWEMAN
WorldCom Bristol
External Links
WorldCom Portsmouth
LMN
LeNSE
Kentish MAN
Research Challenges • Building a Future Infrastructure - Developing a Semantic Grid - Trusted Ubiquitous Systems - Rapid Customized Assembly of Services - Autonomic Computing • Putting the Infrastructure to work - Support for New Forms of Community - Socio-Economic Impact - Collaboratory IPR and legal issues
Summary • UK Grid projects emphasize data federation as much as computation • Metadata and ontologies will be key to higher level Grid services • e-Science will produce a deluge of scientific data that must be annotated and curated in scientific data ‘digital libraries’
e-Science and the Grid ‘e-Science will change the dynamic of the way science is undertaken.’ John Taylor, 2001 ‘[The Grid] intends to make access to computing power, scientific data repositories and experimental facilities as easy as the Web makes access to information.’ Tony Blair, 2002