High Performance Computing Infrastructure in Japan

81 downloads 9276 Views 3MB Size Report
Jan 16, 2013 ... “K computer”, supercomputers and high performance storage. ➢ first production level infrastructure for high performance computing in Japan.
High Performance Computing Infrastructure in Japan

Kento Aida National Institute of Informatics

Kento Aida, National Institute of Informatics

2

Overview  of  HPCI

Kento Aida, National Institute of Informatics

3

Introduction n  High Performance Computing Infrastructure (HPCI) Ø  national project promoted by Ministry of Education, Culture, Sports, Science and Technology (MEXT) in Japan Ø  distributed computing infrastructure for high performance computing ü “K computer”, supercomputers and high performance storage

Ø  first production level infrastructure for high performance computing in Japan

n  roadmap Ø  – Mar 2011

basic design

ü network, authentication, user management, shared storage, testbed for advanced software

Ø  Apr – Dec 2011 detailed design Ø  Jan – Aug 2012 test operation Ø  Sep 2012 – production level operation

Kento Aida, National Institute of Informatics

4

Services (1) account registration

(2) single sign-on

HPCI account

cert.

ü  input HPCI account and password   ü  operation through a web   browser

ü  application ü  account

         

(3) login to resources ü  no password ü  run jobs on supercomputers ü  access files on shared storages

                                              computer    

HPCI shared storage

Kento Aida, National Institute of Informatics

5

System Overview user management HPCI ID registration review proposals

authentication

CA system

HPCI acct.

shib. SP

apply certificate

acct. registration

portal

certificate repository

single sign-on helpdesk

shib. SP

HPCI Secretariat (RIST)

AICS (K-computer) Supercomputer Centers in 9 Universities

                                              computer computer  resource resource   shared storage

AICS, U. Tokyo

             

computer resource

shib. shib. IdP shib. IdP IdP

NII

network infrastructure

More resources will be connected after 2012.

Kento Aida, National Institute of Informatics

As of Nov. 2012

Computing Resources RIKEN AICS: K computer (10.62PF, 1.27PiB/30PiB) Kyoto Univ. XE6 (300.8 TF, 59 TB) GreenBlade8000(242.5TF, 38TB) 2548X(10.6TF, 24TB)

Hokkaido Univ.: SR16000/M1(51.6TF/172TF, 6.6TB/ 22TB) BS2000 (5.76TF/44TF, 1.92TB/14TB) RENKEI-VPE: VM Hosting

Osaka Univ.: SX-9 (16TF, 10TB) SX-8R (5.3TF, 3.3TB) PCCluster (6.1TF, 2.0TB)

Kyushu Univ.: FX10 (68.1TF/181.6TF, 9.2TB/24TB) CX400 (44.2TF/510.1TF, 16.4TB/184.5TB) SR16000 L2 (25.3TF, 5.5TB)

source: M. Hirakawa, AICS

Nagoya Univ.: FX1(30.72TF, 24TB) HX600(25.6TF, 10TB) M9000(3.84TF, 3TB)

Tohoku Univ.: SX-9(29.4TF, 18TB) Express5800 (1.74TF, 3TB) Univ. of Tsukuba: T2K (95.4Tflops, 20TB) HA-PACS (802Tflops, 34.3TB) FIRST (36.1TFlops, 1.6TB) Univ. of Tokyo: FX10 (1.13PF, 150TB) SR16000/M1(54.9TF, 10.94TB) T2K (75.36TF/140TF, 16TB/31.25TB) EastHubPCCluster(10TF/13TF, 5.71TB/ 8.15TB) GPU Cluster(CPU 4.5TF, GPU 16.48TF, 1.5TB) WestHubPCCluster(12.37TF,8.25TB) Tokyo Institute ofHosting Technology: RENKEI-VPE:VM TSUBAME2.0 (0.24PF/2.4PF, 10TB/ 100TB) RENKEI-VPE : VM Hosting

Storage HPCI WEST HUB

HPCI EAST HUB University of Tokyo

AICS, RIKEN

•  12 PB+ storage

•  10 PB+ storage

Hokkaido University

Gfarm2 is used as the global shared file system Kyushu University

Tohoku University University of Tsukuba Tokyo Institute of Technology Nagoya University Osaka University Kyoto University

source: Y. Ishikawa, Univ. of Tokyo

Network (SINET4) SINET4: Science Information NETwork 4

9

SINET4 (cont’d) n  connection to 700+ academic sites n  IX for commercial networks n  80Gbps backbone between Tokyo and Osaka Ø  134(30Gbps) in Tokyo Ø  22(11Gbps) in Osaka n  L3VPN, L2VPN/VPLS, QoS CA portal

user

user

univerisity

university

user

user

IX (Tokyo)

QoS

IX (Osaka)

commercial network

VPN non-comercial network university

university

AICS LAN storage user compt. resource

storage user compt. resource

storage user compt. resource

resource provider

resource provider

storage user compt. resource

Kento Aida, National Institute of Informatics

10

Cloud Service n  VM hosting Ø  repository for research results Ø  pre/post processing Ø  testbed for prototype system software

source: S. Takizawa, Tokyo Tech.

Kento Aida, National Institute of Informatics

11

Authen3ca3on  System

Kento Aida, National Institute of Informatics

12

Overview of Authentication System n  access to web portals: Shibboleth Ø  management of certificates, user support, cloud service

n  access to remote computers: GSI Ø  login to remote computers, access to shared storage

n  bridge between shibboleth and GSI: web portal user

portal IdP, HPCI account pass word single  sign-­‐on

% gsi-ssh host.univ.ac.jp

(1)  sign-on to the portal (cert. issuing system) (2)  generate a proxy certificate and download the proxy certificate (3) ssh login to remote computers ü  no need to give local account name and password

• login to remote computers • access to shared storage Kento Aida, National Institute of Informatics

13

Architecture NII ü apply user cert. ü single sigh-on

cert. management system

portal (Shib. SP)

cert. repository

proxy cert. repository

browser

CA system (Shib. SP)

Shib. DS SINET 4 ü login to resources

supercomputer centers, RIKEN portal (Shib. SP) proxy cert. repository

GSI-SSH client

supercomputer centers, RIKEN

Shib. IdP account DB

GSI-SSH server

Kento Aida, National Institute of Informatics

14

Architecture (cont’d) NII ü apply user cert. ü single sigh-on

cert. management system

portal (Shib. SP)

cert. repository

proxy cert. repository

browser

CA system (Shib. SP)

Shib. DS SINET 4 ü login to resources

supercomputer centers, RIKEN portal (Shib. SP) proxy cert. repository

GSI-SSH client

supercomputer centers, RIKEN

Shib. IdP account DB

GSI-SSH server

Kento Aida, National Institute of Informatics

15

Software role Certificate Authority

system

software

CA system

NAREGI-CA

certificate management

custom software

certificate repository

MyProxy

ID federation

Shibboleth

Portal (NII,supercomputer centers)

portal (cert. issuing system)

custom software

Proxy certificate repository

MyProxy

ID federation

Shibboleth

Identity Provider (supercomputer centers, AICS)

ID federation

Shibboleth

Resource Provider (supercomputer centers, AICS)

middleware to access resources

GSI-SSH Gfarm

Kento Aida, National Institute of Informatics

16

Summary and Future Plan n  Summary Ø  This talk presents a design of HPCI focusing on the authentication mechanism. Ø  HPCI started production level operation in Sep. 2012.

n  Issues Ø  interoperation with oversea infrastructure ü review of the operation in HPCI CA to obtain approval of International Grid Trust Federation (IGTF)

Ø  federation with other authentication system ü discussion about the federation with other web authentication systems, e.g. OpenID

Kento Aida, National Institute of Informatics

17

h=ps://www.hpci-­‐office.jp/

Kento Aida, National Institute of Informatics