DDN - HPC Advisory Council

17 downloads 207 Views 6MB Size Report
All Rights Reserved. The Broadest Big Data Portfolio. 3 .... Only DDN can Ensure NO DATA loss on Enclosure Failures ...
June 2012

Designed for Scale HPC Advisory Council

Dr James Coomer Snr. Technical Advisor

©2012 DataDirect Networks. All Rights Reserved.

ddn.com

DDN | 15 years in HPC Storage

2

©2012 DataDirect Networks. All Rights Reserved.

ddn.com

The Broadest Big Data Portfolio File Storage

SAN File Storage

Enterprise NAS

xSTREAMScaler™ 100s SAN/LAN Clients HSM Capable Streaming Optimized

Storage Systems

Silicon Storage Architecture S2A9900™

6GB/s in Real-Time 1,200 Drives: 2 Racks

NAS Scaler™

EXAScaler™

GRIDScaler™

1-16 NAS Servers Fully-Featured High Performance

10Ks of Clients 1TB/s+

1Ks of Clients 1TB/s+ Scale-Out NAS

Object Storage

Storage Fusion Architecture S2A6620™

2GB/s, 350K IOPS 120 Drives in 8U

SAS

Disks

Parallel File Storage

SFA10K™

SFA12K™

15GB/s, 840K IOPS 1,200 Drives: 2 Racks Embedded Computing

40GB/s/1.4M IOPS 1,680 Drives: 2 Racks Embedded Computing

SATA

Capacity + Performance

WOS® 256 Billion Objects GeoReplicated Cloud Foundation Mobile Cloud Access

SSD

Optimized Capacity

Enterprise IOPS

Open Systems Architecture For Maximum Versatility Highly Configurable for Optimized TCO 3

©2012 DataDirect Networks. All Rights Reserved.

ddn.com

Design for Extreme Scale ▶ 

You can build a scalable HPC storage system with conventional arrays. But what is the impact on: •  Reliability and Failure Handling •  Configuration Complexity and Manageability •  Performance and Performance Protection

▶ 

Product Updates: SFA 12K

©2012 DataDirect Networks. All Rights Reserved.

ddn.com

Issues in Large Scale Storage ▶ 

Disk Read Error Rate is constant at 1 in 1015 bits

▶ 

A 50PB system has many components – in particular: •  1500 LUNs, 15,000 disks à 1.2 drive failures per day SFA Software

Software RAID

Capacity Used

60%

60%

Rebuild Speed

40MB/s

25MB/s

Performance Impact

10%

50% when optimised

Striped IO performance during rebuild

820GB/s

455GB/s à 233GB/s

▶ 

5

Large systems are always in failure, always degraded ©2012 DataDirect Networks. All Rights Reserved.

ddn.com

Reliability and Manageability 3.6 PBs RAW Filesystem Small arrays Compute Cluster

OSS 1

OSS 2

OSS 3

Compute Cluster

OSS 1

OSS 4

OSS 2

OSS 3

OSS 4

8 Total Controllers 2 Controllers

1 x SFA10K-X

Let’s understand the overhead of going with multiple small systems ©2012 DataDirect Networks. All Rights Reserved.

ddn.com

Performance Small arrays A File System striped across many smaller systems is not optimal – •  The FS is only as fast as its slowest component – any small failure slows down the entire FS •  This will slow down the entire FS

File System

A File System striped across one big system is optimal – •  Highly over-provisioned backend ensures minimal performance hit during drive rebuilds •  Much lower chances of FS slowing down

File System

1 x SFA10K-X

Let’s understand the implications in detail ©2012 DataDirect Networks. All Rights Reserved.

ddn.com

Amdahl’s Law & Striped File I/O 1 Performance 1 1 1 1 Rebuilds 1 1 Much Lower Drop For Higher 2 Overall 2 Average 2 2 Performance 2 2 2 3

3

3

3

1200"

4

4

4

4

1000"

4SFA4Aver4age 4

5

5

5

5

5Perfo 5 rma5nce 5

600"

6

6

6

6Com6petito 6r 1 A6vera6ge

400"

7

7

7

7

Performance 7

7

7

7

200"

8

8

8

8

8

8

8

8

0"

P

P

P

P

P

P

P

P

Q

Q

Q

Q

Q

Q

Q

Q

Bandwidth (GB/s)

3

©2012 DataDirect Networks. All Rights Reserved.

Time ◊

3

2

1400"

800"

3

1 3

ddn.com

Withstanding Enclosure Failures Small Arrays Compute Cluster

OSS 1

OSS 2

OSS 3

Compute Cluster

OSS 4

LOSS OF CRITICAL DATA on any single enclosure failure

OSS 1

OSS 2

OSS 3

OSS 4

1 2 3 4 5 6 7 8 P Q

NO LOSS OF CRITICAL DATA on up to 4 enclosure failures

Only DDN can Ensure NO DATA loss on Enclosure Failures ©2012 DataDirect Networks. All Rights Reserved.

ddn.com

Quick-Healing Capabilities Small Arrays! Disk Drive Hangs or Misbehaves

Self Healing Capabilities get triggered

Per Drive ! Cache!

Disk Drive Hangs or Misbehaves

1.  Drive is marked bad

1.  Users can Power Cycle the Drive

2.  No effort is made to bring it back up again

2.  During Power Cycle, writes are cached

3.  Rebuild of data starts once drive is replaced

3.  If the suspect drive can be revived, all cached writes are written back to disk and disk can be restored to health rapidly

4.  System goes to degraded mode – data corruption & double/ triple disk failure possibility increases

©2012 DataDirect Networks. All Rights Reserved.

ddn.com

Storage Fusion Architecture™ SFA12K™ Line

©2012 DataDirect Networks. All Rights Reserved. ©2012 DataDirect Networks. All Rights Reserved.

ddn.com

DDN | SFA12K™ ™

800% Faster Consolidate, Simplify, Save Intelligent Appliances: SFA12K-20E, SFA12K-20, SFA12K-40

40% Denser Reclaim Your Data Center Leading Density: StorageScaler™ 8460: 84 Disk in 4U (DDN exclusive)

©2012 DataDirect Networks. All Rights Reserved.

50% Less TCO Smarter Big Data Storage 10+ Years of Innovation: Storage Fusion Processing, Storage Fusion Fabric™, QoS, SATAssure™, ReACT™, more..

ddn.com

SFA12K-20E | Appliances

▶  SFA12K-20E

available with DDN | EXAScaler™ and DDN | GRIDScaler™ parallel file storage solutions ▶  Integrate multiple appliances to scale to over 1000GB/s and 10’s of petabytes EXAScaler SFA12K-20E

GRIDScaler SFA12K-20E

20GB/s Up To 5.3PB* Usable capacity

20GB/s Up To 5.3PB* Usable capacity * - Initial release limited to 840 Drives

©2012 DataDirect Networks. All Rights Reserved.

ddn.com

Storage Fusion Processing™ Embedded computing brings the function to the data, as opposed shipping data to the application.

Fewer data/packet transitions and application co-location within the storage appliance, the Storage Fusion Architecture accelerates data-intensive applications. ©2012 DataDirect Networks. All Rights Reserved.

ddn.com

SFA12K™ | Management

Simple Systems Management ▶ GUI ▶ CLI ▶ SNMP ▶ New API ©2012 DataDirect Networks. All Rights Reserved.

ddn.com

SFA12K™ | Read Quality of Service

SFA OS v1.5 Features ‘Real-Time’ Read Access To A Low-Latency Data Store

DDN provides the markets only true QoS platforms for big bandwidth applications over time. SFA read-QoS provides immediate benefit for: •  Latency-Sensitive Internet-Based Content Organizations (Cloud) •  Massively Parallel File Reads (Avoids Amdahl’s Law) ©2012 DataDirect Networks. All Rights Reserved.

ddn.com

DataDirect Networks, Information in Motion, Silicon Storage Appliance, S2A, Storage Fusion Architecture, SFA, Storage Fusion Fabric, Web Object Scaler, WOS, EXAScaler, GRIDScaler, xSTREAMScaler, NAS Scaler, ReAct, ObjectAssure, In-Storage Processing and SATAssure are all trademarks of DataDirect Networks. Any unauthorized use is prohibited.

17

©2012 DataDirect Networks. All Rights Reserved.

ddn.com