June 2012
Designed for Scale HPC Advisory Council
Dr James Coomer Snr. Technical Advisor
©2012 DataDirect Networks. All Rights Reserved.
ddn.com
DDN | 15 years in HPC Storage
The Broadest Big Data Portfolio
File Storage
• SAN File Storage: xSTREAMScaler™ (100s of SAN/LAN clients, HSM capable, streaming optimized)
• Enterprise NAS: NAS Scaler™ (1-16 NAS servers, fully featured, high performance)
• Parallel File Storage: EXAScaler™ (10Ks of clients, 1TB/s+); GRIDScaler™ (1Ks of clients, 1TB/s+, scale-out NAS)
Object Storage
• WOS®: 256 billion objects, geo-replicated, cloud foundation, mobile cloud access
Storage Systems
• Silicon Storage Architecture S2A9900™: 6GB/s in real time; 1,200 drives in 2 racks
• Storage Fusion Architecture S2A6620™: 2GB/s, 350K IOPS; 120 drives in 8U
• SFA10K™: 15GB/s, 840K IOPS; 1,200 drives in 2 racks; embedded computing
• SFA12K™: 40GB/s, 1.4M IOPS; 1,680 drives in 2 racks; embedded computing
Disks
• SAS, SATA, SSD
• Capacity + Performance, Optimized Capacity, Enterprise IOPS
Open Systems Architecture for Maximum Versatility. Highly Configurable for Optimized TCO.
Design for Extreme Scale
▶ You can build a scalable HPC storage system with conventional arrays. But what is the impact on:
• Reliability and Failure Handling
• Configuration Complexity and Manageability
• Performance and Performance Protection
▶ Product Updates: SFA 12K
Issues in Large Scale Storage
▶ Disk read error rate is constant at 1 in 10^15 bits
▶ A 50PB system has many components – in particular:
• 1500 LUNs, 15,000 disks → 1.2 drive failures per day
                                        SFA Software   Software RAID
Capacity Used                           60%            60%
Rebuild Speed                           40MB/s         25MB/s
Performance Impact                      10%            50% when optimised
Striped IO performance during rebuild   820GB/s        455GB/s → 233GB/s
▶ Large systems are always in failure, always degraded
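The figures on this slide can be related with simple arithmetic. The sketch below is illustrative only. The 3% annual failure rate and the 3 TB drive size are assumptions, chosen because they roughly match the slide's 1.2 failures per day and 50PB across 15,000 disks. It also assumes that a rebuild copies only the used 60% of a drive. Only the error rate, disk count, capacity used and rebuild speeds come from the slide.

```python
SECONDS_PER_HOUR = 3600


def drive_failures_per_day(n_disks, annual_failure_rate):
    """Expected drive failures per day for a population of disks."""
    return n_disks * annual_failure_rate / 365


def expected_read_errors(bytes_read, bit_error_rate=1e-15):
    """Expected unrecoverable read errors when reading bytes_read bytes."""
    return bytes_read * 8 * bit_error_rate


def rebuild_hours(drive_bytes, fraction_used, rebuild_bytes_per_s):
    """Hours to rebuild one drive, assuming only used capacity is copied."""
    return drive_bytes * fraction_used / rebuild_bytes_per_s / SECONDS_PER_HOUR


# 15,000 disks from the slide; 3% annual failure rate is an assumption.
failures = drive_failures_per_day(15_000, 0.03)

# Reading the full 50PB once at 1 error in 10^15 bits.
errors = expected_read_errors(50e15)

# 3 TB drive size is an assumption; 60% used and the speeds are from the slide.
sfa_hours = rebuild_hours(3e12, 0.60, 40e6)
raid_hours = rebuild_hours(3e12, 0.60, 25e6)

print(f"drive failures per day: {failures:.2f}")
print(f"read errors per full 50PB read: {errors:.0f}")
print(f"rebuild at 40MB/s: {sfa_hours:.1f} h, at 25MB/s: {raid_hours:.1f} h")
```

Under these assumptions a rebuild takes many hours while a new failure arrives roughly every 20 hours, which is consistent with the slide's point that a system of this size is rarely fully healthy.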
Reliability and Manageability
3.6 PB raw filesystem
• Small arrays: Compute Cluster → OSS 1, OSS 2, OSS 3, OSS 4 → 8 total controllers
• 1 x SFA10K-X: Compute Cluster → OSS 1, OSS 2, OSS 3, OSS 4 → 2 controllers
Let’s understand the overhead of going with multiple small systems
Performance
Small arrays: a file system striped across many smaller systems is not optimal –
• The FS is only as fast as its slowest component – any small failure slows down the entire FS
1 x SFA10K-X: a file system striped across one big system is optimal –
• Highly over-provisioned backend ensures minimal performance hit during drive rebuilds
• Much lower chance of the FS slowing down
Let’s understand the implications in detail
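The "only as fast as its slowest component" claim can be sketched as a simple model. This is a deliberate simplification, not a measurement: it assumes a file striped evenly across all components, so each one must deliver an equal share. The eight components at 10 GB/s each are hypothetical; the 10% and 50% rebuild impacts are the figures quoted earlier in the deck.

```python
def striped_bandwidth(component_gbps):
    """Aggregate bandwidth of an evenly striped file.

    Each component serves an equal share of every stripe, so the slowest
    component sets the pace for all of them.
    """
    return len(component_gbps) * min(component_gbps)


# Hypothetical system: 8 components at 10 GB/s each.
healthy = [10.0] * 8

# One component rebuilding with a 50% performance impact.
rebuild_50 = [10.0] * 7 + [5.0]

# One component rebuilding with a 10% performance impact.
rebuild_10 = [10.0] * 7 + [9.0]

print(striped_bandwidth(healthy))     # all components healthy
print(striped_bandwidth(rebuild_50))  # one component at half speed
print(striped_bandwidth(rebuild_10))  # one component at 90% speed
```

In this model a single component at half speed halves the throughput of the whole file system, even though seven of the eight components are healthy.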
Amdahl’s Law & Striped File I/O
[Figure: bandwidth (GB/s, axis 0 to 1400) versus time for I/O striped across 8 data drives (1-8) plus parity (P, Q), comparing SFA average performance with Competitor 1 average performance. Annotations: much lower performance drop for rebuilds; higher overall average performance.]
Withstanding Enclosure Failures
• Small arrays: Compute Cluster → OSS 1, OSS 2, OSS 3, OSS 4. LOSS OF CRITICAL DATA on any single enclosure failure
• SFA: Compute Cluster → OSS 1, OSS 2, OSS 3, OSS 4, stripe members 1 2 3 4 5 6 7 8 P Q. NO LOSS OF CRITICAL DATA on up to 4 enclosure failures
Only DDN can ensure NO DATA loss on enclosure failures
Quick-Healing Capabilities
Small arrays: disk drive hangs or misbehaves
1. Drive is marked bad
2. No effort is made to bring it back up again
3. Rebuild of data starts once drive is replaced
4. System goes to degraded mode – data corruption & double/triple disk failure possibility increases
SFA (per-drive cache): disk drive hangs or misbehaves, self-healing capabilities get triggered
1. Users can power cycle the drive
2. During power cycle, writes are cached
3. If the suspect drive can be revived, all cached writes are written back to disk and the disk can be restored to health rapidly
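The self-healing sequence (power cycle the drive, cache writes meanwhile, replay them if the drive comes back) can be sketched as a small state machine. This is a toy model under stated assumptions, not DDN's implementation: the `Drive` and `SelfHealingSlot` classes and all their methods are hypothetical names introduced for illustration.

```python
class Drive:
    """Toy drive: a dict of block address to data (hypothetical model)."""

    def __init__(self):
        self.blocks = {}
        self.online = True

    def write(self, lba, data):
        if not self.online:
            raise IOError("drive offline")
        self.blocks[lba] = data


class SelfHealingSlot:
    """Hypothetical controller-side wrapper for one drive slot."""

    def __init__(self, drive):
        self.drive = drive
        self.pending = {}  # per-drive write cache, used only during a power cycle
        self.state = "healthy"

    def begin_power_cycle(self):
        """Take the suspect drive offline instead of marking it bad."""
        self.drive.online = False
        self.state = "power_cycling"

    def write(self, lba, data):
        if self.state == "power_cycling":
            # Later writes to the same block overwrite earlier cached ones.
            self.pending[lba] = data
        else:
            self.drive.write(lba, data)

    def end_power_cycle(self, revived):
        """Replay cached writes if the drive came back, else give up on it."""
        if revived:
            self.drive.online = True
            for lba, data in sorted(self.pending.items()):
                self.drive.write(lba, data)
            self.pending.clear()
            self.state = "healthy"
        else:
            self.state = "failed"  # a full rebuild would be needed
        return self.state


drive = Drive()
slot = SelfHealingSlot(drive)
slot.write(0, b"a")          # normal write, goes straight to the drive
slot.begin_power_cycle()
slot.write(1, b"b")          # cached while the drive is power cycled
slot.write(0, b"c")          # cached, supersedes the earlier block 0
final_state = slot.end_power_cycle(revived=True)
```

Only the blocks written during the power cycle are replayed, which is why recovery in this model is much cheaper than rebuilding the whole drive.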
Storage Fusion Architecture™ SFA12K™ Line
DDN | SFA12K™
• 800% Faster: Consolidate, Simplify, Save. Intelligent Appliances: SFA12K-20E, SFA12K-20, SFA12K-40
• 40% Denser: Reclaim Your Data Center. Leading Density: StorageScaler™ 8460: 84 disks in 4U (DDN exclusive)
• 50% Less TCO: Smarter Big Data Storage. 10+ Years of Innovation: Storage Fusion Processing, Storage Fusion Fabric™, QoS, SATAssure™, ReACT™, more
SFA12K-20E | Appliances
▶ SFA12K-20E available with DDN | EXAScaler™ and DDN | GRIDScaler™ parallel file storage solutions
▶ Integrate multiple appliances to scale to over 1000GB/s and 10s of petabytes
• EXAScaler SFA12K-20E: 20GB/s, up to 5.3PB* usable capacity
• GRIDScaler SFA12K-20E: 20GB/s, up to 5.3PB* usable capacity
* Initial release limited to 840 drives
Storage Fusion Processing™
Embedded computing brings the function to the data, as opposed to shipping data to the application.
With fewer data/packet transitions and application co-location within the storage appliance, the Storage Fusion Architecture accelerates data-intensive applications.
SFA12K™ | Management
Simple Systems Management
▶ GUI
▶ CLI
▶ SNMP
▶ New API
SFA12K™ | Read Quality of Service
SFA OS v1.5 Features ‘Real-Time’ Read Access To A Low-Latency Data Store
DDN provides the market’s only true QoS platform for big bandwidth applications over time. SFA read-QoS provides immediate benefit for:
• Latency-Sensitive Internet-Based Content Organizations (Cloud)
• Massively Parallel File Reads (Avoids Amdahl’s Law)
DataDirect Networks, Information in Motion, Silicon Storage Appliance, S2A, Storage Fusion Architecture, SFA, Storage Fusion Fabric, Web Object Scaler, WOS, EXAScaler, GRIDScaler, xSTREAMScaler, NAS Scaler, ReAct, ObjectAssure, In-Storage Processing and SATAssure are all trademarks of DataDirect Networks. Any unauthorized use is prohibited.