Integrating Flash-based SSDs into the Storage Stack - Semantic Scholar
Recommend Documents
A MODEL OF THE STAGES OF INTEGRATION1. Paula M.C. ... being a competitive weapon, are increasingly a necessary way of doing business" (Benjamin,.
Feb 2, 1994 - sociology namely John Hughes, Dan Shapiro, Dave Randall and Val ... Anderson, R. J., Hughes, J. A. and Sharrock, W. W. (1989), Working for ...
classes are small in terms of the number of organisations of which they are composed, ... "An examination of the strategic use of information systems technology by business ... purchasing, order entry, shipping, inventory management and accounts ...
drastic changes in the software and hardware landscape. Yet, many undergraduate computer science (CS) curricula have not
many undergraduate computer science (CS) curricula have not embraced the .... [Programming] Performance metrics: Know th
Software agents represent a radical departure from earlier monolithic ... assistance to human team performance using simplified, controllable laboratory experiments. .... the Naval Air Warfare Center-Training Systems Division and the.
Oct 9, 2006 - of the renewable type, in many countries around the world are discussed. A synopsis of ... Keywords: Distributed generation; Distributed energy resources; Combined heat and power. 1. .... by all players in the electricity supply chain f
of alternative approaches for integrating technology into adult basic education. There are four basic approaches to integrating technology into adult learning and ...
Mar 13, 2010 - The Commonwealth Alliance for Information Technology. Education (CAITE) is one ... SageFox Associates, worked to integrate a detailed and.
Alexander R. O'Neill1,2, Hemant K. Badola2, Pitamber P. Dhyani3 and ... reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, ..... Archibald Campbell, the first British political officer to.
Communications Technologies (ICT) components in the vehicle and connect with ... A typical car today will have at least 20 microprocessors and a host of other ...
Internet-based education, Personalised learning, Adaptive content, eLearning .... wrapping and delivery (i.e. Content Packaging and content API interfacing to.
genetic determinants for the cytotoxicity of perifosine. Hum Genomics, in press. Zhang W, Ratain MJ, Dolan ME. 2008c. The HapMap resource is providing.
Mihai Codescu,1 Till Mossakowski,1 Adrián Riesco,2 and Christian Maeder1. 1 DFKI GmbH Bremen and University of Bremen, Germany. 2 Facultad de ...
In order for a virtual environment to be effective as a training tool, it is not enough to concentrate on the .... as Loftin and Kenney's Hubble Telescope maintenance trainer (Loftin & Kenney 1995),. Kotani and Maes's ...... Amsterdam: IOS,. 1989 ...
Feb 19, 2017 - Clinical Development: State-of-the-Art, Challenges, and Recommendations. Savita Nandal 1,* and Tal Burt 2,*. 1. Novartis Pharmaceuticals ...
Integrating cyberinfrastructure into existing e-Social Science research. Svenja Adolphs,1 Bennett Bertenthal,2 Steve Boker,3 Ronald Carter,1. Chris Greenhalgh ...
[KCH+90] Kyo C. Kang, Sholom G. Cohen, James A. Hess, William E. Novak, and A. Spencer. Peterson. Feature-Oriented Domain Analysis (FODA): A Feasibility ...
two of the classic puzzles in the analysis of the attitudes can be pro tably viewed ..... In order to reason about the information states of Philip and Martin we must ...
teraction to install an original dialogue between musicians and a virtual improviser ..... the melodic track (theme, solo improvisation, etc.), the accompaniment ...
Such creative development tasks â such as user interface design or design of ... or animation workbenches like Adobe Flash, or even 3D graphics tools like 3D.
Educational Technology & Society 5(1) 2002. ISSN 1436-4522. 67 ... âwhatâ component concerned perception of what constitutes a technology. The âhowâ ...
APIs that free the application developer from the intricacies ... What is the JavaPorts framework? .... A newly defined task gets automatically translated by Java-.
works, topologies, protocols and adaptive approaches ex nihilo, a shared toolkit of well-characterised components might be used to develop networks whose ...
Integrating Flash-based SSDs into the Storage Stack - Semantic Scholar
Apr 19, 2012 - Integrating Flash-based SSDs into the Storage. Stack. Raja Appuswamy, David C. van Moolenbroek,. Andrew S. Tanenbaum. Vrije Universiteit ...
Integrating Flash-based SSDs into the Storage Stack Raja Appuswamy, David C. van Moolenbroek, Andrew S. Tanenbaum Vrije Universiteit, Amsterdam
April 19, 2012
Introduction: Hardware Landscape
$/GB of flash SSDs is still much higher than HDDs Flash-only installations are prohibitively expensive
Hybrid Storage Architectures (HSA) - a viable alternative Use high-performance SSDs in concert with high-density HDDs
Caching HSAs Extends the two-level RAM/HDD memory hierarchy SSDs used as intermediate caches
Dynamic Storage Tiering HSAs Establishes tiers of devices based on performance SSDs used for primary data storage
Introduction: Complex Workloads
The duality of storage workloads Isolated: well-defined app-specific access patterns Virtualized: disjoint I/O requests blended into a single stream
Two fundamental questions need to be answered How do DST/Caching systems fare under such workloads? Is the “one-architecture-per-installation” approach correct?
We need a modular, flexible hybrid storage framework Perform side-by-side comparison of various architectures Understand the impact of design alternatives
The Loris Storage Stack - Layers and Interfaces
File-based interface between layers Each file has a unique file identifier Each file has a set of attributes
Several access statistics proposed by prior research Extent-level IOPS and bandwidth statistics (EDT-DST) Block-level access frequency (Azor-Caching)
Our current implementation is based on Inverse Bitmaps Compute a bitmap value for each read/write operation b = 26−blog2 (N)c Bitmap value added to a per-file, in-memory counter Counter value indicates “hotness” of each file Prioritizes small, random I/Os over large, sequential ones
(1)
Loris-based Hybrid Systems
All Loris-based hybrid systems are file based Migration/Caching at whole-file granularities File granularity is only a limitation of the current prototype
Inverse Bitmaps used for “hot” data identification Data collection techniques are architecture neutral
All hybrid systems share the SSD cleaner implementation Cleaning is triggered reactively Side effect of writes that encounter a lack of space Both foreground and background writes trigger cleaning
Loris-based Conventional Hybrid Systems
Popular DST/Caching variants Interval-driven Hot-DST Migrates “hot” files to SSD tier periodically Migration interval is an important design parameter
On-demand Write-through Caching Caches “hot” files in SSD tier as a side effect of read operation Updates both copies on writes
Loris-based Unconventional Hybrid Systems
On-demand Hot-DST Migrate “hot” files as a side-effect of read operation Absence of a data copy in contrast to Caching
Interval-driven Write-through Caching Periodically cache “hot” files in SSD Low-overhead SSD cleanup in contrast to DST
On-demand Cold-DST Initially allocate all files in SSD SSD cleaner evicts “cold” files to accommodate new files Migrate back once-cold, but now-hot files from HDD
Benchmarks and Workload Generators: Quirks
Used variety of benchmarks/workload generators File Server, Web Server, and Mail Server workload types Parameters: file size, dir depth, r/w ratio, etc.
Benchmarks lack locality by default Uniform/random access pattern Grossly underestimates effectiveness of DST/Caching Need to extract workload properties from file system traces
Beware of transaction-bound benchmarks PostMark unlike FileBench is transaction bound Interval-based systems might fail to reach equilibrium
Results: Caching vs Hot-DST
Caching excels in read-heavy workloads (WebServer) Interval-driven/on-demand Caching faster than DST Cheap SSD cleanup by cached copy invalidation
Hot-DST excels in write-heavy workloads (FileServer) Interval-driven/on-demand DST faster than Caching No expensive synchronization of cached copy
Is write-back caching worth the complexity? Complicates consistency/availability maintenance But offers Cache-like read and DST-like write performance
Results: Caching vs Hot-DST (2) On-demand migration/caching systems outperform their interval-driven counterparts Quick responsiveness in read-heavy workloads But why is this the case in write-heavy workloads?
“Append-Read”–Inverse Bitmap interaction An append operation first reads last file block A single block read results in high increment to access counter Actual write buffered in OS cache On-demand migration migrates/caches file to/in SSD SSD services the write operation at a later time
Need for semantic awareness In the long run, append reads fill SSD with write-only logs Being file aware, Loris can isolate append reads
Results: Cold-DST
Cold-DST outperforms rest under most workloads Buffering allocation writes in the SSD tier boosts performance Scan-resistant Inverse Bitmaps retains hot files in the SSD tier Scales better as it avoids unnecessary background migration Configuration free unlike Interval-driven systems
Workload patterns also favor Cold-DST architecture 90% of newly created files are opened less than 5 times Proactive cold migration can exploit SSD parallelism to improve performance
Results: Cold-DST (2)
Cold-DST systems share several advantages with write-back caching without their disadvantages No synchronization overhead for maintaining consistency Efficient space utilization Admitting allocation writes and sieving “cold” data writes
However, more research is required to address Accelerated SSD wear due to excessive writes Are recent SW/HW reliability techniques sufficient?
Performance deterioration caused by using all SSD capacity Can over provisioning solve this problem?
Performance deterioration caused by high-cost random writes Is random write performance still an issue with modern SSDs?
Conclusion
Hybrid storage systems are effective and efficient No one architecture fits all workloads - Not yet! Can Cold-DST be the last word in hybrid architectures? How does Cold-DST stack up wrt write-back caching?
Pairing workloads with ideal architectures Preliminary results under virtualized workloads are encouraging