Exploring the data locality of On-board disk cache
ICITIS conference materials
Prof. Yuhui Deng (邓玉辉)
Department of Computer Science, Jinan University
Agenda
Hard disk drives
Motivation
Roles of the on-board disk cache
Locality
Experimental evaluation
Impact of prefetching
Impact of write cache
Discussion
Motivation
Disk drives have long been a performance bottleneck in computer systems due to their mechanical characteristics. A disk cache is an effective way to improve performance by avoiding the mechanical latency. Almost all modern disk drives employ a small amount of on-board cache (SRAM, usually less than 32MB) as a staging area to improve the performance of the disk drive.
Roles of the on-board disk cache
A working memory for disk firmware;
A speed-matching buffer between the disk media and the disk interface;
A prefetch buffer;
A read/write cache.
The prefetch buffer and the read/write cache exploit spatial locality and temporal locality, respectively, to improve performance.
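The read-ahead idea can be sketched as a toy model (a Python sketch for illustration, not from the slides; the class name and parameters are assumptions): on a miss for block b, the drive stages b plus the next few sequential blocks into the cache, so a sequential scan hits on most of its subsequent reads.

```python
from collections import OrderedDict

class PrefetchCache:
    """Toy on-board cache: LRU replacement plus sequential read-ahead."""
    def __init__(self, capacity_blocks, readahead):
        self.cache = OrderedDict()          # block number -> present
        self.capacity = capacity_blocks
        self.readahead = readahead
        self.hits = self.misses = 0

    def read(self, block):
        if block in self.cache:
            self.hits += 1
            self.cache.move_to_end(block)   # refresh LRU position
            return
        self.misses += 1                    # one mechanical access...
        for b in range(block, block + 1 + self.readahead):
            self.cache[b] = True            # ...fills block + read-ahead
            self.cache.move_to_end(b)
        while len(self.cache) > self.capacity:
            self.cache.popitem(last=False)  # evict least recently used

# A sequential scan of 80 blocks with a read-ahead of 7:
# every window of 8 sequential reads costs only one miss.
c = PrefetchCache(capacity_blocks=64, readahead=7)
for blk in range(80):
    c.read(blk)
print(c.hits, c.misses)  # 70 10
```

The same model with readahead=0 would miss on every block of the scan, which is why prefetching is the mechanism that captures spatial locality.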
Locality
Spatial locality: if a block is referenced, nearby blocks are likely to be accessed soon.
Temporal locality: a referenced block tends to be referenced again in the near future.
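Temporal locality can be made concrete with reuse distance: the number of distinct blocks touched between two references to the same block. An LRU cache of C blocks hits exactly those references whose reuse distance is below C. A minimal Python sketch (illustrative only; the traces are made up):

```python
def reuse_distances(trace):
    """Reuse distance = number of distinct blocks touched since the
    previous access to the same block (inf for first accesses)."""
    last_seen = {}
    dists = []
    for i, blk in enumerate(trace):
        if blk in last_seen:
            dists.append(len(set(trace[last_seen[blk] + 1:i])))
        else:
            dists.append(float("inf"))
        last_seen[blk] = i
    return dists

def lru_hit_ratio(trace, capacity):
    """An LRU cache of C blocks hits exactly the references whose
    reuse distance is strictly less than C."""
    dists = reuse_distances(trace)
    return sum(1 for d in dists if d < capacity) / len(trace)

# Strong temporal locality: a few hot blocks re-referenced soon.
hot = [1, 2, 3, 1, 2, 3, 1, 2, 3]
# No temporal locality: a pure scan, no block is ever reused.
scan = list(range(9))
print(lru_hit_ratio(hot, capacity=4))   # 0.666...
print(lru_hit_ratio(scan, capacity=4))  # 0.0
```

The evaluation later in the deck is, in effect, asking which of these two trace shapes disk-level I/O resembles.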
Experimental evaluation
Impact of prefetching
Impact of write cache
Discussion
A static write cache has a negligible impact. The write requests do not interfere much with the read streams, and varying the write cache size produces negligible performance variation.
Why?
Modern computers normally employ several GB of host memory, so any rewritten data is cached in the host memory rather than in the small on-board disk cache. For example, the EMC 8830 storage array adopts 64GB of memory and the HP rp7400 server employs 32GB; even a high-performance laptop normally uses a few GB of memory.
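This absorption effect can be illustrated with a toy write-back model (a hypothetical Python sketch, not from the slides): a rewrite to a block that is already dirty in host memory never reaches the disk, so the disk-level write trace loses the repeated references that carry temporal locality.

```python
def disk_trace_after_host_cache(writes, host_cache_blocks):
    """Write-back host cache: a rewrite to a cached block is absorbed
    in host memory; only evictions and the final flush reach the disk
    (FIFO eviction, for simplicity)."""
    cached = []                      # FIFO order of dirty blocks
    disk_writes = []
    for blk in writes:
        if blk in cached:
            continue                 # rewrite absorbed by host memory
        cached.append(blk)
        if len(cached) > host_cache_blocks:
            disk_writes.append(cached.pop(0))  # evict oldest to disk
    disk_writes.extend(cached)       # final flush of dirty blocks
    return disk_writes

# The application rewrites hot block 7 five times, but with a host
# cache larger than the working set each block hits the disk once.
app_writes = [7, 1, 7, 2, 7, 3, 7, 4, 7, 5]
trace = disk_trace_after_host_cache(app_writes, host_cache_blocks=8)
print(trace)  # [7, 1, 2, 3, 4, 5]
```

Ten application writes collapse into six disk writes with no repeats, which is consistent with the observation that disk-level traffic shows little temporal locality.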
Conclusion
The I/O traffic at the disk level does not demonstrate strong temporal locality. We believe it is better to leave the disk cache shared by both the write and read streams.
References
A. J. Smith. Disk cache-miss ratio analysis and design considerations. ACM Transactions on Computer Systems, 3(3):161-203, 1985.
E. V. Carrera and R. Bianchini. Improving Disk Throughput in Data-Intensive Servers. In Proceedings of the 10th International Symposium on High-Performance Computer Architecture (HPCA), 2004.
C. Ruemmler and J. Wilkes. An Introduction to Disk Drive Modeling. IEEE Computer, 27(3):17-28, 1994.
A. Riska and E. Riedel. Idle Read After Write (IRAW). In Proceedings of the USENIX 2008 Annual Technical Conference, pp. 43-56, 2008.
Y. Deng, F. Wang, and N. Helian. EED: Energy Efficient Disk Drive Architecture. Information Sciences, 178(22):4403-4417, 2008.
Y. Deng and B. Pung. Conserving Disk Energy in Virtual Machine Based Environments by Amplifying Bursts. Computing, Springer. DOI: 10.1007/s00607-010-0083-2.
Y. Deng. What Is the Future of Disk Drives, Death or Rebirth? ACM Computing Surveys. (Accepted)