Designs, Lessons and Advice from Building Large Distributed Systems

Jeff Dean, Google Fellow
[email protected]

Computing shifting to really small and really big devices
• UI-centric devices
• Large consolidated computing farms

[Photo: Google's data center at The Dalles, OR]

The Machinery
• Servers: CPUs, DRAM, disks
• Racks: 40-80 servers, Ethernet switch
• Clusters

Architectural view of the storage hierarchy

[Figure: each server has processors (P) with per-core L1$ and a shared L2$, backed by local DRAM and disk; a rack switch joins the servers in a rack, and a cluster switch joins the racks.]

One server:
• DRAM: 16GB, 100ns, 20GB/s
• Disk: 2TB, 10ms, 200MB/s

Local rack (80 servers):
• DRAM: 1TB, 300us, 100MB/s
• Disk: 160TB, 11ms, 100MB/s

Cluster (30+ racks):
• DRAM: 30TB, 500us, 10MB/s
• Disk: 4.80PB, 12ms, 10MB/s
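The table above pins down capacity, latency, and bandwidth at each level. As a rough illustration, here is a minimal Python sketch, assuming the simple model time = latency + size / bandwidth (the level names and constants are just the figures above, not measurements):

```python
# Rough read-time model for the storage hierarchy above:
# time = latency + bytes / bandwidth. A sketch, not a benchmark.

LEVELS = {
    # name: (latency in seconds, bandwidth in bytes/second)
    "local DRAM":   (100e-9, 20e9),
    "local disk":   (10e-3, 200e6),
    "rack DRAM":    (300e-6, 100e6),
    "rack disk":    (11e-3, 100e6),
    "cluster DRAM": (500e-6, 10e6),
    "cluster disk": (12e-3, 10e6),
}

def read_time(level: str, nbytes: int) -> float:
    """Estimated seconds to read nbytes from the given level."""
    latency, bandwidth = LEVELS[level]
    return latency + nbytes / bandwidth

for name in LEVELS:
    ms = read_time(name, 1 << 20) * 1e3  # time for a 1MB read, in ms
    print(f"{name:13s}: {ms:8.2f} ms per 1MB")
```

Under this model a 1MB read costs ~0.05ms from local DRAM but ~105ms from DRAM elsewhere in the cluster: once you leave the server, bandwidth dominates latency, which is the "bumpy ride" the next slide refers to.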

Storage hierarchy: a different view

A bumpy ride that has been getting bumpier over time

Reliability & Availability
• Things will crash. Deal with it!
  – Assume you could start with super-reliable servers (MTBF of 30 years)
  – Build a computing system with 10 thousand of those
  – Watch one fail per day (a quick calculation follows this list)
• Fault-tolerant software is inevitable
• Typical yearly flakiness metrics:
  – 1-5% of your disk drives will die
  – Servers will crash at least twice (2-4% failure rate)
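A back-of-envelope check of the "one failure per day" claim, assuming independent failures and the usual steady-rate MTBF approximation:

```python
# MTBF arithmetic behind "watch one fail per day":
# 10,000 servers, each with a 30-year MTBF, failing independently.
servers = 10_000
mtbf_days = 30 * 365  # per-server mean time between failures, in days

expected_failures_per_day = servers / mtbf_days
print(f"{expected_failures_per_day:.2f} failures/day")  # ~0.91, i.e. about one a day
```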

The Joys of Real Hardware

Typical first year for a new cluster:
• ~0.5 overheating (power down most machines in <5 mins, ~1-2 days to recover)
