MapReduce/Bigtable for Distributed Optimization

Keith Hall, Scott Gilpin, Gideon Mann
presented by: Slav Petrov
Zurich, Thornton, New York
Google Inc.
Outline

• Large-scale Gradient Optimization
  – Distributed Gradient
  – Asynchronous Updates
  – Iterative Parameter Mixtures
• MapReduce and Bigtable
• Experimental Results
Gradient Optimization Setting

Goal: find θ* = argmin_θ f(θ)

If f is differentiable, then solve via gradient updates:

    θ^{i+1} = θ^i + α ∇f(θ^i)

Consider the case where f is composed of a sum of differentiable functions f_q; then the gradient update can be written as:

    θ^{i+1} = θ^i + α Σ_q ∇f_q(θ^i)

This case is the focus of the talk.
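As a concrete (toy) illustration of this decomposed update, here is a minimal NumPy sketch; the component functions f_q(θ) = -(θ - x_q)^2, the data values, and the step size α are illustrative assumptions, not from the talk.

```python
import numpy as np

# Toy decomposable objective: f(theta) = sum_q f_q(theta) with
# f_q(theta) = -(theta - x_q)^2, so grad f_q(theta) = -2 * (theta - x_q).
x = np.array([1.0, 2.0, 4.0])   # one component function f_q per value x_q (made-up data)

def grad_fq(theta, x_q):
    return -2.0 * (theta - x_q)

theta = 0.0
alpha = 0.05                    # illustrative step size
for i in range(100):
    # theta^{i+1} = theta^i + alpha * sum_q grad f_q(theta^i)
    theta = theta + alpha * sum(grad_fq(theta, x_q) for x_q in x)

print(theta)                    # approaches the maximizer, the mean of x (~2.33)
```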
Maximum Entropy Models

X : input space
Y : output space
Φ : X × Y → R^n : feature mapping
S = ((x_1, y_1), ..., (x_m, y_m)) : training data
p_θ[y|x] = (1/Z) exp(θ · Φ(x, y)) : probability model

    θ* = argmin_θ (1/m) Σ_i log p_θ(y_i | x_i)

The objective function is a summation of functions, each of which is computed on one data instance.
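As an aside on what one such per-instance function looks like, here is a minimal NumPy sketch of the per-instance log-likelihood and its gradient for the model above; the tiny label set and random feature vectors are made-up purely for illustration.

```python
import numpy as np

def log_prob_and_grad(theta, phi_xy, y_i):
    """Per-instance objective f_i(theta) = log p_theta(y_i | x_i) and its gradient.

    phi_xy: array of shape (num_labels, num_features); row y holds Phi(x_i, y).
    y_i:    index of the observed label for this instance.
    """
    scores = phi_xy @ theta                         # theta . Phi(x_i, y) for every label y
    scores = scores - scores.max()                  # shift for numerical stability
    probs = np.exp(scores) / np.exp(scores).sum()   # p_theta(y | x_i)
    log_p = np.log(probs[y_i])
    # grad log p_theta(y_i | x_i) = Phi(x_i, y_i) - E_{p_theta}[Phi(x_i, .)]
    grad = phi_xy[y_i] - probs @ phi_xy
    return log_p, grad

# Tiny made-up instance: 3 labels, 4 features.
rng = np.random.default_rng(0)
phi = rng.normal(size=(3, 4))
theta = np.zeros(4)
print(log_prob_and_grad(theta, phi, y_i=1))
```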
Distributed Gradient

Observe that the gradient update can be broken down into three phases:

    θ^{i+1} = θ^i + α Σ_q ∇f_q(θ^i)

• ∇f_q(θ^i) : can be done in parallel
• Σ_q : cheap to compute
• Update step : depends on the number of features, not the data size

[Chu et al. '07]

At each iteration, the complete θ must be sent to each parallel worker.
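The sketch below runs the three phases with Python's multiprocessing in place of MapReduce; the toy objective f_q(θ) = -(θ - x_q)^2, the shards, and the step size are illustrative assumptions, not the paper's implementation.

```python
import numpy as np
from multiprocessing import Pool

# Made-up data split into shards, one per worker; f_q(theta) = -(theta - x_q)^2.
DATA_SHARDS = [np.array([1.0, 2.0]), np.array([3.0, 4.0]), np.array([5.0, 6.0])]
ALPHA = 0.05   # illustrative step size

def shard_gradient(args):
    """Phase 1 (parallel): a worker sums grad f_q(theta) over its own shard."""
    theta, shard = args
    return float(np.sum(-2.0 * (theta - shard)))

if __name__ == "__main__":
    theta = 0.0
    with Pool(processes=len(DATA_SHARDS)) as pool:
        for i in range(50):
            # the complete theta is sent to every worker at each iteration
            partial = pool.map(shard_gradient, [(theta, s) for s in DATA_SHARDS])
            total_grad = sum(partial)              # phase 2: cheap sum over shards
            theta = theta + ALPHA * total_grad     # phase 3: update step, independent of data size
    print(theta)   # approaches the mean of all values (3.5)
```

Note how the only per-iteration communication is broadcasting θ to the workers and collecting one partial sum per shard.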
Stochastic Gradient

Alternatively, approximate the sum Σ_q ∇f_q(θ^i) by a subset of the functions f_q. If the subset is of size 1, then the algorithm is online:

    θ^{i+1} = θ^i + α ∇f_q(θ^i)

Stochastic gradient approaches provably converge, and in practice are often much quicker than exact gradient calculation methods.
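Under the same toy-objective assumptions as above, a minimal sketch of the size-1 (online) case:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(loc=3.0, scale=1.0, size=1000)   # made-up data; f_q(theta) = -(theta - x_q)^2

theta = 0.0
alpha = 0.01                                    # illustrative (constant) step size
for i in range(5000):
    q = rng.integers(len(x))                    # sample a single component function f_q
    # theta^{i+1} = theta^i + alpha * grad f_q(theta^i)
    theta = theta + alpha * (-2.0 * (theta - x[q]))

print(theta)   # hovers near the sample mean of x; a decaying step size would sharpen convergence
```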
Asynchronous Updates

Asynchronous updates are a distributed extension of stochastic gradient. Each worker:

1. Get the current θ^i
2. Compute ∇f_q(θ^i)
3. Update the global parameter vector

Since the workers do not compute in lock-step, some gradients will be based on old parameters. Nonetheless, this also converges. [Langford et al. '09]
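The sketch below mimics this scheme with Python threads rather than the distributed workers and shared parameter store used in the actual system; the toy objective, made-up data, and step size are the same illustrative assumptions as above. Updates are deliberately unsynchronized, so some gradients are computed from stale parameters.

```python
import threading
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(loc=3.0, scale=1.0, size=2000)   # made-up data; f_q(theta) = -(theta - x_q)^2
theta = np.zeros(1)                                # shared "global" parameter vector
ALPHA = 0.01

def worker(worker_id, steps=500):
    local_rng = np.random.default_rng(worker_id)
    for _ in range(steps):
        snapshot = theta.copy()                    # 1) get the current theta (possibly already stale)
        q = local_rng.integers(len(data))
        grad = -2.0 * (snapshot - data[q])         # 2) compute grad f_q at that snapshot
        theta += ALPHA * grad                      # 3) update the global parameters without locking

threads = [threading.Thread(target=worker, args=(w,)) for w in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(theta)   # close to the sample mean despite some updates using stale parameters
```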
Iterative Parameter Mixtures

Separately estimate θ on different samples, and then combine. Iterative: take the resulting θ as the starting point for the next round and rerun.

For certain classes of objective functions (e.g. maxent and perceptron), convergence can also be shown. [Mann et al. '09, McDonald et al. '10]

Little communication between workers.
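A sketch of the iterative mixing loop under the same toy-objective assumptions; the uniform parameter average, shard count, and epoch count are illustrative choices rather than the paper's exact recipe.

```python
import numpy as np

rng = np.random.default_rng(0)
# Made-up data split across 4 workers; toy objective f_q(theta) = -(theta - x_q)^2.
shards = np.array_split(rng.normal(loc=3.0, scale=1.0, size=1200), 4)
ALPHA = 0.05

def train_on_shard(theta_start, shard, epochs=5):
    """Each worker trains independently on its shard (here: plain gradient steps)."""
    theta = theta_start
    for _ in range(epochs):
        for x_q in shard:
            theta = theta + ALPHA * (-2.0 * (theta - x_q))
    return theta

theta = 0.0
for round_id in range(3):
    per_shard = [train_on_shard(theta, shard) for shard in shards]  # no communication during training
    theta = float(np.mean(per_shard))   # mix: uniform average of the per-shard parameters
    print(f"round {round_id}: theta = {theta:.3f}")
```

Workers only exchange parameters once per round, which is what keeps communication low compared to the per-iteration broadcast of distributed gradient.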
Typical Datacenter (at Google)

• Commodity machines, with relatively few cores (e.g.
• Racks of machines connected by