Stubby: A Transformation-based Optimizer for MapReduce Workflows

3 downloads 5358 Views 438KB Size Report
Aug 1, 2012 - Web clicks, social media, scientific experiments, and datacenter monitoring are among sources .... HiveQL. Workflow Scheduling and Execution Engine ... Stubby will find the best plan subject to the given annotations, while ...
Stubby: A Transformation-based Optimizer for MapReduce Workflows

arXiv:1208.0082v1 [cs.DB] 1 Aug 2012



Herodotos Herodotou Duke University

Duke University

[email protected]

[email protected]

[email protected]

ABSTRACT

J1

M1 R1

J1.K1={C} J1.K2={O} J1.K3={O}

D02 J2

M2 R2

J2.K2={O} J2.K3={O}

D2

D1 J3

M3 R3

J3.K1={O} J3.K2={O} J3.K3={O}

M4

J4.K1={O} J4.K2={O} J4.V2={S,Z,P}

D3 J4 D4 J5

M5 R5

D5

J7

M7 R7

J5.filter={50