Dynamic request allocation and scheduling for context ... - salsahpc

4 downloads 27487 Views 585KB Size Report
Clients. Data center hosting. K context-aware applications. Data center hosting. K context-aware applications. Data center hosting. K context-aware applications.
Dynamic request allocation and scheduling for context aware applications subject to a percentile response time SLA in a distributed cloud Keerthana Boloor∗ , Rada Chirkova? , Tiia Salo and Yannis Viniotis∗ ∗ Department

of Electrical and Computer Engineering of Computer Science North Carolina State University

? Department

 IBM Software Group Research Triangle Park

Cloudcom 2010, Indianapolis, Indiana, USA

1 / 17

Agenda

Agenda

Problem description Dynamic request allocation and scheduling scheme Comparison with static allocation and FIFO/Weighted Round Robin scheduling scheme Conclusion

Cloudcom 2010, Indianapolis, Indiana, USA

2 / 17

Problem description

Problem description

More web applications are designed to be context aware. Most context aware applications are built on SOA principles. Cloud computing systems - the most preferred platform for deployment. Service Level Agreements (SLA) - terms of service and pricing model. What is this presentation about? Cloudcom 2010, Indianapolis, Indiana, USA

3 / 17

Problem description

Geographically distributed cloud computing system

Geographically distributed cloud computing system

Data center hosting K context-aware applications

Data center hosting K context-aware applications

Clients

Data center hosting K context-aware applications

Data center hosting K context-aware applications

Cloudcom 2010, Indianapolis, Indiana, USA

4 / 17

Problem description

Context aware applications

SOA based context aware application

2. Client request allocated to and scheduled at end-server

3. Load required service-endpoint

Gateway

4. Load required contextdata

Contextaware SOA applications

Contextdata stores End servers

DATA CENTER

Internet 1. Client request with context-id

Updates to contexts at contextdata stores

Cloudcom 2010, Indianapolis, Indiana, USA

5 / 17

Problem description

Model of an end-server

An end-server serving multiple user classes

Class 1

Server ‘j’ at data center ‘i’

Class 2

Class K

Each context aware application services multiple classes of users Each user class is guaranteed different quality of service based on economic considerations SLA specifies different service levels and service charges for the different user classes Cloudcom 2010, Indianapolis, Indiana, USA

6 / 17

Problem description

Percentile Service Level Agreements

Percentile Service Level Agreements

Profit

P

0

X

100

Conformance(%)

X % - the fraction of requests of a particular user class which need to have a response time less than r seconds $P - The profit charged by the cloud, if the percentile of requests that have response time less than r seconds is greater than or equal to X % Cloudcom 2010, Indianapolis, Indiana, USA

7 / 17

Problem description

Problem statement

Problem statement Allocate and schedule service requests locally at the end-servers so as to globally:

max

X

profitj

(1)

1≤j≤K

where profitj is the profit charged for conformance of the requests from users of class j.

Cloudcom 2010, Indianapolis, Indiana, USA

8 / 17

Problem description

Problem statement

Problem statement Allocate and schedule service requests locally at the end-servers so as to globally:

max

X

profitj

(1)

1≤j≤K

where profitj is the profit charged for conformance of the requests from users of class j. This problem is NP-hard!!

Cloudcom 2010, Indianapolis, Indiana, USA

8 / 17

Solution

Management scheme description

Heuristic-based data-oriented request management scheme Periodic allocation and adaptation at each datacenter.

Allocation phase

Allocation phase

Adaptation phase

Allocation phase

Adaptation phase

Allocation phase

Adaptation phase

Allocation phase

subinterval Adaptation Adaptation phase phase

Observation interval (T)

00:00

06:00

Cloudcom 2010, Indianapolis, Indiana, USA

9 / 17

Solution

Management scheme description

Heuristic-based data-oriented request management scheme Periodic allocation and adaptation at each datacenter.

Allocation phase

Allocation phase

Adaptation phase

Allocation phase

Adaptation phase

Allocation phase

Adaptation phase

Allocation phase

subinterval Adaptation Adaptation phase phase

Observation interval (T)

00:00 Adaptation phase

06:00

Datacenters exchange conformance levels. Allocation phase Rank-based request allocation and gi-FIFO scheduling. Aim at increasing global profit. Cloudcom 2010, Indianapolis, Indiana, USA

9 / 17

Solution

Rank-based allocation and gi-FIFO scheduling

Rank-based allocation and gi-FIFO scheduling

Profit-score calculation

Required global conformance: ck Current global conformance: cck If cck < ck Profit-score = pk /(ck − cck ) Else Profit-score = 0

Profit−score assigned to each arriving request of class 1 ($)

Profit: pk Class 1 SLA − Profit of 2000$ on conformance of 75% 2000

1500

1000

500

0 0

10

20

30 40 50 60 70 Current conformance of class 1 (%)

80

90

100

Cloudcom 2010, Indianapolis, Indiana, USA

10 / 17

Solution

Rank-based allocation and gi-FIFO scheduling

Rank-based request allocation 1

Query hash-based lookup table ([context-id,machine-id] or [service-id,machine-id])

2

Rank-based compatibility test

3

1

The arriving request is assigned a rank based on its profit-score and deadline.

2

Does the arriving request meet its deadline? - Machine compatible!!!

Compatible machine not found? - Choose least loaded closest to context DB

Cloudcom 2010, Indianapolis, Indiana, USA

11 / 17

Solution

Rank-based allocation and gi-FIFO scheduling

gi-FIFO scheduling Choose the request of user class with the highest current profit-score

Choose one with maximum waiting time but which results in a response time less than or equal to r If no such request exists, choose the request with higher waiting time resulting in a response time greater than r

gi-FIFO has been proven to be the most suitable for percentile SLAs for a single server serving multiple classes.

Cloudcom 2010, Indianapolis, Indiana, USA

12 / 17

Evaluation

Evaluation Dynamic scheme vs static schemes 11000 10000 9000 Dynamic rank based allocation with gi−FIFO scheduling

8000 Profit incurred ($)

Static allocation with WRR scheduling 7000

Static allocation with FIFO scheduling

6000 5000 4000 3000 2000 1000 0

5

10

15

20

25 30 Request rate

35

40

45

50

Cloudcom 2010, Indianapolis, Indiana, USA

13 / 17

Evaluation

Dynamic rank based allocation vs static allocation scheme

11000 Static allocation with gi−FIFO scheduling

10000

Dynamic rank based allocation with gi−FIFO scheduling

9000

Profit incurred ($)

8000 7000 6000 5000 4000 3000 2000 1000 0

0

50

100

150

Request rate

Cloudcom 2010, Indianapolis, Indiana, USA

14 / 17

Evaluation

Variation in subinterval length

Variation in context update interval

18000

18000 16000

16000 Uniform distribution of classes, stringent SLA Uniform distribution of classes, relaxed SLA Non−uniform distribution of classes, stringent SLA Non−uniform distribution of classes, relaxed SLA

12000

14000

Profit obtained ($)

Profit obtained($)

14000

10000 8000 6000

12000 10000

Medium Contextdata load times

4000

2000

2000

0

50

100

150

200 250 300 Subinterval period

350

400

450

0

500

High contextdata load times

6000

4000

0

Low contextdata load times

8000

0

20

40

60

80 100 120 140 Contextdata update interval

160

180

200

Cloudcom 2010, Indianapolis, Indiana, USA

15 / 17

Conclusion

Conclusion

Identified the need for dynamic request scheduling and allocation for context aware applications in a distributed cloud.

Proposed a novel rank-based request allocation and gi-FIFO scheduling scheme for managing percentile SLAs with an aim to maximize profit obtained by the cloud.

Cloudcom 2010, Indianapolis, Indiana, USA

16 / 17

Questions??

Cloudcom 2010, Indianapolis, Indiana, USA

17 / 17

Suggest Documents