A Dynamic Approach for Characterizing Collusion in Desktop ... - graal

A Dynamic Approach for Characterizing Collusion in Desktop Grids Louis-Claude C ANON, Emmanuel J EANNOT and Jon W EISSMAN Project-Team Runtime Labri/INRIA/Université Henri Poincaré University of Minnesota ”Scheduling in Aussois” Workshop

June 2, 2010

C ANON – J EANNOT – W IESSMAN

Collusion in Desktop Grids

June 2, 2010

1 / 18

Outline

1


2

Characterization Mechanism

3

Empirical Validation

4

Conclusion



June 2, 2010

2 / 18


Outline

1


2


3


4

Conclusion



June 2, 2010

3 / 18


Desktop Grids Distributed Computing The server assigns job to each active worker. Each worker returns a result for the corresponding job. We want to have the correct result for each submitted job. Example: BOINC-based projects.

Network

Server Batch of tasks

Workers



June 2, 2010

4 / 18


Cheating

Collusion Some workers produces the same wrong result for a given job. Job1 Non-colluders Collusion

Job2

No collusion Colluders



June 2, 2010

5 / 18


Motivation

Causes of collusive behavior Malicious participants. Sybil attack (one entity for several identities). Library with platform-specific bugs.



June 2, 2010

6 / 18


Objectives

Estimate collusion probabilities Estimate the probability that any pair of workers gives the same wrong result for the same job. Detecting the groups of workers that behave similarly.



June 2, 2010

7 / 18


Duplication

Constraints Generic and non-intrusive mechanism: no information on the jobs. No trustworthy computational resources: each result is unknown.



June 2, 2010

8 / 18


Duplication

Constraints Generic and non-intrusive mechanism: no information on the jobs. No trustworthy computational resources: each result is unknown. Current BOINC solution Assign a job to k distinct workers (redundancy) and select the result that has the majority.



June 2, 2010

8 / 18


Duplication

Constraints Generic and non-intrusive mechanism: no information on the jobs. No trustworthy computational resources: each result is unknown. Current BOINC solution Assign a job to k distinct workers (redundancy) and select the result that has the majority. Extension proposal Build a characterization system that will be used at higher level. Based only on the generated results, what can we say?



June 2, 2010

8 / 18


Outline

1


2


3


4

Conclusion



June 2, 2010

9 / 18


Interaction Model

Two interaction representations Interaction between groups i and j: collusion groups i and j collude together or not (collusion estimation cij ) agreement groups i and j agree together or not (agreement estimation aij )



June 2, 2010

10 / 18


Interaction Model

Two interaction representations Interaction between groups i and j: collusion groups i and j collude together or not (collusion estimation cij ) agreement groups i and j agree together or not (agreement estimation aij ) Relations aij ≤ 1 + 2 × cij − cii − cjj cij ≤

1+aij −a1i −a1j 2


(the index of the largest groups is 1)


June 2, 2010

10 / 18


On-line Algorithm 1

1 1

Data structure Graph: each node corresponds to a set of worker; each edge, to some collusion characteristics (agreement for the example on the right).

v1

v2

v7

v3

0.4

0.4 v4 v5 v6

1



June 2, 2010

11 / 18


On-line Algorithm 1

1 1

Data structure Graph: each node corresponds to a set of worker; each edge, to some collusion characteristics (agreement for the example on the right).

v1

v2

v7

v3

0.4

0.4 v4 v5 v6

1

Algorithm Initially, each worker is in a singleton. We proceed by succession of merges and splits operations.



June 2, 2010

11 / 18


Agreement Criteria Merge Observed group i and j are merged if workers always returned the same result (i.e., aij ≈ 1) the number of observations is greater than |i ∪ j| (merge of small groups is easy while large groups need more observations)



June 2, 2010

12 / 18


Agreement Criteria Merge Observed group i and j are merged if workers always returned the same result (i.e., aij ≈ 1) the number of observations is greater than |i ∪ j| (merge of small groups is easy while large groups need more observations) Split If a worker disagree with any other from the same observed group: both conflicting workers are put in two new singletons



June 2, 2010

12 / 18


Agreement Criteria Merge Observed group i and j are merged if workers always returned the same result (i.e., aij ≈ 1) the number of observations is greater than |i ∪ j| (merge of small groups is easy while large groups need more observations) Split If a worker disagree with any other from the same observed group: both conflicting workers are put in two new singletons Agreement vs. collusion Agreement criteria are easier.



June 2, 2010

12 / 18


Outline

1


2


3


4

Conclusion



June 2, 2010

13 / 18


Trace-Based Inputs

Process Input: availibility traces (FTA), workload traces (GWA) and performance trace (FTA) Output: collection of events (< t, w, j, r > and < t, j >)



June 2, 2010

14 / 18


Trace-Based Inputs

Process Input: availibility traces (FTA), workload traces (GWA) and performance trace (FTA) Output: collection of events (< t, w, j, r > and < t, j >) Scheduling Redundancy-based scheduler: achieve a quorum of q with initial and maximal duplication l and lmax . Jobs scheduled on workers when available. Result computations are based on reliability and colluding probabilities. Mix real traces with scheduling and threat model for generating events.



June 2, 2010

14 / 18


Trace Parameters Parameters Parameter

Default value

Tested values

Worker availability trace Workload model or trace Quorum (k , q, l) Number of workers (n)

Seti@home charmm (4, 3, 10) 100

Reliability (fraction, probability)

(0.7, 0.7)

Collusion (fraction, probability)

(0.2, 0.5)

Overnet, Microsoft, . . . mfold, docking@home (2, 1, 2), (19, 15, 19) 30, 50, 70, 80, 200 {0.7, 0.99} × {0.7, 0.99} \ (0.7, 0.7) {0.02, 0.2, 0.49} × {0.01, 0.5, 1} \ (0.2, 0.5), Pair



June 2, 2010

15 / 18


Trace Parameters Parameters Parameter

Default value

Tested values

Worker availability trace Workload model or trace Quorum (k , q, l) Number of workers (n)

Seti@home charmm (4, 3, 10) 100

Reliability (fraction, probability)

(0.7, 0.7)

Collusion (fraction, probability)

(0.2, 0.5)

Overnet, Microsoft, . . . mfold, docking@home (2, 1, 2), (19, 15, 19) 30, 50, 70, 80, 200 {0.7, 0.99} × {0.7, 0.99} \ (0.7, 0.7) {0.02, 0.2, 0.49} × {0.01, 0.5, 1} \ (0.2, 0.5), Pair

Summary 28 scenarios with 20 seeds: 560 traces on which both heuristics are run. C ANON – J EANNOT – W IESSMAN


June 2, 2010

15 / 18

Specific Run

0.30

Error committed at each iteration ● ● ● ● ● ● ● ● ● ●● ●● ● ● ● ● ● ● ● ●● ● ●●● ● ● ● ● ● ● ● ● ● ● ●●● ● ● ●●● ●● ● ● ●● ●● ●● ● ● ●

0.25

● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ●

0.20

● ●

● ●

0.15

● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● ● ● ● ●● ●

0.10

Collusion RMSD

Agreement representation Collusion representation

0.0

0.5

1.0

1.5 Time (days)

2.0

2.5

3.0

Conclusion

Outline

1


2


3


4

Conclusion



June 2, 2010

17 / 18

Conclusion

Conclusion and future directions

Main contributions Propose a characterization system (with 2 representations) Validate with realistic inputs based on existing traces Perspective Use both interaction models with the same group structure (agreement for updating structure, collusion to get the values) Use certification mechanism for fixing systematic error (single result may be correct) Higher-level scheduling mechanisms (oriented towards detection or avoidance)



June 2, 2010

18 / 18

A Dynamic Approach for Characterizing Collusion in Desktop ... - graal

A Dynamic Approach for Characterizing Collusion in Desktop ... - graal

Suggest Documents

Towards MapReduce for Desktop Grid Computing - graal

Towards MapReduce for Desktop Grid Computing - graal

An Active Approach to Characterizing Dynamic ... - CiteSeerX

A Dynamic Approach to Characterizing Termination of General Logic ...

XtremLab: A System for Characterizing Internet Desktop Grids

tackling the collusion threat in p2p-enhanced internet desktop grids

A Novel Approach for Characterizing Microsatellite

A Texture Analysis Approach for Characterizing ... - CiteSeerX

Screening for collusion: A spatial statistics approach - Google Sites

A New Approach in Characterizing Secondary ...

Extending the EGEE Grid with XtremWeb-HEP Desktop Grids - graal

Characterizing Graphical Desktop Sharing ... - Semantic Scholar

Soil Dynamic Constitutive Model for Characterizing ...

Dynamic Scheduling for Heterogeneous Desktop Grids - CiteSeerX

Characterizing Result Errors in Internet Desktop Grids - CiteSeerX

Characterizing Result Errors in Internet Desktop ... - Semantic Scholar

a dynamic panel approach

An Approach for Characterizing and Comparing ... - BioMedSearch

Non-Cooperative Collusion in Static and Dynamic Oligopolies

Characterizing Result Errors in Internet Desktop Grids - CiteSeerX

Characterizing Dynamic Walking Patterns and

An Analytic Approach for Deploying Desktop Videoconferencing

Characterizing Actions in a Dynamic Common Pool Resource ... - MDPI

A Comprehensive Approach Characterizing Fusion Proteins ... - bioRxiv