Sergey Ostroumov

Agent-Based Management Systems for Many-Core Platforms Rigorous Design and Efficient Implementation

Outline

• Motivation and background

• Research objective and questions

• Thesis contributions

• Conclusions

12.11.2015


Many-Core Platforms

http://www.rle.mit.edu/isg/On-ChipIntegratedCMOSPhotonics.htm


Applications

1. Samsung edge 6

2. HP Stream 7 Tablet

3. Advantech UNO-4673A, certified with IEC 61850-3 and IEEE 1613

4. MikroTik Cloud Core Router CCR1016-12G, TileGx16-based

5. Toshiba Artida

Networks-on-chip

[Figure: Network-on-Chip-based platform. Tiles, each consisting of a processing unit (PU) and a router (RT).]

L. Benini, G. De Micheli, Networks on chips: a new SoC paradigm, Computer, IEEE, Vol. 35, Issue 1, pp. 70 – 78, 2002.


Resilience of Many-Core Platforms to Faults

http://cccp.eecs.umich.edu/research/reliability.php


Resilience of Many-Core Platforms to Faults

Shekhar Borkar, Designing reliable systems from unreliable components: the challenges of transistor variability and degradation, Micro, Volume 25, Issue 6, IEEE, pp. 10-16, 2005


Agent-based Management

An agent is something that perceives and acts resiliently and autonomously

[Figure: hierarchy of agents over a NoC-based platform. Levels: platform agent, cluster agents, cell agents. Each tile contains a cell agent, a processing unit (PU) and a router (RT).]

Legend:

• NoC-based communication between routers

• NoC-based communication between RC and RT through NI

• Communication between cluster agents and the platform agent

• Communication between cell agents and cluster agents

• Application region
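
The three agent levels named on this slide can be pictured with a minimal Python sketch. The class names, the `report` method and the idea that faults are reported upwards level by level are illustrative assumptions, not the design from the thesis; the slide itself only states that cell agents communicate with cluster agents and cluster agents with the platform agent.

```python
from dataclasses import dataclass, field


@dataclass
class CellAgent:
    """Hypothetical cell agent: observes a single tile (PU + RT)."""
    tile_id: str
    faulty: bool = False

    def report(self) -> dict:
        # A cell agent only knows the state of its own tile.
        return {"tile": self.tile_id, "faulty": self.faulty}


@dataclass
class ClusterAgent:
    """Hypothetical cluster agent: aggregates the reports of its cell agents."""
    cluster_id: str
    cells: list = field(default_factory=list)

    def report(self) -> dict:
        faulty = [c.tile_id for c in self.cells if c.report()["faulty"]]
        return {"cluster": self.cluster_id, "faulty_tiles": faulty}


@dataclass
class PlatformAgent:
    """Hypothetical platform agent: holds the global view over all clusters."""
    clusters: list = field(default_factory=list)

    def faulty_tiles(self) -> dict:
        # Collect, per cluster, the tiles that the cluster agent reported as faulty.
        return {c.cluster_id: c.report()["faulty_tiles"] for c in self.clusters}


cells_a = [CellAgent("t0"), CellAgent("t1")]
cells_b = [CellAgent("t2"), CellAgent("t3")]
platform = PlatformAgent([ClusterAgent("A", cells_a), ClusterAgent("B", cells_b)])

cells_b[1].faulty = True  # simulate a fault observed by one cell agent
overview = platform.faulty_tiles()
print(overview)
```

Each level sees only the reports of the level directly below it, which is one plausible reading of the hierarchy in the figure.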

Challenges

• The more resources the platform has, the more agents are needed

• Hierarchical organization of the agents and their interdependencies

• Complexity of an agent-based system

• Exhaustive testing is infeasible

Formal Methods

• Based on strictly defined (mathematical) languages and rules

• Provide reasoning about system correctness with respect to postulated properties

• Help to decrease the number of design faults by means of proofs

• Support abstractions, so that the details are added stepwise

Event-B

• System Level Modelling

• Stepwise refinement

• Correct-by-Construction Development

• Tool Support: the Rodin platform

Problem

• Many-core platforms enable distributed parallel and efficient computation

• Agent-based systems provide active management (monitoring and decision making)

• Formal methods are essential for ensuring correctness of the system

Problem: To merge these areas

Objective

To determine methodological aspects of the design and implementation of agent-based systems for many-core platforms, considering efficiency (performance, area overhead, etc.) and resilience of both the platform and the agent-based management system simultaneously

Research Questions

RQ1: How to take into account the hierarchical organization of agents? Moreover, how to rigorously design a hierarchical agent-based management system such that its behaviour can be trusted?

RQ2: How to integrate dynamic reconfiguration of the platform into the agents' hierarchy, so that an acceptable level of performance is maintained? Additionally, how to show that the agents will behave resiliently under these circumstances?

RQ3: How to avoid data loss when the many-core platform is reconfigured?

RQ4: How to evaluate the performance of the derived agent-based system? Specifically, how to translate a formal model into synthesizable code?

Formal Design of Agents Hierarchy

• Abstract model: applications, underlying platform and basic functionality of the platform agent

• First refinement: abstract application mapping and reallocation at the platform level

• Second refinement: cluster agents with DVFS

• Fourth refinement: cell agents with local reconfiguration

Layers: platform agent layer, cluster agents layer, cell agents layer

Paper 1. Sergey Ostroumov, Leonidas Tsiopoulos, Marina Waldén, Juha Plosila, Hierarchical agent-based monitoring systems for dynamic reconfiguration in NoC platforms: A formal approach, Advancing Embedded Systems and Real-Time Communications with Emerging Technologies, Ch. 13, IGI Global, pp. 302-333, 2014


Dynamic Reconfiguration: Tasks Reallocation

[Figure: platform and mapping (routers, cores, links). Tasks n0 to n8 are mapped onto cores, and one core is a spare (S). In the final frame the core that ran n5 is marked as failed (F) and n5 is placed on the former spare core.]

Legend: application region; F: failed core; S: spare core

Paper 2. S. Ostroumov, L. Tsiopoulos, J. Plosila, K. Sere, Formal Approach to Agent-Based Dynamic Reconfiguration in Networks-On-Chip, Journal of Systems Architecture, 59(9), Elsevier, pp. 709-728, 2013.
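
The reallocation step shown in the figure can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the function name `reallocate`, the core identifiers and the policy of taking the first available spare are all hypothetical, since the slides do not say how a spare is chosen.

```python
def reallocate(mapping, failed_core, spare_cores):
    """Move every task mapped to failed_core onto a spare core.

    mapping      -- dict: task name -> core identifier
    failed_core  -- identifier of the core that failed
    spare_cores  -- list of free core identifiers (assumed healthy)

    Returns the new mapping and the list of spares that are still free.
    The inputs are not modified.
    """
    if not spare_cores:
        raise RuntimeError("no spare core available")
    spare = spare_cores[0]  # assumption: simply take the first spare
    new_mapping = {
        task: (spare if core == failed_core else core)
        for task, core in mapping.items()
    }
    return new_mapping, spare_cores[1:]


# Tasks n0..n8 on hypothetical cores core0..core8, plus one spare core.
mapping = {f"n{i}": f"core{i}" for i in range(9)}
new_mapping, remaining_spares = reallocate(mapping, "core5", ["coreS"])
print(new_mapping["n5"], remaining_spares)
```

A real procedure would also have to weigh the placement of the spare relative to the communicating tasks, which this sketch ignores.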

Avoiding Data Loss

[Figure: animation of the platform and mapping (routers, cores, links) with tasks n0 to n8 and a spare core (S). An in-buffer holds entries (... n1, n5 ...). In the final frames the core that ran n5 is marked as failed (F) and n5 is placed on the former spare core.]

Paper 3. Sergey Ostroumov, Pontus Boström, Marina Waldén, Derivation of Parallel and Resilient Programs from Simulink Models, In Proceedings of International Conference on Parallel, Distributed and Network-based Processing, IEEE Computer Society Conference Publishing Services (CPS), pp. 416-420, 2015.
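
One way to read the in-buffer in the figure is as a holding area for data addressed to a task that is being moved. The Python sketch below illustrates that reading only; the class `InBuffer`, its methods and the hold-then-flush policy are assumptions, because the slides show the buffer but do not spell out the mechanism.

```python
from collections import deque


class InBuffer:
    """Hypothetical input buffer that holds data for a task while it is moved."""

    def __init__(self):
        self.pending = deque()  # (source, destination, payload), in arrival order
        self.migrating = set()  # tasks currently being reallocated
        self.delivered = []     # items that reached their destination task

    def begin_migration(self, task):
        self.migrating.add(task)

    def send(self, source, destination, payload):
        if destination in self.migrating:
            # Hold the item instead of dropping it.
            self.pending.append((source, destination, payload))
        else:
            self.delivered.append((source, destination, payload))

    def end_migration(self, task):
        """Deliver everything held for `task`, keeping arrival order."""
        self.migrating.discard(task)
        still_pending = deque()
        for item in self.pending:
            if item[1] == task:
                self.delivered.append(item)
            else:
                still_pending.append(item)
        self.pending = still_pending


buf = InBuffer()
buf.send("n1", "n5", "a")      # delivered directly
buf.begin_migration("n5")      # the core running n5 has failed
buf.send("n1", "n5", "b")      # held in the buffer
buf.send("n1", "n5", "c")      # held in the buffer
held = len(buf.pending)
buf.end_migration("n5")        # n5 now runs on the spare core
payloads = [p for _, _, p in buf.delivered]
print(held, payloads)
```

Nothing sent to n5 during the reallocation is lost, and the arrival order is preserved once the task resumes.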

Translation of Formal Models into Synthesizable Descriptions

... Refinement n: implementable (deterministic) model, translated into a VHDL description

• Input variables → Input signals

• Internal variables → Internal signals

• Output variables → Output signals

• Model parameters (constants) → Description parameters (generics)

Paper 4. Sergey Ostroumov, Leonidas Tsiopoulos, Juha Plosila, Kaisa Sere, Generation of Structural VHDL Code with Library Components From Formal Event-B Models, In Euromicro Conference on Digital System Design, IEEE Conference Publishing Services (CPS), pp. 111-118, 2013.
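
The correspondence between model elements and VHDL elements can be illustrated with a small Python sketch that emits an entity and architecture skeleton. It is not the authors' generator: the function `vhdl_entity`, its arguments and the example names (`cell_agent`, `fault_in`, and so on) are hypothetical, and the sketch assumes at least one input or output is given.

```python
def vhdl_entity(name, constants, inputs, outputs, internals):
    """Build a VHDL entity/architecture skeleton from model element names.

    constants -- model parameters   -> generics
    inputs    -- input variables    -> input ports
    outputs   -- output variables   -> output ports
    internals -- internal variables -> internal signals
    Each argument maps an identifier to a VHDL type string.
    """
    generics = ";\n".join(f"    {c} : {t}" for c, t in constants.items())
    ports = [f"    {v} : in {t}" for v, t in inputs.items()]
    ports += [f"    {v} : out {t}" for v, t in outputs.items()]

    lines = ["library ieee;", "use ieee.std_logic_1164.all;", "",
             f"entity {name} is"]
    if generics:
        lines.append("  generic (\n" + generics + "\n  );")
    lines.append("  port (\n" + ";\n".join(ports) + "\n  );")
    lines.append(f"end entity {name};")
    lines.append("")
    lines.append(f"architecture generated of {name} is")
    lines += [f"  signal {v} : {t};" for v, t in internals.items()]
    lines += ["begin",
              "  -- translated event bodies would go here",
              "end architecture generated;"]
    return "\n".join(lines)


code = vhdl_entity(
    "cell_agent",
    constants={"WIDTH": "integer := 8"},
    inputs={"fault_in": "std_logic"},
    outputs={"reconf_out": "std_logic"},
    internals={"state": "integer range 0 to 3"},
)
print(code)
```

Only the declarations are produced here; translating the events of the model into concurrent statements or processes is the substantive part and is left out.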


Design Flow for Structural Description Generation

[Figure: design flow. Elements shown: ... Refinement n, implementable model (deterministic); Refinement n+1, implementable model with functions; automated VHDL code generation; VHDL code with library components; test cases; model simulation and model results; code simulation and code results; online testing.]

Paper 5. Sergey Ostroumov and Leonidas Tsiopoulos, VHDL Code Generation from Formal Event-B Models, In Euromicro Conference on Digital System Design: Architectures, Methods and Tools, IEEE Computer Society Conference Publishing Services (CPS), pp. 127-134, 2011.


Overall Picture

[Figure: Formal Methods, Agent-Based Systems and Many-Core Platforms combined; attributes: Performance, Energy, Resilience]

Conclusion

• Formal, rigorous development of agent-based systems that takes their hierarchical organization into account

• Various dynamic reconfiguration procedures integrated into the agents' functionality, simultaneously considering efficiency (performance, overhead, etc.) and resilience attributes

• A mechanism that prevents data loss when the underlying platform is reconfigured at run-time

• An approach to translating a model into a synthesizable description, which eases derivation of the implementation and evaluation of non-functional properties in a real-world environment

• The approach proposed in the thesis has been validated and evaluated in several case studies from different application domains


Future Work

• More advanced task reallocation procedure considering various placements of available resources (also within other clusters)

• Monitoring procedures for rapid invocation of dynamic reconfiguration

• Dynamic reconfiguration for the agents

• SW/HW co-design at the formal level