Sergey Ostroumov
Agent-Based Management Systems for Many-Core Platforms: Rigorous Design and Efficient Implementation
Outline
• Motivation and background
• Research objective and questions
• Thesis contributions
• Conclusions
12.11.2015
Many-Core Platforms
http://www.rle.mit.edu/isg/On-ChipIntegratedCMOSPhotonics.htm
Applications
1. Samsung edge 6
2. HP Stream 7 Tablet
3. Advantech UNO-4673A, certified with IEC 61850-3 and IEEE 1613
4. MikroTik Cloud Core Router CCR1016-12G, TileGx16-based
5. Toshiba Artida
Networks-on-chip
[Figure: Network-on-Chip-based platform composed of tiles, each containing a processing unit (PU) and a router (RT).]
L. Benini, G. De Micheli, Networks on chips: a new SoC paradigm, Computer, IEEE, Vol. 35, Issue 1, pp. 70 – 78, 2002.
Resilience of Many-Core Platforms to Faults
http://cccp.eecs.umich.edu/research/reliability.php
Resilience of Many-Core Platforms to Faults
Shekhar Borkar, Designing reliable systems from unreliable components: the challenges of transistor variability and degradation, Micro, Volume 25, Issue 6, IEEE, pp. 10-16, 2005
Agent-based Management
An agent is something that perceives and acts resiliently and autonomously
[Figure: a platform agent, cluster agents, and cell agents laid over the tiles of the platform. Each tile holds a cell agent, a processing unit (PU), and a router (RT); one group of tiles is marked as an application region. Communication shown: between cell agents and cluster agents; between cluster agents and the platform agent; NoC-based between routers; NoC-based between RC and RT through NI.]
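The three agent levels named on this slide can be pictured with a minimal Python sketch. It is an illustration only, not the thesis design: the class names, the `perceive` health flag, and the report format are assumptions introduced here.

```python
from dataclasses import dataclass, field


@dataclass
class CellAgent:
    """Monitors one tile (processing unit plus router)."""
    tile_id: str
    healthy: bool = True  # hypothetical stand-in for real hardware monitors

    def perceive(self) -> bool:
        return self.healthy


@dataclass
class ClusterAgent:
    """Talks to the cell agents of the tiles it is responsible for."""
    name: str
    cells: list = field(default_factory=list)

    def faulty_tiles(self) -> list:
        return [c.tile_id for c in self.cells if not c.perceive()]


@dataclass
class PlatformAgent:
    """Top of the hierarchy: collects reports from all cluster agents."""
    clusters: list = field(default_factory=list)

    def collect_reports(self) -> dict:
        return {cl.name: cl.faulty_tiles() for cl in self.clusters}


# Hypothetical platform: two clusters, one tile reporting a fault.
cells = [CellAgent("t0"), CellAgent("t1", healthy=False)]
platform = PlatformAgent(
    [ClusterAgent("c0", cells), ClusterAgent("c1", [CellAgent("t2")])]
)
report = platform.collect_reports()
print(report)
```

Information flows upward only along the two communication paths listed on the slide: cell agents to cluster agents, and cluster agents to the platform agent.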
Challenges
• The more resources the platform has, the more agents are needed
• Hierarchical organization of the agents and their interdependencies
• Complexity of an agent-based system
• Exhaustive testing is infeasible
Formal Methods
• Based on strictly defined (mathematical) languages and rules
• Provide reasoning about system correctness with respect to postulated properties
• Help to decrease the number of design faults by means of proofs
• Support abstractions, so that the details are added stepwise
Event-B
• System-level modelling
• Stepwise refinement
• Correct-by-construction development
• Tool support: the Rodin platform
Problem
Many-core platforms enable distributed parallel and efficient computation
Agent-based systems provide active management (monitoring and decision making)
Formal methods are essential for ensuring correctness of the system
Problem: To merge these areas
Objective
To determine methodological aspects of the design and implementation of agent-based systems for many-core platforms, considering the efficiency (performance, area overhead, etc.) and the resilience of both the platform and the agent-based management system simultaneously
Research Questions
RQ1: How to take into account the hierarchical organization of agents? Moreover, how to rigorously design a hierarchical agent-based management system such that its behaviour can be trusted?
RQ2: How to integrate dynamic reconfiguration of the platform into the agents' hierarchy, so that an acceptable level of performance is maintained? Additionally, how to show that the agents will behave resiliently under these circumstances?
RQ3: How to avoid data loss when the many-core platform is reconfigured?
RQ4: How to evaluate the performance of the derived agent-based system? Specifically, how to translate a formal model into synthesizable code?
Formal Design of Agents Hierarchy
• Abstract model: applications, underlying platform and basic functionality of the platform agent
• First refinement: abstract application mapping and reallocation at the platform level
• Second refinement: cluster agents with DVFS
• Fourth refinement: cell agents with local reconfiguration
Layers: platform agent layer, cluster agents layer, cell agents layer
Paper 1. Sergey Ostroumov, Leonidas Tsiopoulos, Marina Waldén, Juha Plosila, Hierarchical agent-based monitoring systems for dynamic reconfiguration in NoC platforms: A formal approach, Advancing Embedded Systems and Real-Time Communications with Emerging Technologies, Ch. 13, IGI Global, pp. 302-333, 2014
Dynamic Reconfiguration: Tasks Reallocation
[Figure: platform and mapping. Routers, cores, and links form the platform; nodes n0 to n8 are mapped onto cores, and one further core is a spare. Legend: application region (marked area), F = failed core, S = spare core.]
Paper 2. S. Ostroumov, L. Tsiopoulos, J. Plosila, K. Sere, Formal Approach to Agent-Based Dynamic Reconfiguration in Networks-On-Chip, Journal of Systems Architecture, 59(9), Elsevier, pp. 709-728, 2013.
[Figure, after reconfiguration: the core that hosted n5 is marked F (failed core), and n5 is now placed on the former spare core.]
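The reallocation step shown in the figure (a task on a failed core moves to a spare core) can be sketched in a few lines of Python. This is a simplified illustration, not the procedure from Paper 2: the core names `c0` to `c8` and `s`, and the first-free-spare policy, are assumptions made here.

```python
def reallocate(mapping, failed_core, spare_cores):
    """Move every task mapped to failed_core onto a free spare core.

    Returns the updated task-to-core mapping and the spares still unused.
    """
    new_mapping = dict(mapping)
    free = list(spare_cores)
    for task, core in mapping.items():
        if core == failed_core:
            if not free:
                raise RuntimeError(f"no spare core left for task {task}")
            new_mapping[task] = free.pop(0)  # assumed policy: first free spare
    return new_mapping, free


# Hypothetical platform: tasks n0..n8 on cores c0..c8, one spare core "s".
mapping = {f"n{i}": f"c{i}" for i in range(9)}
new_mapping, remaining = reallocate(mapping, failed_core="c5", spare_cores=["s"])
print(new_mapping["n5"], remaining)
```

The original mapping is left untouched, so the configuration before and after the failure can be compared.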
Avoiding Data Loss
[Figure: platform and mapping (routers, cores, links) with nodes n0 to n8 and a spare core S; a detail view shows an in-buffer labelled with n1 and n5.]
Paper 3. Sergey Ostroumov, Pontus Boström, Marina Waldén, Derivation of Parallel and Resilient Programs from Simulink Models, In Proceedings of International Conference on Parallel, Distributed and Network-based Processing, IEEE Computer Society Conference Publishing Services (CPS), pp. 416-420, 2015.
[Figure, after the failure: the core that hosted n5 is marked F (failed core) and n5 is placed on the former spare core; the in-buffer detail labelled with n1 and n5 is still shown.]
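The slides show an in-buffer next to the reallocated node but do not spell out the mechanism. The Python sketch below is one possible reading, stated as an assumption: data addressed to a task being moved is held rather than dropped, and is delivered to the new core once the move completes. The `Platform` class, its method names, the one-task-per-core restriction, and the premise that the old in-buffer stays readable are all hypothetical.

```python
from collections import deque


class Platform:
    """Toy model: one task per core, one in-buffer per core (assumptions)."""

    def __init__(self, mapping):
        self.mapping = dict(mapping)  # task -> core
        self.in_buffer = {core: deque() for core in self.mapping.values()}
        self.migrating = {}           # task -> data held during the move

    def send(self, dst_task, payload):
        if dst_task in self.migrating:
            # Hold the data instead of writing to a core that is going away.
            self.migrating[dst_task].append(payload)
        else:
            self.in_buffer[self.mapping[dst_task]].append(payload)

    def begin_migration(self, task):
        # Take over everything the old core has not consumed yet.
        old_core = self.mapping[task]
        self.migrating[task] = deque(self.in_buffer[old_core])
        self.in_buffer[old_core].clear()

    def end_migration(self, task, new_core):
        # Redirect the task and deliver held data in its original order.
        self.mapping[task] = new_core
        held = self.migrating.pop(task)
        self.in_buffer.setdefault(new_core, deque()).extend(held)


p = Platform({"n1": "c1", "n5": "c5"})
p.send("n5", "a")            # arrives before the failure is handled
p.begin_migration("n5")
p.send("n5", "b")            # arrives while n5 is being moved
p.end_migration("n5", "s")
print(list(p.in_buffer["s"]))
```

Both items reach the new core in sending order, which is the property the slide title asks for.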
Translation of Formal Models into Synthesizable Descriptions
... → Refinement n: implementable (deterministic) model → VHDL description
• Input variables → Input signals
• Internal variables → Internal signals
• Output variables → Output signals
• Model parameters (constants) → Description parameters (generics)
Paper 4. Sergey Ostroumov, Leonidas Tsiopoulos, Juha Plosila, Kaisa Sere, Generation of Structural VHDL Code with Library Components From Formal Event-B Models, In Euromicro Conference on Digital System Design, IEEE Conference Publishing Services (CPS), pp. 111-118, 2013.
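The element-by-element correspondence on this slide can be illustrated with a small Python generator that emits a VHDL skeleton. It is a sketch of the mapping only, not the tool from Paper 4: the function name, the use of `integer` for every type, and the `counter` example are assumptions, and at least one port is assumed to exist.

```python
def vhdl_skeleton(name, generics, inputs, outputs, internals):
    """Render a VHDL entity and an empty architecture from model element names.

    generics:  model constants  -> VHDL generics
    inputs:    input variables  -> in ports (input signals)
    outputs:   output variables -> out ports (output signals)
    internals: internal variables -> architecture-level signals
    """
    lines = [f"entity {name} is"]
    if generics:
        body = ";\n".join(f"    {g} : integer := {v}" for g, v in generics.items())
        lines.append(f"  generic (\n{body}\n  );")
    ports = [f"    {p} : in integer" for p in inputs]
    ports += [f"    {p} : out integer" for p in outputs]
    lines.append("  port (\n" + ";\n".join(ports) + "\n  );")
    lines.append(f"end entity {name};")
    lines.append("")
    lines.append(f"architecture rtl of {name} is")
    lines += [f"  signal {s} : integer;" for s in internals]
    lines.append("begin")
    lines.append("end architecture rtl;")
    return "\n".join(lines)


# Hypothetical model: one constant, one input, one output, one internal variable.
vhdl = vhdl_skeleton("counter", {"MAX": 15}, ["enable"], ["count"], ["next_count"])
print(vhdl)
```

The architecture body is left empty on purpose: translating the events of the model into behaviour is outside the scope of this sketch.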
Design Flow for Structural Description Generation
• ... → Refinement n: implementable (deterministic) model → Refinement n+1: implementable model with functions
• Automated VHDL code generation → VHDL code with library components
• Model simulation → model results; code simulation → code results
• Test cases, online testing
Paper 5. Sergey Ostroumov, and Leonidas Tsiopoulos, VHDL Code Generation from Formal Event-B Models, In Euromicro Conference on Digital System Design: Architectures, Methods and Tools, IEEE Computer Society Conference Publishing Services (CPS), pp. 127-134, 2011.
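The flow pairs model results with code results. A minimal Python sketch of such a comparison follows; the two step functions are stand-ins invented here for the Event-B model simulation and the VHDL simulation, and running the same test cases through both is an assumption about how the results are compared.

```python
def model_step(x):
    """Stand-in for simulating the formal model (hypothetical behaviour)."""
    return (x + 1) % 4


def code_step(x):
    """Stand-in for simulating the generated code (hypothetical behaviour)."""
    return (x + 1) & 3


def compare(test_cases, model, code):
    """Return (input, model result, code result) for every disagreement."""
    return [(t, model(t), code(t)) for t in test_cases if model(t) != code(t)]


mismatches = compare(range(8), model_step, code_step)
print(mismatches)
```

An empty list means the two simulations agree on all supplied test cases; it says nothing about inputs that were not tested.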
Overall Picture
Performance, Energy, Resilience
Formal Methods
Agent-Based Systems
Many-Core Platforms
Conclusion
• Formal rigorous development of agent-based systems, taking into account their hierarchical organization
• Various dynamic reconfiguration procedures integrated into the agents' functionality, simultaneously considering efficiency (performance, overhead, etc.) and resilience attributes
• A mechanism that prevents data loss when the underlying platform is reconfigured at run-time
• An approach to translating a model into a synthesizable description, in order to facilitate easier derivation of the implementation and evaluation of non-functional properties in a real-world environment
• The approach proposed in the thesis has been validated and evaluated by several case studies from different application domains
Future Work
• More advanced task reallocation procedure considering various placements of available resources (also within other clusters)
• Monitoring procedures for rapid invocation of dynamic reconfiguration
• Dynamic reconfiguration for the agents
• SW/HW co-design at the formal level