Validation and evaluation of a software solution for fault tolerant ...

Validation and evaluation of a software solution for fault tolerant distributed synchronization P. Ballarini, S. Bernardi and S. Donatelli Dipartimento di Informatica, Università di Torino, Corso Svizzera 185, 10149 Torino, Italy E-mail:fballarin,bernardi,susi [email protected] Abstract This paper presents a case study on the combined use of different tools and techniques for the validation and evaluation, from the early stages of the design, of a fault tolerant software mechanism named distributed synchronization The mechanism has been specified using UML state charts and sequence diagrams. A number of Stochastic Well-formed Nets (SWN) models have been derived from the specifications: they have been composed using the tool algebra, and the resulting model has been model-checked using the PROD tool for temporal logic properties, thanks to a GreatSPN-to-PROD translator. The quantitative analysis has been performed using the SWN solvers of the GreatSPN tool.

1 Introduction Modelling and evaluation is traditionally an activity that takes place at the latest stages of the development of a system, sometimes even after the system is already in the operational phase, although in recent years we have seen a number of papers that try to produce performance models from (even partial) specifications of the system, most notably in the field that studies the generation of performance models starting from various UML specification diagrams (see for example [2, 3, 4]). This paper follows this line of research and reports on an effort conducted to support the design of a software synchronization mechanism using a suite of tools based on Petri nets. The paper addresses the use of Stochastic Well-formed Nets [12, 13] (SWN) and related tools to support the validation and evaluation from the early stages of the design. Peculiar aspects of the early-stage evaluation are: 1. frequent changes in the system being modelled (asking for high flexibility and reuse); 2. delays are usually unknown, or fixed only by an order of magnitude (so that typically sensitivity analysis is performed);

3. need for quick macro models (to quickly evaluate possible macro alternatives) 4. need for detailed models (to validate and evaluate the chosen proposed solution). The paper addresses points 1, 2 and 4, while an example of analysis for point 3 can be found in [8]: point 3 and 4 are strictly related since it would be interesting to be able to prove some sort of “consistency” between the macro models and the detailed ones. As far as we know there are not many attempts along this line. We have taken a stochastic approach to evaluation, and we use as our modelling formalism Stochastic Well-formed Nets [12, 13] (SWN) a colored extension of Generalized Stochastic Petri Nets[1] (GSPN). GSPN and SWN allow delays to be exponentially distributed, and many tools allow also non exponential distributions, especially if the solution method is simulation. Indeed in our work we are not much concerned with specific distributions (we are still in the early stages of the design), but we assume that the qualitative and quantitative models have the same behaviour (same reachability graph), and a trivial condition for this to be satisfied is that all distributions have infinite support. A number of tools allow the definition and solution of GSPN models and of colored GSPN models of some sort, like: APNNtoolbox [16], GreatSPN [11], SMART [14], TimeNET [20], UltraSAN [18]. In this paper we use GreatSPN since we want to take advantage of the efficient solution methods of SWN (both analytical and simulation [5]) and of a number of additional tools/programs that have been recently added to GreatSPN: Multisolve [6], algebra [7] and GreatSPN-to-PROD [9]. Multisolve is a Java interface of GreatSPN to plan and execute solution experiments (using any of the GreatSPN solvers for GSPN and SWN) and to produce gnuplot and postscript files of the results. Similar facilities are present in most performance evaluation tools, and they are of great help while performing the sensitivity analysis demanded by the point 2 above. algebra is a tool for composing SWN, and it allows the composition over immediate and timed transitions (syn-

chronizing transitions) based on non-injective transition labelling, and the composition over places (communicating places), again based on a non-injective place labelling. algebra also allows us to select whether the variables on input arcs from different models to a synchronized transition should be unified or not. It is an essential tool when frequent changes and reuse are needed, as for point 1 above. Validation of Stochastic Petri Nets is usually supported by the computation of a number of structural properties, like P- and T-semiflows, conflict sets, structural deadlocks and traps, while a much smaller number of properties are available, and implemented, for SWN. For example, to the best of our knowledge, the only available computation of SWN semiflows is the one in CPN- AMI, but for P-semiflows [15] only and with a number of restrictions on the class of accepted models. Moreover there is in most tools a limited possibility of examining the state space, for GSPN but especially for SWN: GreatSPN provides a textual description of the reachability graph (possibly symbolic for SWN) together with the list of deadlock states, and number of livelocks and of strongly connected components (for GSPN only). These facilities are absolutely inadequate when models become very large and colors come into play. PROD [19] is a tool for the definition and validation of place/transition nets and of the high level Petri net class known as Predicate/transition (Pr/T) nets. PROD allows the construction of the reachability graph of a net and its inspection by means of the interactive program probe. It allows the verification of the RG by displaying paths between RG nodes and by testing propositional logic formulae and temporal logic formulae expressed in Computational Tree Logic (CTL) [17]. An important characteristic of PROD is that all formulae are expressed in terms of net elements like marking of places and firing of transitions, therefore using a language that is similar (not too different may be) to the one used for the definition of the performance indices. GreatSPN-to-PROD [9] is a tool that allows the use of the validation facilities of PROD by translating GSPN and SWN nets into algebraic nets in PROD format. The translation also produces a number of ad-hoc macros that allow an easy check of the most common properties of nets, like the computation of the minimum path to a deadlock or the markings enabling a given set of transitions. The case study of the paper is a software mechanism called distributed synchronization (DS), that has been developed inside the EEC projects TIRAN [10], and its follow-up DepAuDE, as part of a software framework that allows the cheap addition of fault tolerance provision for systems built from standard COTS components. DS allows the synchronization of a number of user tasks over a number of synchronization points, called levels. Both the user tasks and the DS itself can be distributed over different nodes, to tolerate crashes of a node or of a single DS task. In this

paper we concentrate on the case of a single DS. The validation and evaluation of the DS plays an important role in the project since it is used by a number of other mechanisms of the framework. The emphasis of the paper is on the validation and evaluation process, and on the tools used in the process, since we want to report on the combined use of evaluation and validation from the early stages of the design, more than on the specific performance results. The paper is organized as follows: Section 2 introduces the distributed synchronization mechanism. Section 3 introduces the initial SWN models of the DS components and their composition into a single SWN model. Section 4 introduces a variation of the pattern of use of the DS by the users. The “stochastic” evaluation of the system seems satisfactory but a closer look shows that a deeper qualitative analysis is needed to validate the model using GreatSPNto-PROD and PROD. Section 5 shows the new definition of the DS as suggested by the problems detected in the previous section, and the corresponding SWN model, that is again validated and finally evaluated. Section 6 concludes the paper by taking a critical view point to this modelling activity.

2 The DS mechanism Distributed Synchronization (DS) is a software mechanism whose objective is to allow tasks (possibly distributed over a net) to synchronize on the execution of certain activities. Different levels of synchronization are supported, representing different, possibly parallel, independent activities for which synchronization is needed. We consider n-tasks (t1 : : :tn ) that require synchronization services on m-levels (l1 : : :lm ). Peculiar aspects of the DS behaviour are:

synchronizing tasks and DS are executing on a distributed architecture, made of a certain number of node; participants to synchronization depend on the level: on different levels, participants may be different; participating tasks may fail during their execution and this must not block the other tasks that are waiting for a synchronization. DS has been specified as a UML class with an attribute

DSTable and two methods, one for checking if a synchro-

nization is possible, according to the table, and one for changing the state of the table according to the stimulus received. The DSTable is used by the DS to keep track of the evolution of the activities (who’s finished, who’s participating, who’s still executing). The entry DSTable i j represents

[ ][ ]

3 The Petri net model of the DS

task failed task failed

Last reached LR

d he el ac ev re l o nt hr re nc ur sy c at

Not reached Not available NR NA task partecipates in synchronization task ready task task risen failed and ready task task ready failed Reached R

Restart RS

synchro reached at current level

Figure 1. Automaton of the DS table

the state of task ti with respect to level lj . The possible values are: Not Reached (task ti participates in a synchronization at level lj and has not yet completed), Reached (task ti completed on level lj ), Not Available (task ti fails and it should not be taken in consideration for synchronization), Restarted (task ti is alive again and ready to synchronize on level lj ), and Last Reached (level lj is the last one on which synchronization has been reached). The change of state of a single entry is described by the automaton of Figure 1, while the behaviour of the DS is depicted by the automaton of Figure 2. To better describe a system a number of sequence diagrams were also provided, to show specific paths of execution, together with snapshots of the DSTable during the path. As an example, let us consider what happens when a ready message from a user task t on level l is received by the DS: the state of the automaton of Figure 2 changes to checking tasks, and the entry ( t l ) of the DSTable changes from LR to R or from NR to R. If instead a task t fails the automaton moves to the disconnecting task state, and all entries of the table for task t are set to Not Available.

[ ][ ]

A full description of the DS is beyond the scope of this paper, but we hope that what has been described is enough for the reader to appreciate (or depreciate) the analysis efforts described in the remaining sections.

sending msg of reached synchro

wait for message synchro not reached

task failed

task ready

disconnecting task

task disconnected

synchro reached

checking tasks

Figure 2. Automaton of the DS states

The SWN component models that we consider as basic elements for the DS are: tasks the user tasks requiring synchronization; backbone the model that accounts for fault and repair (the particular name come from the TIRAN project) ds the distributed synchronization (translation of the automaton of Figure 2); check synch represents the method for checking if a synchronization has been reached. DStab the DS table (translation of the automaton of Figure 1); All SWN models share the definition of two basic color classes T and L, defined as: T = u Tk and Tk= ft1, . . . , tn g meaning that the class T of tasks is made of a single static subclass that contains n colors, representing the n tasks of the system (the SWN formalism requires the definition of basic color classes in terms of static subclasses, even if there is a single one, like in this case) and L = o L1,: : :,Lm with Li =fli g meaning that the class of levels is made of m static subclasses of one color each; the class is ordered, so that the circular successor function can be applied to colors in this class and the expression “next level” makes sense. We now describe each model in isolation: these models are later composed to form the SWN model of the whole system. The composition operators are superposition over place and transitions, based on place and transitions labels, as defined in [7]. Synchronizing transitions and communicating places are easily recognizable in the figures, since labels are graphically represented by a suffix “ jlabel name” to the place or transition tag. We also remind the reader that in SWN syntax S is a constant function that means “all colours of a (sub)class”, and that “!” indicates the successor function (for ordered classes only); in the SWN models presented in this paper we have used x and l to indicate a variable of colour T (tasks) and L (levels), respectively. Figure 3(a) models the user tasks: each task performs activities independently from the others and then requires a synchronization on a given level by sending a “READY” message to the DS through a mailbox (place with label ”TK DS R”): when the synchronization is reached, an ack message is sent back to the task through another mailbox (place with label “DS TK R”) that may proceed to the next activity towards the next synchronization level (indeed there is the successor function “!” on the arc from rcv-globalsynch to place tk1). A task may fail while working and later be restarted. Failure and restart events are modeled by timed transitions. This model interacts with the backbone through transition labels FAULT and RESTART, with the DS through place labels TK DS R and DS TK R. The initial marking is non null only in place tk1, where there

are two tokens, meaning that tasks t1 and t2 are working to reach level l1. (a)

The model is also tightly interfaced with the model of the DSTable, as explained later in this section.

(b) T,L tk1 Mtk0

working

T,L tk2

snd-ready tk3|TK_DS_R

tk5

T tk6

tk2|RESTART T,L

T,L

tk1|FAULT

T,L tk4|DS_TK_R

bk2 T snd-tk-failed

ds5

ds5|synchro

3

π

T bk3 bk3|BK_DS_F T

sendingL

ds7

ds1|BK_DS_F π

T

2

L reached-synch

snd-global

ds1|not-reached

failure

rcv-global-synch

ds8|ready

2

π

wait-msg

bk2|RESTART

T,L ds5|DS_TK_R

T,L ds3|LR

bk1 I T bk1|FAULT

L ds6

T,L ds2|TK_DS_R

ds4|reached

ds9|check-synch

Figure 3. SWN model of tasks (a) and backbone (b)

tk_discon T

Figure 3(b) models a super simplified backbone, since the only activities represented are the ones connected to the failure and restart of a task. In case of failure, the backbone sends to the DS mechanism a notification message through a mailbox (place labelled “BK DS F”). Please note the use of the # before the name of the variables as a directive to algebra that the variable should be unified upon composition. This model interacts with the user task model through transition labels FAULT and RESTART, with the ds model through place labels “BK DS F”. Since all tasks may fail, the net is initialized with as many tokens in place bk1 as there are tasks. Figure 4 models the distributed synchronization mechanism as specified by the automaton and by the sequence diagrams that, for brevity, we have not reported here. We consider the case of a single instance of the distributed synchronization task. It is easy to see that the structure of the SWN is basically the same as the automaton of Figure 2. The upper part of the net (transition ds5) represents the sending of an acknowledge message to all tasks that are waiting for it: the structure is more complicated that it should due to the fact that marking dependent arcs are not allowed in SWN. The interface to the check synch model is the synchronization transition label “check-synch” and the communicating place label “levels”. When the DS receives a READY message for level l from an user task, it puts in the communicating place the value l (through transition ds9 and function hli on the arc) and it activates the check-synch model; when instead the check is caused by the failure of a user task the DS model passes to check-synch the whole set of levels (function hS i on the arc) since it is not able to determine, a priori, on which level a synchronization will, eventually, take place.

L tocheck|levels

ds3|check-synch

check-tks

ds2|tkfailed π

2

Figure 4. SWN model of the DS Figure 5 represents the Check-Synch model that checks whether the synchronization conditions are satisfied. It is called by the DS when a READY message from a task is received or when a failed task is detected. Place ck2jlevels contains input values passed to the method: for each level contained in this place synchronization conditions are checked until a level is found such that the conditions are satisfied: in this case the level is passed as a return value to the DS mechanism when the transition ck6jreached is fired (function h li on the input arc to ck6). In case the conditions are not satisfied for any level contained in place ck2jlevels no value is passed to the DS mechanism and the transition ck8jnot-reached is fired. Given a level l, the synchronization conditions are:

#

:

[ ][l] 62 fLR, NRg, and ( there 9 at least a task t 2 T : DStable[t][l] = R or there 9 at least a task t 2 T : DStable[t][l] = RS)

a for all tasks t 2 T DStable t b1 b2

If these conditions are satisfied, either one of the two conflicting transitions ck3 (representing condition a and b1) or ck4 (representing condition a and b2) fires; otherwise transition ck5 (with lower priority) fires. The model needs to check the state of the DS table and indeed the four places of labels R, RS, LR, NR are going to be superposed with the places of equal label in the model of the DSTable of Figure 6. The SWN model of Figure 6 is obtained from the automaton of Figure 1. The DSTable is modified every time

idle

all_lev

clean

ck1|check-synch

NR|NR Motab T,L all_tks all_tks [d(other) d(l)] LR_to_NR|synchro π 3

L ck2|levels

ck1 ck2 L ck4|R ck3 ck5|RS ck6|LR T,L T,L T,L

π

ck3 2

ck4 π

all_lev

2

t5|ready

t4|tkfailed

all_lev all_lev

π

2

t6|ready

KO

π

2

RS|RS T,L

R|R T,L R_to_LR|synchro

π

2

RS_to_LR|synchro

all_tks

π

2

all_tks

ck7

emptying

3

ck8 end1 π

ck9 ck8|not-reached

2

Figure 5. SWN model of the algorithm for checking sychronization

an event on any task occurs, like when a message sent by a task or by the backbone is received by the DS, and when a synchronization takes place. Places all lev and all tks, on the right portion of the net, have been introduced since SWN do not allow marking dependent arcs. All other places are colored with the pair “T,L”, since there are as many entries in the table as the cross product of tasks and levels. Initially all entries of the table are set to the value “not reached”, and therefore the only non empty place in the net is place NR. The model is interfaced with the DS model through transition labels tkfailed, ready and synchro, and through place label LR, and with the check synch model though place labels R, RS, LR, and NR. The program algebra has been used to produce the complete model; the command: algebra net1 net2 par1 net3 composes the SWN models of name net1 and net2 over places or transitions, or both, depending on the value (p, t, or b) of par1, and names the resulting net net3. The model of the whole system has been produced using the following script: algebra algebra algebra algebra

all_lev

t3|tkfailed

other

π

π

2

OK L ck6|reached

2 NA NA_to_NR|synchro π 3 T,L [d(other) d(l)] p T

π 2 all_lev all_lev

π

ck5

t2|tkfailed

t7|ready

LR|LR T,L

T,L ck7|NR

all_lev t1|tkfailed π 2

tasks bk t tk-bk ds check-synch b ds-check tk-bk ds-check p tk-bk-ds2 DStab tk-bk-ds2 b final

t1|tkfailed t3|tkfailed t1|tkfailed t3|tkfailed t4|tkfailed t2|tkfailed t4|tkfailed I t2|tkfailed all_lev L

R_to_LR|synchro LR_to_NR|synchro R_to_LR|synchro

LR_to_NR|synchro

I all_tks T

Figure 6. SWN model of the DS table State space analysis The model has 31 tangible states and 731 vanishing for 2 tasks and 2 levels, and it can be very quickly solved with the SWN solvers of GreatSPN. We have also produced a version of the model in which tasks are not restarted upon a failure, that is to say, when a task fails it does not re-enter play. This was simply done by deleting the transitions with label RESTART in the task model: we call this modified model “open”. The run of GreatSPN now reveals 15 tangible state, 385 vanishing and a single deadlock that is easy to interpret as the state in which both tasks have failed. The number of vanishing would have been bigger if not for the extensive use of priorities in the model, to avoid the construction of unnecessary interleavings. To investigate further we have used GreatSPN-toPROD and PROD. The closed and open models have been translated into PROD using GreatSPN-to-PROD, and this has required a little help from the user (indeed, as explained in [9], PROD does not allow inhibitor arcs, so the translator builds a complementary place for each inhibiting place, but then user intervention is needed to define the initial marking of the complementary place and the associated arc functions). Using the macro of GreatSPN-to-PROD we are able to easily check a number of properties (see [9] for more details), and for the open model we have computed the shortest path from the initial marking to the deadlock state using the “PathTOdeadlock” macro. The resulting path is displayed by PROD as follows: 0#PathTOdeadlock Arrow 1: transition Arrow 0: transition Arrow 0: transition Arrow 1: transition

tk1|FAULT, x = 3 l = 0 snd|tkfailed, x = 3 failure, x = 3 t2|tkfailed, l = 1 x = 3

Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow Arrow

0: 0: 1: 0: 0: 0: 0: 0: 0: 0: 0: 0: 1: 0: 0: 1: 0: 0: 0: 0: 0: 0:

transition transition transition transition transition transition transition transition transition transition transition transition transition transition transition transition transition transition transition transition transition transition

t2|tkfailed, l = 0 x ds3|check-synch, x = ck2, l = 1 ck5, l = 1 other, l = 0 ck2, l = 0 ck5, l = 0 ck7, ds1|not-reached, tk1|FAULT, x = 2 l = snd|tkfailed, x = 2 failure, x = 2 t2|tkfailed, l = 1 x t2|tkfailed, l = 0 x ds3|check-synch, x = ck2, l = 1 ck5, l = 1 other, l = 0 ck2, l = 0 ck5, l = 0 ck7, ds1|not-reached,

= 3 3

0

= 2 = 2 2

where each line says that a given transition has fired for a given instantiation of the variables. The translator has mapped the colour class L onto PROD colours 0 and 1, and T onto 2 and 3. The initial information on the arrow is not relevant in this context. The closed model, as expected, has no deadlock, but we would like to check whether there are states in which the system is not performing any “useful activity”, so we have checked if there are states in which the only enabled transition is tk2jRESTART. Again we have used a GreatSPNto-PROD macro called “AllMarkEnabOnly”, and we got: 0#AllMarkEnabOnly(tk2$RESTART(1)) Node 257, belongs to strongly connected component %%0 dual_tk4$DS_TK_R: +++ dual_LR$LR: +++ dual_NR$NR: +++ NA: +++ tk6: + bk3: + wait$msg: idle:

The only state satisfying this property is node 257, whose description in terms of marking over places is also shown by PROD. The construction of the shortest path to state 257 produces a path that is exactly equal to the one that leads to a deadlock in the open model, as expected.

4 Changing the user task model The user tasks considered in the previous section reach the synchronization levels in order, that is to say: a new ready message is sent only after a message of “reached synchronization” on the previous level has been received, and all tasks follow the same sequence of synchronization level. This task model is adequate, for example, when the DS is used to synchronize pieces of computations as in a sequence of fork and join statements, but it is not adequate in

other cases, for example, the case of the TIRAN mechanism called distributed memory, that manages a replicated and distributed set of variables. The distributed memory tasks, that are running on different nodes, use the DS to synchronize the distributed writes: there is one level per variable, and there is no reason why levels should be ordered, since it is indeed possible that a new request for a write on a variable arrives before the previous request (on another variable) is over. The only component of the SWN affected by this change is the task model of Fig. 3(a): the initial marking has been changed so that in place tk1 there is now one token per element in the Cartesian product of the color classes T and L, and the successor function has been deleted from the function on the arc from transition rcv-global-synch to place tk1. Also the colors have been changed, since there is no need now to keep the color class L ordered and split in static subclasses, so the new definition is L = u Lv and Lv= f l1, l2g The model has been produced by running again the same script of algebra commands as in the previous section, simply substituting the old model of the tasks with the new one. The analysis with GreatSPN reveals 97 tangible and 2595 vanishing states for the case of 2 tasks and 2 levels, the steady-state analytical solution produces “reasonable” results (all throughputs are different from zero and there is throughput conservation along paths), but the model contains an error, as we shall see in the following, caused by an incomplete specification of the DS behaviour. Following the procedure described in the previous section we have run the GreatSPN-to-PROD translator and used the interactive facility of PROD to perform the reachability analysis, on both the open and closed models. The closed model has the following characteristics: 1 single non trivial terminal strongly connected component (livelock) of 2491 states and 301 strongly connected components of 1 state each. It is not surprising that the steady state results are “reasonable”, since what is happening is that the system evolves through a number of transient states until it reaches the non trivial strongly connected component, where the system stays forever. What is instead unclear is what causes such behaviour. Since the analysis of a livelock of 2491 states is quite time-consuming we have preferred to check the open model first. The open model presents 4 livelocks of 14 states each, and a single deadlock. The deadlock is the expected one: both tasks have failed and since there is no repair the model is stuck. The livelocks are instead more subtle to understand, but they are of limited size, so we have proceeded as follows: we have used PROD to construct the shortest path from the initial marking to one of the livelocks, and to check the possible execution paths inside the livelock. The first observation that we could draw is that the 4 livelocks present a very symmetric behaviour, that is to say they

represents the same behaviour but for different pairs (task, level). By taking a closer look at a single livelock we could observe that the problem found is related to the model component called check synch. Indeed a possible state of the DS table is that task t1 is in state NR for both levels l1 and l2 and that task t2 is in state R for both levels (implying that two ready messages have been sent by t2, but since t1 has not reached the synchronization point on either of the two levels, no global synchronization has taken place). If at this point task t1 fails, both its entries in the table become NA (not available) and task t2 should be able to pass the synchronization on both levels, but the check synch model is such that the check ends when the first synchronization is reached. Is this an error in the model or in the specification? And is our analysis of the causes of the problem right? That is to say: it is true that the DS mechanism does not synchronize tasks as soon as possible? We have decided to express this question as a property to be checked by PROD. Informally the property should state that if the DS table is in a state such that the synchronization over a level can take place, then it should. But what does it mean “it should”? We have formalized our intuition by saying that from any state in which the column for level l of the DS table allows the synchronization over l, then all paths out of that state eventually lead to a state in which the firing of the synchronization transition takes place, passing only through states that do not alter the column l of the table. If is the condition over the column of the table and is the condition of reached synchronization, then we need to check that, for all the states of the system:

() OR Until

NOT

that is to say, either the table is in a state in which the synchronization is not allowed, or the synchronization takes place in a future state, but passing through states in which keeps holding. Both and have to be expressed as marking condition, that results in , for the specific level l1, being equal to the following in PROD syntax: ((

(not (not (not (not

(NR (NR (LR (LR

>= >= >= >=

)) and )) and )) and ))

) and ((R >= ) or (R >= ) or (RS >= ) or (RS >= ) ))

The condition for is much simpler, because when there is a synchronization over level l1 a token of color l1 is put into place reached-synch; this translates into: ( reached-synch >= )

The test with PROD reveals that the property does not hold on all reachable states neither in the open nor in the closed model, and therefore the DS does not behave as expected.

5 Fixing the specification To fix the problem we first thought that it was enough to change the check-synch method (and the corresponding model), but this is not quite the case. Indeed the checksynch model of Figure 5 can be changed by deleting the transition called “emptying”, so that all levels are checked, but this does not work, as the composed model immediately reveals a deadlock since every time that a synchronization on a level has been reached (firing of transition of label “reached”), this causes a synchronization with the DS model, that synchronizes with the DStable model to change the entries corresponding to the synchronized level, but the DS model was not meant to iterate the interaction of reached synchronization with the check-synch model more than once, and this causes a deadlock. This problem is not a wrong interpretation by the modellers of the specifications, since there is clearly no iteration of the DS automaton over reached synchronizations, but instead a case of underspecification of the system. The solution adopted was to limit the check-synch method (and therefore model) to check the synchronization over a single specific level that is an input parameter for the method (a single token in place of the place of label “levels”), and to change the behaviour of the DS automaton of Figure 2 (model of Figure 4) so as to call the check-synch a single time in normal situation (a ready message from a user task is received), and as many times as there are levels in the case of a user task failure. The modified check-synch model is shown in Figure 7 and it is clearly simpler than before. The modified DS is shown in Figure 8: the model execution now cycles jLj times through the paths [check-synch, reached, snd-global, ds7, t14] or [check-synch, not-reached, t14], before coming back to the initial state represented by place wait-msg. The model of the whole system has again been produced with algebra and translated into PROD. The property NOT OR Until is now satisfied on all states, but we still have a livelock in the closed model (of the 2443 states of the model 2110 belong to a livelock). Clearly the model is not able to reproduce the initial state, but why? We took a closer look to the model and decided to check how many states of the model have the same values of the table entries as in the initial marking, by checking the following query on all nodes:

()

0# query node

NR == 1

that builds the set of markings with all table entries equal to NR (place NR contains all pairs of tasks and levels).

T,L ds5|DS_TK_R

idle rcv_R T,L

ck1|check-synch L ck2|levels

π

ck3

2

ck4 π

T,L ck7|NR

ds5|synchro π

ds5 π

3 ds7

ds8 π

ds-check-more L failure

tk_discon T t12

π

we get an empty set: none of the markings of the livelock has the table entries all set to NR. A posteriori this is not surprising, indeed after the first synchronization takes place there is always a level for each task that is equal to LR (to memorize the last level on which the task has synchronized). This behaviour has been approved by the DS designers, and therefore we have not modified the initial marking. On the final closed model we have performed our experiments, and we report here only an example of study of the size of the reachability graph and of a performance index. Table 1 reports the number of vanishing and tangible markings for various values of tasks and levels. The first two columns are the ordinary marking, while the last two columns are the symbolic markings, obtained using the symbolic reachability graph solver of GreatSPN. The size of the Markov chain to be solved is given by the number of tangible symbolic markings, and it can be seen that the saving can be quite significant. As an example of a performance index we have considered the mean waiting time of tasks in place tk5, where the tasks wait for a message of reached synchronization. The diagram of Figure 9 plots two curves of the mean waiting

ds4|reached

ds9|check-synch

P13

tocheck|levels L

ds3|check-synch

check-tks

ds2|tkfailed 2

Figure 8. The modified DS model

Figure 7. The model for checking synchronization

0#build %%0 & %4

ds6 L

ck9

The set built by PROD contains 15 markings and it was automatically named %4. If we now build the intersection among %4 and the livelock set (named %%0 by a previous PRODquery), with:

ds2|TK_DS_R T,L

ds8|ready

ck7

reached-synch L

sending snd-global L

ds1|not-reached

ds1|BK_DS_F T

ck8|not-reached

2

t14

ck5

KO

t13

wait-msg

2

2

OK L ck6|reached

ck1 ck2 L ck4|R ck3 ck5|RS ck6|LR T,L T,L T,L

time in front of a synchronization barrier for a number of tasks of 2 and 3, all with 2 levels, with the rate of the timed transition “working” on the x-axis: as expected the waiting times are larger for the three tasks case, and the faster the task activity the smaller is the waiting time. The two curves have been obtained analytically, while for number of tasks larger than 4 simulation is the only possibility. The diagram of Figure 10 plots instead a similar situation of tasks and levels, but under the hypothesis that no fault can happen. By comparing the two diagrams it appears that the

tasks 1 1 2 2 3 1 3 2 3 4 1 4 2

levels 1 2 1 2 1 3 2 3 3 1 4 2 4

RG TM 4 5 22 73 130 6 1133 307 15636 754 7 15841 1451

VM 38 98 339 2370 2390 202 45510 15580 898510 15317 388 686454 99390

SRG TM 4 4 13 24 34 4 127 43 567 75 4 527 74

VM 38 54 178 630 534 63 4217 1608 28354 1254 72 19849 3435

Table 1. Ordinary and Symbolic Reachability Graphs.

presence of faults decreases the waiting time, which may appear counter-intuitive, but it is perfectly right, since the DS mechanisms does not block, waiting for failed tasks, while trying to reach a global synchronization. Mean waiting time to synchronize: fault case 12

Tk3Lv2 Tk2Lv2

10

8

6

4

2

0 0.2

0.4

0.6

0.8

1

working rate of tk1 (1/ms)

Figure 9. Performance measure for the DS

Mean waiting time to synchronize: absence of faults 14

Tk3Lv2 Tk2Lv2

12

10

8

6

4

2

0 0.2

0.4

0.6

0.8

1

working rate of tk1 (1/ms)

Figure 10. Performance measure for the DS

6 Conclusion In this paper we have presented the validation and evaluation process for a software solution to distributed synchronization. We have a long list of items that deserve more attention in the future, and of things that we have learned from this case study. The first observation is that for the analysis from the early stages of the design to be effective it should be fast and not be perceived as a non useful add-on to the project (evaluation from the early stages encounters similar problems as

formal specification of systems), and that for a model to be correct it is essential that the specification comes with “properties” to be proved, since the fact that a model has an ergodic behaviour is obviously not enough to tell that its behaviour is correct. If (reasonably formal and complete) specifications are not available, then it may be better to limit the study to the macro behaviour of the system (point 3 of the introduction list). To be fast enough for the results to be useful, the availability of compositional facilities and of an experiment planner is a necessity. Symmetries in the model should be exploited also in the analysis: indeed the case of the four livelocks that are instances of the same problem that we have encountered in the case study could have been better tackled if a single parametrized livelock was reported. Large models can be very tricky, and although the obvious answer is that they can be solved using simulation, still there is a need to show that the model used for the simulation is adherent with reality. Model checkers are well known to validate systems with billions of states using very efficient solution methods, but our experience with PROD showed that no efficient and tricky solution methods could be used due to the presence of priorities over transitions. We have not yet investigated other model checkers, but we shall do it in the near future since, if on one side PROD has the big advantage of allowing properties to be expressed as net elements, on the other side it is not very user-friendly. In particular we found it quite difficult to specify properties, both conceptually (choosing the “good property” is not an easy tasks for performance engeneers) and syntactically. The model that we have built has basically no time associated to the synchronization mechanism. If timed transitions are introduced in the check activity or in sending and receiving of messages through the mailboxes then the number of states increases very significantly: a phenomenon that we still do not master completely. As a final comment let us draw your attention on the problem of modelling failure: in the model we have assumed that tasks can fail only when they are in the working state, but what happens if instead they can fail in any state, for example while waiting for a synchronization? The model can change significantly and the number of states can increase accordingly.

References [1] M. Ajmone Marsan, G. Balbo, G. Conte, S. Donatelli, and G. Franceschinis. Modelling with Generalized Stochastic Petri Nets. J. Wiley, 1995. [2] S. Balsamo and M. Simeoni. On transforming UML models into performance models. In ETAPS01 Satellite Event, Genova (ITALY) April,2001.

[3] J. Merseguer, J. Campos, and E. Mena. Performance evaluation for the design of agent-based systems: A Petri net approach. In Mauro Pezzè and Sol M. Shatz, editors, Proceedings of the Workshop on Software Engineering and Petri Nets, within the 21st International Conference on Application and Theory of Petri Nets, Aarhus, Denmark, June 2000.

[12] G. Chiola, C. Dutheillet, G. Franceschinis, and S. Haddad. On Well-Formed coloured nets and their symbolic reachability graph. In Proc. 11th Intern. Conference on Application and Theory of Petri Nets, Paris, France, June 1990. Reprinted in High-Level Petri Nets. Theory and Application, K. Jensen and G. Rozenberg (editors), Springer Verlag, 1991.

[4] A. Bondavalli, I. Maizik, and I. Mura. Automated Dependability Analysis of UML Designs. In Proc. ISORC’99 - 2nd IEEE International Symposium on Object-oriented Real-time distributed Computing, Saint Malo, France, 1999, IEEE Computer Society Press.

[13] G. Chiola, C. Dutheillet, G. Franceschinis, and S. Haddad. A Symbolic Reachability Graph for Coloured Petri Nets. Theoretical Computer Science B (Logic, semantics and theory of programming), 176(1&2):39–65, April 1997.

[5] C. Bertoncello, G. Bruno, G. Franceschinis, G. Lungo Vaschetti, and A. Pigozzi. SWN models of a contact center: a case study. In Proc. 9th International Workshop on Petri Nets and Performance Models, Aachen, Germany, Sept. 11-14 2001. IEEE Computer Society Press.

[14] G. Ciardo and A. S. Miner. SMART: Simulation and Markovian Analyzer for Reliability and Timing. In proc. IEEE International Computer Performance and Dependability Symposium (IPDS’96), Urbana-Champaign, IL, USA. Sept. 1996. IEEE Comp. Soc. Press.

[6] S. Bernardi, C. Bertoncello, S. Donatelli, G. Franceschinis, R. Gaeta, M. Gribaudo, and A. Horváth. GreatSPN in the new Millenium. In Tools of Aachen 2001, Int. Multiconference on Measurement, Modelling and Evaluation of Computer-Communication Systems, Dortmund, (Germany), Sept. 2001. Technical report no.760/2001 - Dortmund Universitaet [7] S. Bernardi, S. Donatelli, and A. Horvàth. Compositionality in the GreatSPN tool and its application to the modelling of industrial applications. Int. Journal of Software Tools for Technology Transfer (STTT), 3(4), August 2001. Springer and Verlag. [8] V. DeFlorio, S. Donatelli, and G. Dondossola. Flexible Development of Dependability Services: An Experience Derived from Energy Automation Systems. In proc. of the 9th annual IEEE Conference and Workshop on Engineering of Computer-Based Systems, Lund, Sweden, April 8-10(11), 2002. [9] S. Donatelli, and L. Ferro. Validation of GSPN and SWN models through the PROD tool. In proc. of the 12th International Conference on Modelling Tools and Techniques for Computer and Communication System Performance Evaluation, London, UK, April 2002, Springer Verlag LNCS 2324. [10] O. Botti, V. De Florio, G. Deconinck, R. Lauwreins, F. Cassinari, A. Bobbio, S. Donatelli, A. Lein, H. Kufner, E. Thurner, and E. Verhulst. The TIRAN approach to reusing software implemented fault-tolerance. In Proc. 8th Euromicro Workshop on Parallel and Distributed Processing (PDP2000) , Rhodos, Greece, Jan. 2000. IEEE Comp. Soc. Press. [11] G. Chiola, G. Franceschinis, R. Gaeta, and M. Ribaudo. GreatSPN 1.7: GRaphical Editor and Analyzer for Timed and Stochastic Petri Nets. Performance Evaluation, special issue on Performance Modelling Tools, (1), 24, 1996.

[15] J.M. Couvreur and J. Martinez. Linear invariants in commutative high-level nets. In proc. 10th Intern. Conference on Application and Theory of Petri Nets, Bonn, Germany, June 1989. [16] P. Kemper F. Bause, P. Buchholz. A toolbox for functional and quantitative analysis of DEDS. In proc. of the 10th International Conference on Modelling Techniques and Tools for Computer Performance Evaluation Palma de Mallorca, Spain, 1998; Springer and Verlag LNCS 1469. [17] K.L. McMillan. Symbolic Model Checking. Kluwer Academic Publishers, 1993. [18]

W. D. Obal II, M. A. Qureshi, D. D. Deavours, and W. H. Sanders. Overview of UltraSAN. In proc. of IEEE Int. Performance and Dependability Symposium, Urbana, IL, USA, Sept. 4-6, 1996. http://www.crhc.uiuc.edu/UltraSAN.

[19] K. Varpaaniemi, J. Halme, K. Hiekkanen, and T. Pyssysalo. PROD reference manual. Technical Report Series B, number 13, Helsinki University of Technology, August 1995. [20] Zimmermann, A.; Freiheit, J.; German, R. Hommel, G.: Petri Net Modelling and Performability Evaluation with TimeNET 3.0. In the proc. of the 11th Int. Conf. on Modelling Techniques and Tools for Computer Performance Evaluation (TOOLS’2000), Springer and Verlag LNCS 1786, pp. 188-202, 2000.

Validation and evaluation of a software solution for fault tolerant ...

Validation and evaluation of a software solution for fault tolerant ...

Suggest Documents

Validation and evaluation of a software solution for

Fault-Tolerant SOftware

Fault-Tolerant SOftware

A Fault-Tolerant Software Architecture for COTS-Based Software ...

Fault Tolerant Software Architectures - CiteSeerX

Evaluation of Software-Based Fault-Tolerant Techniques ... - ThinkMind

Reliability Allocation for Fault Tolerant Software

Fault Tolerant Supercomputing: A Software Approach - CiteSeerX

Evaluation of a Fault-tolerant Model for Tactic ... - Semantic Scholar

Evaluation of a Fault-tolerant Model for Tactic

Fault Tolerant Software Systems Using Software Configurations for ...

A Fault-Tolerant Software Architecture for Component ... - CiteSeerX

Software for a Fault-Tolerant Microcontroller Network - Cs.UCLA.Edu

Software Exploitation of a Fault-Tolerant Computer with a ... - CiteSeerX

a scaleable and fault-tolerant architecture for

dependability evaluation of fault tolerant ... - Semantic Scholar

Performance Evaluation of Fault-Tolerant Controllers

A benchmark for fault tolerant flight control evaluation - EUCASS book ...

A benchmark for fault tolerant flight control evaluation - CiteSeerX

A benchmark for fault tolerant flight control evaluation - EUCASS book ...

A Design Method for Fault Reconfiguration and Fault-Tolerant Control ...

Software Based Fault Tolerant Computing Using

Fault-Tolerant Deployment of Embedded Software for ... - Columbia CS

simulations with fault-tolerant controller software of a digital valve