Jan 30, 2018 - nected stochastic control systems and interconnected abstractions in the probabilistic setting. ... design controllers for them, and then refine the controllers to the ones for the concrete complex ...... Principles of model checking.
COMPOSITIONAL CONSTRUCTION OF INFINITE ABSTRACTIONS FOR NETWORKS OF STOCHASTIC CONTROL SYSTEMS
arXiv:1801.10505v1 [cs.SY] 30 Jan 2018
ABOLFAZL LAVAEI1 , SADEGH SOUDJANI2 , AND MAJID ZAMANI1
Abstract. This paper is concerned with a compositional approach for constructing infinite abstractions of interconnected discrete-time stochastic control systems. The proposed approach uses the interconnection matrix and joint dissipativity-type properties of subsystems and their abstractions described by a new notion of so-called stochastic storage functions. The interconnected abstraction framework is based on new notions of so-called stochastic simulation functions, constructed compositionally using stochastic storage functions of components. Using stochastic simulation functions one can quantify the distance between original interconnected stochastic control systems and interconnected abstractions in the probabilistic setting. Accordingly, one can leverage the proposed results to perform analysis and synthesis over abstract interconnected systems, and then carry the results over concrete ones. In the first part of the paper, we derive dissipativity-type compositional reasoning for the quantification of the distance in probability between the interconnection of stochastic control subsystems and that of their abstractions. Moreover, we consider specifications expressed as syntactically co-safe linear temporal logic formulae and show how a synthesized policy for the abstract system can be refined to a policy for the original system while providing guarantee on the probability of satisfaction. In the second part of the paper, we focus on a class of discrete-time nonlinear stochastic control systems with independent noises in the abstract and concrete subsystems, and propose a computational scheme to construct abstractions together with their corresponding stochastic storage functions. We demonstrate the effectiveness of the proposed results by constructing an abstraction (totally 3 dimensions) of the interconnection of three discrete-time nonlinear stochastic control subsystems (together 222 dimensions) in a compositional fashion such that the compositionality condition does not require any constraint on the number or gains of the subsystems. We also employ the abstraction as a substitute to synthesize a controller enforcing a syntactically co-safe linear temporal logic specification.
1. Introduction Large-scale interconnected systems have received significant attentions in the last few years due to their presence in real life systems including power networks and air traffic control. Each complex real-world system can be regarded as an interconnected system composed of several subsystems. Since these large-scale network of systems are inherently difficult to analyze and control, one can develop compositional schemes and employ the abstractions of the given networks as a replacement in the controller design process. In other words, in order to overcome the computational complexity in large-scale interconnected systems, one can abstract the original concrete system by a simpler one with potentially a lower dimension. Those abstractions allow us to design controllers for them, and then refine the controllers to the ones for the concrete complex systems, while provide us with the quantified errors in this controller synthesis detour. In the past few years, there have been several results on the construction of (in)finite abstractions for stochastic systems. Existing results for continuous-time systems include infinite approximation techniques for jumpdiffusion systems [JP09], finite bisimilar abstractions for incrementally stable stochastic switched systems [ZAG15] and randomly switched stochastic systems [ZA14], and finite bisimilar abstractions for incrementally stable stochastic control systems without discrete dynamics [ZMEM+ 14]. Recently, compositional construction of infinite abstractions is discussed in [ZRME17] using small-gain type conditions and of finite bisimilar abstractions in [MSSM17] based on a new notion of disturbance bisimilarity relation. For discrete-time stochastic models with continuous state spaces, finite abstractions are initially employed in [APLS08] for formal synthesis of this class of systems. The algorithms are improved in terms of scalability in 1
2
ABOLFAZL LAVAEI1 , SADEGH SOUDJANI2 , AND MAJID ZAMANI1
[SA13, Sou14] and implemented in the tool FAUST [SGA15]. Extension of the techniques to infinite horizon properties is proposed in [TA11] and formal abstraction-based policy synthesis is discussed in [TMKA13]. A new notion of approximate similarity relation is proposed in [HSA17] that takes into account both deviation in stochastic evolution and in outputs of the two systems. Recently, compositional construction of finite abstractions is discussed in [SAM15] and [LSZ18] using dynamic Bayesian networks and finite Markov decision processes, respectively, and of infinite abstractions in [LSMZ17] using small-gain type conditions all for discretetime stochastic control systems. In this paper, we provide a compositional approach for the construction of infinite abstractions of interconnected discrete-time stochastic control systems using the interconnection matrix and joint dissipativity-type properties of subsystems and their abstractions. Our abstraction framework is based on a new notion of stochastic simulation functions under which an abstraction, which is itself a discrete-time stochastic control system with potentially a lower dimension, performs as a substitute in the controller design process. The stochastic simulation function is used to quantify the error in probability in this detour controller synthesis scheme. As a consequence, one can leverage our proposed results to synthesize a policy that satisfies a temporal logic property over the abstract interconnected system and then refine this policy back for the concrete interconnected one. Our proposed approach differs from the one in [LSMZ17] in three directions. First and foremost, rather than using small-gain type reasoning, we employ the dissipativity-type compositional reasoning that may not require any constraint on the number or gains of the subsystems for some interconnection topologies (cf. case study). Second, we provide a scheme for the construction of infinite abstractions for a class of discrete-time nonlinear stochastic control systems whereas the construction scheme in [LSMZ17] only handles linear systems. As our third contribution, we consider a fragment of linear temporal logic (LTL) known as syntactically co-safe linear temporal logic (scLTL) [KV01] whereas the results in [LSMZ17] only deals with bounded invariant. In particular, given such a specification over the concrete system, we construct an epsilon-perturbed specification over the abstract system whose probability of satisfaction gives a lower bound for the probability of satisfaction in the concrete domain. It should be also noted that we do not put any restriction on the sources of uncertainties in the concrete and abstract systems. Thus our result is more general than [ZRME17], where the noises in the concrete and abstract systems are assumed to be the same, which means the abstraction has access to the noise of the concrete system. Finally, we show the effectiveness of dissipativity-type compositional reasoning for large-scale systems by first constructing an abstraction (totally 3 dimensions) of the interconnection of three discretetime nonlinear stochastic control subsystems (together 222 dimensions) in a compositional fashion. Then, we employ the abstraction as a substitute to synthesize a controller enforcing a syntactically co-safe linear temporal logic specification over the concrete network.
2. Discrete-Time Stochastic Control Systems 2.1. Preliminaries. We consider a probability space (Ω, FΩ , PΩ ), where Ω is the sample space, FΩ is a sigmaalgebra on Ω comprising subsets of Ω as events, and PΩ is a probability measure that assigns probabilities to events. We assume that random variables introduced in this article are measurable functions of the form X : (Ω, FΩ ) → (SX , FX ). Any random variable X induces a probability measure on its space (SX , FX ) as P rob{A} = PΩ {X −1 (A)} for any A ∈ FX . We often directly discuss the probability measure on (SX , FX ) without explicitly mentioning the underlying probability space and the function X itself. A topological space S is called a Borel space if it is homeomorphic to a Borel subset of a Polish space (i.e., a separable and completely metrizable space). Examples of a Borel space are the Euclidean spaces Rn , its Borel subsets endowed with a subspace topology, as well as hybrid spaces. Any Borel space S is assumed to be endowed with a Borel sigma-algebra, which is denoted by B(S). We say that a map f : S → Y is measurable whenever it is Borel measurable.
COMPOSITIONAL CONSTRUCTION OF INFINITE ABSTRACTIONS FOR NETWORKS OF STOCHASTIC CONTROL SYSTEMS 3
2.2. Notation. The following notation is used throughout the paper. We denote the set of nonnegative integers by N := {0, 1, 2, . . .} and the set of positive integers by N≥1 := {1, 2, 3, . . .}. The symbols R, R>0 , and R≥0 denote the set of real, positive and nonnegative real numbers, respectively. Given a vector x ∈ Rn , kxk denotes the Euclidean norm of x. Symbols In and 1n denote respectively the identity matrix in Rn×n and the column vector in Rn×1 with all elements equal to one. Given N vectors xiP ∈ Rni , ni ∈ N≥1 , and i ∈ {1, . . . , N }, we use x = [x1 ; . . . ; xN ] to denote the corresponding vector of dimension i ni . We denote by diag(a1 , . . . , aN ) a diagonal matrix in RN ×N with diagonal matrix entries a1 , . . . , aN starting from the upper left corner. Given QN QN QN functions fi : Xi → Yi , for any i ∈ {1, . . . , N }, their Cartesian product i=1 fi : i=1 Xi → i=1 Yi is defined QN as ( i=1 fi )(x1 , . . . , xN ) = [f1 (x1 ); . . . ; fN (xN )]. For any set A we denote by AN the Cartesian product of Q∞ a countable number of copies of A, i.e., AN = k=0 A. A function γ : R≥0 → R≥0 , is said to be a class K function if it is continuous, strictly increasing and γ(0) = 0. A class K function γ is said to be a class K∞ if γ(r) → ∞ as r → ∞. 2.3. Discrete-Time Stochastic Control Systems. We consider stochastic control systems in discrete time (dt-SCS) defined over a general state space and characterized by the tuple Σ = (X, U, W, ς, f, Y1 , Y2 , h1 , h2 ),
(2.1)
where X is a Borel space as the state space of the system. We denote by (X, B(X)) the measurable space with B(X) being the Borel sigma-algebra on the state space. Sets U and W are Borel spaces as the external and internal input spaces of the system. Notation ς denotes a sequence of independent and identically distributed (i.i.d.) random variables on a set Vς as ς := {ς(k) : Ω → Vς , k ∈ N}. The map f : X × U × W × Vς → X is a measurable function characterizing the state evolution of the system. Finally, sets Y1 and Y2 are Borel spaces as the external and internal output spaces of the system, respectively. Maps h1 : X → Y1 and h2 : X → Y2 are measurable functions that map a state x ∈ X to its external and internal outputs y1 = h1 (x) and y2 = h2 (x), respectively. For given initial state x(0) ∈ X and input sequences ν(·) : N → U and w(·) : N → W , evolution of the state of dt-SCS Σ can be written as x(k + 1) = f (x(k), ν(k), w(k), ς(k)) Σ : y1 (k) = h1 (x(k)) k ∈ N. (2.2) y2 (k) = h2 (x(k)), Remark 2.1. The above definition can be generalized by allowing the set of valid external inputs to depend on the current state and internal input of the system, i.e., to include {U (x, w)|x ∈ X, w ∈ W } in the definition of dt-SCS which is a family of non-empty measurable subsets of U with the property that K := {(x, ν, w) : x ∈ X, w ∈ W, ν ∈ U (x, w)} is measurable in X × U × W . For the succinct presentation of the results, we assume in this paper that the set of valid external inputs is the whole external input space: U (x, w) = U for all x ∈ X and w ∈ W , but the obtained results are generally applicable. Given the dt-SCS in (2.1), we are interested in Markov policies to control the system. Definition 2.2. A Markov policy for the dt-SCS Σ in (2.1) is a sequence ρ = (ρ0 , ρ1 , ρ2 , . . .) of universally measurable stochastic kernels ρn [BS96], each defined on the input space U given X × W and such that for all (xn , wn ) ∈ X × W , ρn (U |(xn , wn )) = 1. The class of all such Markov policies is denoted by ΠM . We associate respectively to U and W the sets U and W to be collections of sequences {ν(k) : Ω → U, k ∈ N} and {w(k) : Ω → W, k ∈ N}, in which ν(k) and w(k) are independent of ς(t) for any k, t ∈ N and t ≥ k. For 1 any initial state a ∈ X, ν(·) ∈ U, and w(·) ∈ W, the random sequences xaνw : Ω × N → X, yaνw : Ω × N → Y1
ABOLFAZL LAVAEI1 , SADEGH SOUDJANI2 , AND MAJID ZAMANI1
4
2 and yaνw : Ω × N → Y2 that satisfy (2.2) are called respectively the solution process and external and internal output trajectory of Σ under external input ν, internal input w and initial state a.
In the sequel we assume that the state space X of Σ is a subset of Rn . System Σ is called finite if X, U, W are finite sets and infinite otherwise. In this paper we are interested in studying interconnected discrete-time stochastic control systems without internal inputs and outputs that result from the interconnection of dt-SCS having both internal and external inputs and outputs. In this case, the interconnected dt-SCS without internal input and output is indicated by the simplified tuple (X, U, ς, f, Y, h) with f : X × U × Vς → X. 3. Stochastic Storage and Simulation Functions In this section, we first introduce a notion of so-called stochastic storage functions for the discrete-time stochastic control systems with both internal and external inputs which is adapted from the notion of storage functions from dissipativity theory [AMP16]. We then define a notion of stochastic simulation functions for systems with only external input. We use these definitions to quantify closeness of two dt-SCS. ˆ 1, h ˆ 2 ) with b = (X, ˆ U ˆ, W ˆ , ςˆ, fˆ, Y1 , Yˆ2 , h Definition 3.1. Consider dt-SCS Σ = (X, U, W, ς, f, Y1 , Y2 , h1 , h2 ) and Σ ˆ the same external output spaces. A function V : X × X → R≥0 is called a stochastic storage function (SStF) b to Σ if there exist α ∈ K∞ , κ ∈ K, ρext ∈ K∞ ∪ {0}, some matrices G, G, ˆ H of appropriate dimensions, from Σ ¯ ¯ ij , i, j ∈ {1, 2}, and some symmetric matrix X of appropriate dimension with conformal block partitions X ˆ one has such that for any x ∈ X and xˆ ∈ X ˆ x)k) ≤ V (x, xˆ), α(kh(x) − h(ˆ
(3.1)
ˆ ∀ˆ ˆ ∃ν ∈ U such that ∀w ˆ ∀w ∈ W one obtains and ∀x ∈ X ∀ˆ x∈X ν∈U ˆ∈W h i E V (x(k + 1), x ˆ(k + 1)) x(k) = x, xˆ(k) = xˆ, w(k) = w, w(k) ˆ = w, ˆ ν(k) = ν, νˆ(k) = νˆ − V (x, xˆ)
z T
ˆw Gw − G ˆ ≤ −κ(V (x, xˆ))+ ˆ h2 (x) − H h2 (ˆ x)
¯ X:=
¯ 11 X ¯ 21 X
}|
¯ 12 X ¯ 22 X
{
ˆw Gw − G ˆ +ρext(kˆ ν k)+ψ, ˆ 2 (ˆ h2 (x) − H h x)
(3.2)
for some ψ ∈ R≥0 . b S Σ if there exists a storage function V from Σ b to Σ, in which Σ b is considered as an We use notation Σ abstraction of concrete system Σ.
Remark 3.2. The second condition above implies implicitly existence of a function ν = ννˆ (x, xˆ, νˆ) for satisfaction of (3.2). This function is called the interface function and can be used to refine a synthesized policy νˆ b to a policy ν for Σ. for Σ For the interconnected dt-SCS, the above notion reduces to the following definition.
b = (X, ˆ U ˆ , ςˆ, fˆ, Y, ˆh) with the same output Definition 3.3. Consider two dt-SCS Σ = (X, U, ς,f,Y,h) and Σ ˆ → R≥0 is called a stochastic simulation function (SSF) from Σ b to Σ if spaces. A function V : X × X • ∃α ∈ K∞ such that
ˆ α(kh(x) − h(ˆ ˆ x)k) ≤ V (x, xˆ), ∀x ∈ X, ∀ˆ x ∈ X,
(3.3)
ˆ νˆ ∈ U ˆ , ∃ν ∈ U such that • ∀x ∈ X, x ˆ ∈ X, h i E V (x(k+1), x ˆ(k+1)) x(k) = x, xˆ(k) = x ˆ, ν(k)=ν, νˆ(k) = νˆ −V (x, xˆ) ≤−κ(V (x, xˆ))+ρext (kˆ ν k)+ψ,
(3.4)
for some κ ∈ K, ρext ∈ K∞ ∪ {0}, and ψ ∈ R≥0 .
COMPOSITIONAL CONSTRUCTION OF INFINITE ABSTRACTIONS FOR NETWORKS OF STOCHASTIC CONTROL SYSTEMS 5
The next theorem shows the usefulness of SSF in comparing output trajectories of two dt-SCS in a probabilistic sense. b = (X, ˆ U ˆ , ςˆ, fˆ, Y, ˆh) be two dt-SCS with the same output spaces. Theorem 3.4. Let Σ = (X, U, ς, f, Y, h) and Σ b to Σ, and there exists a constant 0 < κ Suppose V is an SSF from Σ b < 1 such that function κ ∈ K in (3.4) satisfies κ(r) ≥ κ br ∀r ∈ R≥0 . For any random variables a and a ˆ as the initial states of the two dt-SCS and b there exists an input any external input trajectory νˆ(·) ∈ Uˆ preserving Markov property for the closed-loop Σ, trajectory ν(·) ∈ U of Σ through the interface function associated with V such that the following inequality holds P
δ :=
sup kyaν (k) − yˆaˆνˆ (k)k ≥ ε | [a; ˆa]
≤δ
(3.5)
0≤k≤Td
( b T (a,ˆ a) ψ 1−(1− Vα(ε) )(1− α(ε) ) d b
b
if α(ε)≥ ψbκ , b
(a,ˆ a) ψ ( Vα(ε) )(1−b κ)Td+( κbα(ε) )(1−(1−b κ)Td ) if α(ε)< ψbκ ,
provided that there exists a constant ψb ≥ 0 satisfying ψb ≥ ρext (kˆ ν k∞ ) + ψ.
The proof of Theorem 3.4 is provided in the Appendix. The results shown in Theorem 3.4 provide closeness of output behaviours of two systems in finite-time horizon. We can extend the result to infinite-time horizon given that ψb = 0 as stated in the following corollary.
b be two dt-SCS without internal inputs and outputs and with the same output Corollary 3.5. Let Σ and Σ b to Σ such that ρext (·) ≡ 0 and ψ = 0. For any random variables a and a spaces. Suppose V is an SSF from Σ ˆ as the initial states of the two dt-SCS and any external input trajectory νˆ(·) ∈ Uˆ preserving Markov property b there exists ν(·) ∈ U of Σ through the interface function associated with V such that the for the closed-loop Σ, following inequality holds: V (a, a ˆ) . P sup kyaν (k) − yˆaˆ0 (k)k ≥ ε | [a; a ˆ] ≤ α (ε) 0≤k 0. Then y(·) φ with labeling function L if yˆ(·) φˆ over time interval [0, Td ] ˆ with labeling function Lǫ and modified specification φ. The proof of Lemma 4.4 is provided in the Appendix. Next theorem presents the core result of this section. b are two dt-SCS for which inequality (3.5) holds with the pair (ǫ, δ) and any Theorem 4.5. Suppose Σ and Σ time bound Td . Suppose a specification φ and a labeling function L is defined for Σ. The following inequality ˆ b and modified specification φ: holds for the labeling function Lǫ on Σ ˆ − δ ≤ P(y(·) φ), P(ˆ y (·) φ) where the satisfaction is over time interval [0, Td]. The proof of Theorem 4.5 is provided in the Appendix. In order to get an upper bound for P(y(·) φ), we need to define for any Borel measurable set A ⊂ Y , its (−ǫ)-perturbed version A−ǫ as A−ǫ := {y ∈ Y ∃¯ y ∈ A with k¯ y − yk < ǫ}. A new labeling map L−ǫ : Y → 2Σa is constructed using the (−ǫ)−perturbation of subsets of Y as [ L−ǫ (y) := a ∈ Σa | y ∈ [L−1 (a)]−ǫ .
(4.2)
b are two dt-SCS for which inequality (3.5) holds with the pair (ǫ, δ) and any Theorem 4.6. Suppose Σ and Σ time bound Td . Suppose a specification φ and a labeling function L is defined for Σ. The following inequality b holds for the labeling function L−ǫ defined in (4.2) on Σ: P(y(·) φ) ≤ P(ˆ y (·) φ) + δ,
where the satisfaction is over time interval [0, Td ] and the probability in the right-hand side is computed for having yˆ(·) φ for any choice of non-determinism introduced by the labeling map L−ǫ .
The proof is similar to that of Theorem 4.5, and is omitted here due to lack of space. 5. Compositional Abstractions for Interconnected Systems In this section, we analyze networks of control systems and show how to construct their abstractions together with the corresponding simulation functions by using stochastic storage functions for the subsystems. First, we first provide a formal definition of interconnection between discrete-time stochastic control subsystems. Definition 5.1. Consider N ∈ N≥1 stochastic control subsystems Σi = (Xi , Ui , Wi , ςi , fi , Y1i , Y2i , h1i , h2i ), i ∈ {1, . . . , N }, and a static matrix M of an appropriate dimension defining the coupling of these subsystems. The interconnection of Σi for any i ∈ {1, . . . , N }, is the interconnected stochastic control sysQN QN tem Σ = (X, U, ς, f, Y, h), denoted by I(Σ1 , . . . , ΣN ), such that X := i=1 Xi , U := i=1 Ui , function QN QN QN f := i=1 fi , Y := i=1 Y1i , and h = i=1 h1i , with the internal variables constrained by: [w1 ; . . . ; wN ] = M [h21 (x1 ); . . . ; h2N (xN )].
COMPOSITIONAL CONSTRUCTION OF INFINITE ABSTRACTIONS FOR NETWORKS OF STOCHASTIC CONTROL SYSTEMS 9
5.1. Compositional Abstractions of Interconnected Systems. This subsection contains one of the main contributions of the paper. Assume that we are given N stochastic control subsystems Σi = (Xi ,Ui ,Wi , ςi ,fi , Y1i , ˆ 2i ) with SStF b i = (X ˆi, U ˆi , W ˆ i , ςˆi , fˆi , Y1i , Yˆ2i , ˆh1i , h Y2i , h1i , h2i ) together with their corresponding abstractions Σ 11 ¯ 12 ¯ 21 22 b ˆ ¯ ¯ ¯ Vi from Σi to Σi . We use αi , κi , ρiext , Hi , Gi , Gi , Xi , Xi , Xi , Xi , and Xi to denote the corresponding functions, matrices, and their corresponding conformal block partitions appearing in Definition 3.1. In the next theorem, as one of the main results of the paper, we quantify the error between the interconnection of stochastic control subsystems and that of their abstractions in a compositional way. Theorem 5.2. Consider interconnected stochastic control system Σ = I(Σ1 , . . . , ΣN ) induced by N ∈ N≥1 b i are stochastic control subsystems Σi and the coupling matrix M . Suppose stochastic control subsystems Σ ˆ of abstractions of Σi with the corresponding SStF Vi . If there exist µi > 0, i ∈ {1, . . . , N }, and matrix M appropriate dimension such that the matrix (in)equality T GM GM ¯ 0, (5.1) Xcmp Iq˜ Iq˜ ˆM ˆ, GM H = G (5.2) PN are satisfied, where q˜ = i=1 q2i and q2i are dimensions of internal outputs of subsystem Σi , and
then
G :=diag(G1 , . . . , GN ), ¯ 11 µ1 X 1 ¯ cmp := X ¯ 21 µ1 X 1
ˆ := diag(G ˆ1, . . . , G ˆ N ), H := diag(H1 , . . . , HN ), G ¯ 12 µ1 X 1 .. .. . . 12 11 ¯ ¯ µN X N µN X N ¯ 22 , µ1 X 1 .. .. . . 22 ¯ ¯ 21 X µ µN X N N N
V (x, xˆ) :=
N X
µi Vi (xi , x ˆi )
(5.3)
(5.4)
(5.5)
i=1
b = I(Σ ˆ 1, . . . , Σ b N ), with the coupling is a stochastic simulation function from the interconnected control system Σ ˆ , to Σ. matrix M The proof of Theorem 5.2 is provided in the Appendix. Remark 5.3. Note that linear matrix inequality (LMI) (5.1) with G = I is exactly similar to the LMI appearing in [AMP16] for compositional stability condition based on dissipativity theory. As discussed in [AMP16], the LMI holds independently of the number of subsystems in many physical applications with specific interconnection structures including communication networks, flexible joint robots, power generators, and so on. We refer the interested readers to [AMP16] for more details on the satisfaction of this type of LMI. Remark 5.4. One can relax condition (5.2) and employ the linear least square approach instead of solving the equality exactly. In this case, an additional error resulting from the least square approach is added to ψ in (9.2) which is left for the future investigations. 6. Discrete-Time Nonlinear Stochastic Control Systems In this section, we focus on a specific class of discrete-time nonlinear stochastic control systems Σnl and quadratic stochastic storage functions V and provide an approach on the construction of their abstractions. In the next subsection, we first formally define the class of discrete-time nonlinear stochastic control systems.
10
ABOLFAZL LAVAEI1 , SADEGH SOUDJANI2 , AND MAJID ZAMANI1
6.1. A Class of Discrete-Time Nonlinear Stochastic Control Systems. The class of discrete-time nonlinear stochastic control systems, considered here, is given by x(k+1) = Ax(k)+Eϕ(F x(k))+Bν(k)+Dw(k)+Rς(k), (6.1) Σnl : y1 (k) = C1 x(k), y2 (k) = C2 x(k),
where the additive noise ς(k) is a sequence of independent random vectors with multivariate standard normal distributions, and ϕ : R → R satisfies ϕ(v) − ϕ(w) ≤ b, ∀v, w ∈ R, v 6= w, (6.2) a≤ v−w for some a ∈ R and b ∈ R>0 ∪ {∞}, a ≤ b. We use the tuple Σnl = (A, B, C1 , C2 , D, E, F, R, ϕ), to refer to the class of discrete-time nonlinear stochastic control systems of the form (6.1). Remark 6.1. If ϕ in (6.1) is linear including the zero function (i.e. ϕ ≡ 0) or E is a zero matrix, one can remove or push the term Eϕ(F x) to Ax and, hence, the tuple representing the class of discrete-time nonlinear stochastic control systems reduces to the linear one Σnl = (A, B, C1 , C2 , D, R). Therefore, every time we use the tuple Σnl = (A, B, C1 , C2 , D, E, F, R, ϕ), it implicitly implies that ϕ is nonlinear and E is nonzero. Remark 6.2. Similar to what is shown in [AK01], without loss of generality, we can assume a = 0 in (6.2) for the class of nonlinear control systems in (6.1). If a 6= 0, one can define a new function ϕ(r) ˜ := ϕ(r) − ar which satisfies (6.2) with a ˜ = 0 and ˜b = b − a, and rewrite (6.1) as ˜ ϕ(F ˜ x(k))+Bν(k)+Dw(k)+Rς(k), x(k+1) = Ax(k)+E Σnl : y1 (k) = C1 x(k), y2 (k) = C2 x(k), where A˜ = A + aEF .
Remark 6.3. For the sake of simple presentation, we restrict ourselves to systems with a single nonlinearity as in (6.1). However, it would be straightforward to obtain analogous results for systems with multiple nonlinearities as P¯ x(k+1) = Ax(k)+ M i=1 Ei ϕi (Fi x(k))+Bν(k)+Dw(k)+Rς(k), Σnl : y1 (k) = C1 x(k), y2 (k) = C2 x(k), ¯ }. where ϕi : R → R satisfies (6.2) for some ai ∈ R and bi ∈ R>0 ∪ {∞}, for any i ∈ {1, . . . , M In the next subsection, we provide conditions under which a candidate V is an SStF facilitating the construction b nl . of an abstraction Σ 6.2. Quadratic Stochastic Storage Functions. Here, we employ the following quadratic SStF ˜ (x − P x V (x, xˆ) = (x − P x ˆ)T M ˆ),
(6.3)
˜ ≻ 0 are some matrices of appropriate dimensions. In order to show that V in (6.3) is an SStF where P and M b nl to Σnl , we require the following key assumption on Σnl . from Σ Assumption 1. Let Σnl = (A, B, C1 , C2 ,D,E,F,R, ϕ). Assume that for some constant 0 < κ b < 1 and k˜ > 0 11 ¯ 12 ¯ 21 22 ˜ ¯ ¯ there exist matrices M ≻ 0, K, L1 , Z, G, X , X , X , and X of appropriate dimensions such that the matrix equality D = ZG, and inequality (6.5) hold.
(6.4)
COMPOSITIONAL CONSTRUCTION OF INFINITE ABSTRACTIONS FOR NETWORKS OF STOCHASTIC CONTROL SYSTEMS 11
˜ (A + BK) (A + BK)T M ∗ ∗ ∗ ¯ 22 C2 ˜ + C2T X κ bM 12 ¯ X C2 −F 0
¯ 21 C2T X 11 ¯ X 0 0
˜Z (A + BK)T M T ˜ Z MZ ∗ ∗ −F T 0 2 b
0
˜ (BL1 + E) (A + BK)T M T ˜ Z M (BL1 + E) ˜ (BL1 + E) (BL1 + E)T M ∗ 0 0 0 T ˜ ˜ ˜ ˆ ˜ ˆ k(B R − P B) M (B R − P B)
˜ (B R ˜ − P B) ˆ (A + BK)T M ˜ (B R ˜ − P B) ˆ ZT M T ˜ ˜ − P B) ˆ (BL1 + E) M (B R ˜ − P B) ˆ TM ˜ (B R ˜ − P B) ˆ (B R (6.5)
Now, we provide one of the main results of this section showing under which conditions V in (6.3) is an SStF b nl to Σnl . from Σ b nl = (A, ˆ B, ˆ Cˆ1 , Cˆ2 , D, ˆ E, ˆ Fˆ , R, ˆ ϕ) be two stochastic Theorem 6.4. Let Σnl = (A, B, C1 , C2 , D, E, F, R, ϕ) and Σ control subsystems with the same external output space dimension. Suppose Assumption 1 holds and there exist ˆ such that matrices P , Q, H, L2 , and G AP = P Aˆ − BQ (6.6a) ˆ C1 P = C1 (6.6b) ¯ 12 C2 P = X ¯ 12 H Cˆ2 X ¯ 22 C2 P = X ¯ 22 H Cˆ2 X F P = Fˆ ˆ − B(L1 − L2 ) E = PE ˆ = Z G, ˆ PD
(6.6c) (6.6d) (6.6e) (6.6f ) (6.6g)
b nl to Σnl . hold. Then, function V defined in (6.3) is an SStF from Σ
The proof of Theorem 6.4 is provided in the Appendix. Note that the functions α ∈ K∞ , κ ∈ K, ρext ∈ ˜) ¯ in Definition 3.1 associated with the SStF in (6.3) are α(s) = λmin (M s2, K∞ ∪ {0}, and the matrix X T λ max (C1 C1 ) p ¯ 11 ¯ 12 ˜ (B R ˜ − P B)k ˆ 2 s2, ∀s ∈ R≥0 , and X ¯ = X 21 X 22 . Moreover, positive κ(s) := (1 − κ b)s, ρext (s) := κ ˜k M ¯ ¯ X X T ˜ T T ˆ P M ˜ PR ˆ . constant ψ in (3.2) is ψ = Tr R M R + R
Remark 6.5. Note that for any linear system Σnl = (A, B, C1 , C2 , D, R), stabilizability of the pair (A, B) is sufficient to satisfy Assumption 6.4 in where matrices E, F , and L1 are identically zero. ˆ equal to zero results in Remark 6.6. One can readily verify from the result of Theorem 6.4 that choosing R smaller constant ψ and, hence, more closeness between subsystems and their abstractions. Remark that this is not the case when one assumes the noise of the concrete subsystem and its abstraction are the same as in [ZRME17, Zam14]. ˆ and, hence, it Remark 6.7. Note that the results in Theorem 6.4 do not impose any condition on matrix B ˆ b nl fully can be chosen arbitrarily. As an example, one can choose B = Inˆ which makes the abstract system Σ actuated and, hence, the synthesis problem over it much easier. ˜ one can choose R ˜ such that it Remark 6.8. Since Theorem 6.4 does not impose any condition on matrix R, ˜ minimizes function ρext for V as suggested in [GP09]. The following expression for R ˜ = (B T M B)−1 B T M P B, ˆ R minimizes ρext .
In the next section, we demonstrate the effectiveness of the proposed results for an interconnected system consisting of three nonlinear stochastic control subsystems in a compositional fashion.
12
ABOLFAZL LAVAEI1 , SADEGH SOUDJANI2 , AND MAJID ZAMANI1
7. Case Study Consider a discrete-time nonlinear stochastic control system Σnl satisfying ¯ x(k + 1) = Gx(k)+ϕ(x(k))+ν(k)+Rς(k), Σnl : y(k) = Cx(k),
¯ = (In − τ L) ∈ Rn×n where τ L is the Laplacian matrix of an undirected graph with 0 < τ < for some matrix G 1/∆, where ∆ is the maximum degree of the graph [GR01]. Moreover, R = diag(0.0071n1 , . . . , 0.0071nN ), PN ς(k) = [ς1 (k); . . . ; ςN (k)], ϕ(x) = [1n1 ϕ1 (F1 x1 (k)); . . . ; 1nN ϕN (FN xN (k))] where n = i=1 ni , ϕi (x) = T ni T ∈ R ∀i ∈ {1, . . . , N }, and C has the block diagonal structure as sin(x), and Fi = 1 0 · · · 0 C = diag(C11 , . . . , C1N ). We partition x as x = [x1 ; . . . ; xN ] and ν as ν = [ν1 ; . . . ; νN ], where xi , νi ∈ Rni . Now, by introducing Σnli=(Ini , Ini ,C1i ,Ini , Ini , 1ni , Fi , 0.0071ni , ϕi ) satisfying xi (k + 1) = xi (k)+1ni ϕi (Fi xi (k))+νi (k)+wi (k)+0.0071ni ςi (k), Σnli : y1i (k) = C1i xi (k), y2i (k) = xi (k),
one can readily verify that Σnl = I(Σnl1 , . . . , ΣnlN ) where the coupling matrix M is given by M = −τ L. b nli = (0.5, 1, Cˆ1i , 1, 1, 0.1, 1, ϕi) which Our goal is to aggregate each xi into a scalar-valued x ˆi , governed by Σ satisfies: ˆi (k+1) = 0.5ˆ xi (k)+ 0.1ϕi (ˆ xi (k))+ νˆi (k) + w ˆi (k), x b nli : yˆ1i (k) = Cˆ1i x Σ ˆi (k), yˆ2i (k) = x ˆi (k),
ˆ i , ∀i ∈ {1, . . . , N }, are considered zero in order to reduce constants ψi where Cˆ1i = C1i 1ni . Note that here R for each Vi . One can readily verify that, for any i ∈ {1, . . . , N }, conditions (6.4) and (6.5) are satisfied with ¯ 11 = Ini , ˜ = 1ni , X ˜ i = Ini , κ bi = 0.95, κ ˜ i = 1, Ki = (λi − 1)Ini , λi = 0.5, Zi = Gi = Ini , L1i = −1ni , R M 12 21 22 ¯ =X ¯ = λi Ini . Moreover, for any i ∈ {1, . . . , N }, Pi = 1ni satisfies conditions (6.6) with ¯ = 0ni , and X X ˆ i = 1ni . Hence, function Vi (xi , x ˆi ) ˆi ) = (xi − 1ni xˆi )T (xi − 1ni x Qi = −0.51ni , L2 i = −0.11ni , and Hi = G 1 2 b is an SStF from Σnli to Σnli satisfying condition (3.1) with αi (s) = λmax (C T C1i ) s and condition (3.2) with 1i κi (s) := 0.05s, ρiext (s) = 0, ∀s ∈ R≥0 , Gi = Ini , Hi = 1ni , and λi Ini ¯ i = Ini X , (7.1) 0ni λi Ini where the input νi is given via the interface function in (9.4) as ˆi ) − 0.51ni xˆi + 1ni νˆi − 1ni ϕi (Fi xi ) + 0.11ni ϕi (Fi 1ni xˆi ). νi =(λi − 1)(xi − 1ni x b nl = I(Σ b nl1 , . . . , Σ b nlN ) with a coupling matrix M ˆ satisfying condition (5.2) as follows: Now, we look at Σ ˆ. − τ L diag(1n1 , . . . , 1nN ) = diag(1n1 , . . . , 1nN )M
(7.2)
ˆ satisfying (7.2) for graph Laplacian τ L means that the N subgraphs form an Note that the existence of M equitable partition of the full graph [GR01]. Although this restricts the choice of a partition in general, for the complete graph any partition is equitable. ¯ i in (7.1), matrix X ¯ cmp in (5.4) reduces to Choosing µ1 = · · · = µN = 1 and using X ¯ cmp = In λIn , X λIn 0n where λ = λ1 = · · · = λN = 0.5, and condition (5.1) reduces to T −τ L −τ L ¯ = τ 2 LT L−λτ L − λτ LT = τ L(τ L − 2λIn ) 0, Xcmp In In
COMPOSITIONAL CONSTRUCTION OF INFINITE ABSTRACTIONS FOR NETWORKS OF STOCHASTIC CONTROL SYSTEMS 13
b nl Figure 2. The specification with closed loop output trajectories of Σnl (black one) and Σ ¯ (red one). The sets S, Oi , i ∈ {1, 2, 3}, and Ti , i ∈ {1, 2} are given by: S = [−14, 14]3, O1 = [−10, −6] × [6, 10] × [10, 10], O2 = [−5, 5]3 , and O3 = [6, 10] × [−10, −6] × [10, 10], T¯1 = [−10, −6] × [−10, −6] × [−10, −6] and T¯2 = [6, 10] × [6, 10] × [6, 10].
without requiring any restrictions on the number or gains of the subsystems. In order to show the above inequality, we used τ L = τ LT 0 which is always true for Laplacian matrices of undirected graphs. Now, one Pn b nl to Σnl satisfying conditions ˆi ) is an SSF from Σ can readily verify that V (x, xˆ) = i=1 (xi − 1ni xˆi )T (xi − 1ni x (3.3) and (3.4). For the sake of simulation, we assume L is the Laplacian matrix of a complete graph and τ = 0.001. We fix N = 3, n = 222, ni = 74, and C1i = [1 0 0 . . . 0], i ∈ {1, 2, 3}. By using inequality (3.5) and starting the b nl from initial states −131222 and −1313, respectively, we guarantee that interconnected systems Σnl and Σ b nl will not exceed ε = 1 during the time horizon Td = 10 with the distance between outputs of Σnl and Σ probability at least 90%, i.e. P (kyaν (k) − yˆaˆνˆ (k)k ≤ 1, ∀k ∈ [0, 10]) ≥ 0.9 .
b nl to enforce the specification, defined by the Let us now synthesize a controller for Σnl via the abstraction Σ following scLTL formula (cf. Definition 4.1): ̟=
Td ^
j=0
3 ^ (¬Oi ) ∧ ✸T¯1 ∧ ✸T¯2 ,
j S ∧
(7.3)
i=1
which requires that any output trajectory y of the closed loop system evolves inside the set S, avoids sets Oi , i ∈ {1, 2, 3}, indicated with blue boxes in Figure 2, over bounded time interval [0, Td ], and visits each T¯i , i ∈ {1, 2}, indicated with red boxed in Figure 2. We want to satisfy ̟ over bounded time interval [0, 10] and b nl to enforce (7.3). In the synthesis process take Td = 10. We use SCOTS [RZ16] to synthesize a controller for Σ we restricted the abstract inputs νˆ1 , νˆ2 , νˆ3 to [−4, 4]. We also set the initial states of Σnl to xi = Pi x ˆi , so that b Vi (xi , xˆi ) = 0. A realization of closed-loop output trajectories of Σnl and Σnl is illustrated in Figure 2. Also, b nl are illustrated in Figure 3. Note that several realizations of the norm of error between outputs of Σnl and Σ it would not have been possible to synthesize a controller using SCOTS for the original 222-dimensional system b nl . Σnl , without the 3-dimensional intermediate approximation Σ
14
ABOLFAZL LAVAEI1 , SADEGH SOUDJANI2 , AND MAJID ZAMANI1
0.05
0.04
0.03
0.02
0.01
0 0
2
4
6
8
10
b nl , Figure 3. A few realizations of the norm of the error between the outputs of Σnl and of Σ e.g. ky − yˆk, for Td = 10. 8. Discussion In this paper, we provided a compositional approach for infinite abstractions of interconnected discrete-time stochastic control systems, with independent noises in the abstract and concrete domains. To do so, we leveraged the interconnection matrix and joint dissipativity-type properties of subsystems and their abstractions. We introduced new notions of stochastic storage and simulation functions in order to quantify the distance in probability between original stochastic control subsystems and their abstractions and their interconnections, respectively. Using those notions, one can employ the proposed results here to synthesize policies enforcing certain temporal logic properties over abstract systems and then refine them to the ones for the concrete systems while quantifying the satisfaction errors in this detour process. We also addressed a fragment of LTL known as syntactically co-safe LTL, and showed how to quantify the probability of satisfaction for such specifications. Furthermore, we provided a computational scheme for a class of discrete-time nonlinear stochastic control systems to construct their abstractions together with their corresponding stochastic storage functions. Finally, we demonstrated the effectiveness of the results by constructing an abstraction (totally 3 dimensions) of the interconnection of three discrete-time nonlinear stochastic control subsystems (together 222 dimensions) in a compositional fashion. We also employed the abstraction as a substitute to synthesize a controller enforcing a syntactically co-safe LTL specification.
References [AK01] [AMP16] [APLS08] [BKL08] [BS96] [BYG17] [DLT08]
[GP09] [GR01]
Murat Arcak and Petar Kokotovic. Observer-based control of systems with slope-restricted nonlinearities. IEEE Transactions on Automatic Control, 46(7):1146–1150, 2001. M. Arcak, C. Meissen, and A. Packard. Networks of dissipative systems. SpringerBriefs in Electrical and Computer Engineering. Springer, 2016. A. Abate, M. Prandini, J. Lygeros, and S. Sastry. Probabilistic reachability and safety for controlled discrete time stochastic hybrid systems. Automatica, 44(11):2724–2734, 2008. C. Baier, J. P.r Katoen, and K. G. Larsen. Principles of model checking. MIT press, 2008. D. P. Bertsekas and S. E. Shreve. Stochastic Optimal Control: The Discrete-Time Case. Athena Scientific, 1996. Calin Belta, Boyan Yordanov, and Ebru Aydin Gol. Formal Methods for Discrete-Time Dynamical Systems, volume 89 of Studies in Systems, Decision and Control. Springer International Publishing, 2017. J. Desharnais, F. Laviolette, and M. Tracol. Approximate analysis of probabilistic processes: logic, simulation and games. In Proceedings of the International Conference on Quantitative Evaluation of SysTems (QEST 08), pages 264–273, September 2008. A. Girard and G. J. Pappas. Hierarchical control system design using approximate simulation. Automatica, 45(2):566–571, 2009. C. Godsil and G. Royle. Algebraic graph theory. Graduate Texts in Mathematics. Springe, 2001.
COMPOSITIONAL CONSTRUCTION OF INFINITE ABSTRACTIONS FOR NETWORKS OF STOCHASTIC CONTROL SYSTEMS 15
[HSA17]
Sofie Haesaert, Sadegh Soudjani, and Alessandro Abate. Verification of general Markov decision processes by approximate similarity relations and policy refinement. In SIAM Journal on Control and Optimization, volume 55, pages 2333–2367, 2017. [JP09] A. A. Julius and G. J. Pappas. Approximations of stochastic hybrid systems. IEEE Transactions on Automatic Control, 54(6):1193–1203, 2009. [Kus67] H. J. Kushner. Stochastic Stability and Control. Mathematics in Science and Engineering. Elsevier Science, 1967. [KV01] O Kupferman and M Y Vardi. Model checking of safety properties. Formal Methods in System Design, 19(3):291–314, 2001. [LSMZ17] A. Lavaei, S. Soudjani, R. Majumdar, and M. Zamani. Compositional abstractions of interconnected discrete-time stochastic control systems. In Proceedings of the 56th IEEE Conference on Decision and Control, 3551–3556. arXiv: 1709.10312, 2017. [LSZ18] A. Lavaei, S. Soudjani, and M. Zamani. From dissipativity theory to compositional construction of finite Markov decision processes. In Proceedings of the 21st ACM International Conference on Hybrid Systems: Computation and Control, to appear. arXiv: 1712.07793l, 2018. [MSSM17] K. Mallik, S. Soudjani, A.-K. Schmuck, and R. Majumdar. Compositional Construction of Finite State Abstractions for Stochastic Control Systems. arXiv: 1709.09546, September 2017. [Oks13] B. Oksendal. Stochastic differential equations: an introduction with applications. Springer Science & Business Media, 2013. [RZ16] M. Rungger and M. Zamani. SCOTS: A tool for the synthesis of symbolic controllers. In Proceedings of the 19th ACM International Conference on Hybrid Systems: Computation and Control, pages 99–104, 2016. [SA13] S. Soudjani and A. Abate. Adaptive and sequential gridding procedures for the abstraction and verification of stochastic processes. SIAM Journal on Applied Dynamical Systems, 12(2):921–956, 2013. [SAM15] S. Soudjani, A. Abate, and R. Majumdar. Dynamic Bayesian networks as formal abstractions of structured stochastic processes. In Proceedings of the 26th International Conference on Concurrency Theory, pages 1–14, 2015. [SGA15] S. Soudjani, C. Gevaerts, and A. Abate. FAUST2 : Formal abstractions of uncountable-state stochastic processes. In TACAS’15, volume 9035 of Lecture Notes in Computer Science, pages 272–286. Springer, 2015. [Sou14] S. Soudjani. Formal Abstractions for Automated Verification and Synthesis of Stochastic Systems. PhD thesis, Technische Universiteit Delft, The Netherlands, 2014. [TA11] I. Tkachev and A. Abate. On infinite-horizon probabilistic properties and stochastic bisimulation functions. In Proceedings of the 50th IEEE Conference on Decision and Control and European Control Conference (CDC-ECC), pages 526–531, 2011. [TMKA13] I. Tkachev, A. Mereacre, J.-P. Katoen, and A. Abate. Quantitative automata-based controller synthesis for nonautonomous stochastic hybrid systems. In Proceedings of the 16th ACM International Conference on Hybrid Systems: Computation and Control, pages 293–302, 2013. [ZA14] M. Zamani and A. Abate. Approximately bisimilar symbolic models for randomly switched stochastic systems. Systems & Control Letters, 69:38–46, 2014. [ZAG15] M. Zamani, A. Abate, and A. Girard. Symbolic models for stochastic switched systems: A discretization and a discretization-free approach. Automatica, 55:183–196, 2015. [Zam14] M. Zamani. Compositional approximations of interconnected stochastic hybrid systems. In Proceedings of the 53rd IEEE Conference on Decision and Control (CDC), pages 3395–3400, 2014. [ZMEM+ 14] M. Zamani, P. Mohajerin Esfahani, R. Majumdar, A. Abate, and J. Lygeros. Symbolic control of stochastic systems via approximately bisimilar finite abstractions. IEEE Transactions on Automatic Control, 59(12):3135–3150, 2014. [ZRME17] M. Zamani, M. Rungger, and P. Mohajerin Esfahani. Approximations of stochastic hybrid systems: A compositional approach. IEEE Transactions on Automatic Control, 62(6):2838–2853, 2017.
9. Appendix b to Σ, we have Proof. (Theorem 3.4) Since V is an SSF from Σ P
ˆ] sup kyaν (k) − yˆaˆνˆ (k)k ≥ ε | [a; a
0≤k≤Td
≤P
=P
ˆ] sup α (kyaν (k) − yˆaˆνˆ (k)k) ≥ α(ε) | [a; a
0≤k≤Td
sup V (xaν (k), x ˆaˆ νˆ (k)) ≥ α(ε) | [a; a ˆ] .
(9.1)
0≤k≤Td
The equality holds due to α being a K∞ function. The inequality is also true due to condition (3.3) on the SSF V . The results follows by applying Theorem 3 in [Kus67, pp. 81] to (9.1) and utilizing inequality (3.4).
ABOLFAZL LAVAEI1 , SADEGH SOUDJANI2 , AND MAJID ZAMANI1
16
b to Σ with ρext (·) ≡ 0 and ψ = 0, for any x(k) ∈ X and Proof. (Corollary 3.5) Since V is an SSF from Σ ˆ ˆ x ˆ(k) ∈ X and any νˆ(k) ∈ U , there exists ν(k) ∈ U such that h i E V (x(k + 1), x ˆ(k + 1) | x(k), xˆ(k), ν(k), νˆ(k) − V ((x(k), xˆ(k)) ≤ −κ(V (x(k), xˆ(k)),
implying that V (xaν (k), x ˆaˆ νˆ (k)) is a nonnegative supermartingale [Oks13] for any initial condition a and a ˆ and inputs ν, νˆ. Following the same reasoning as in the proof of Theorem 3.4 we have P sup kyaν (k) − yˆaˆνˆ (k)k ≥ ε | [a; a ˆ] P sup α kyaν (k) − yˆaˆνˆ (k)k ≥ α(ε) | [a; a ˆ] 0≤k