An Overview of Scalable FETI–DP Algorithms for Variational Inequalities

1 downloads 0 Views 131KB Size Report
scalable algorithms for the numerical solution of elliptic variational inequal- ities. ..... SIAM. Journal on Scientific Computing 13, 379–396 (1992). 15. Klawonn, A.
An Overview of Scalable FETI–DP Algorithms for Variational Inequalities Zdenˇek Dost´ al1 , David Hor´ ak1 and Dan Stefanica2 1

2

ˇ FEI VSB-Technical University Ostrava, CZ-70833 Ostrava, Czech Republic [email protected], [email protected] Baruch College, City University of New York, NY 10010, USA Dan [email protected]

Summary. We review our recent results concerning optimal algorithms for numerical solution of both coercive and semi-coercive variational inequalities by combining dual-primal FETI algorithms with recent results for bound and equality constrained quadratic programming problems. The convergence bounds that guarantee the scalability of the algorithms are presented. These results are confirmed by numerical experiments.

1 Introduction The Finite Element Tearing and Interconnecting (FETI) method was originally proposed by Farhat and Roux [14] as a parallel solver for problems described by elliptic partial differential equations. After introducing a so– called “natural coarse grid”, Farhat, Mandel and Roux [13] modified the basic FETI method to obtain a numerically scalable algorithm. A similar result was achieved by the Dual-Primal FETI method (FETI–DP) introduced by Farhat et al. [12]; see also [15]. In this paper, we use the FETI–DP method to develop scalable algorithms for the numerical solution of elliptic variational inequalities. The FETI–DP methodology is first applied to the variational inequality to obtain either a strictly convex quadratic programming problem with non-negativity constraints, or a convex quadratic programming problem with bound and equality constraints. This problems are then solved efficiently by recently proposed improvements [4, 11] of the active set based proportioning algorithm [3], possibly combined with a semimonotonic augmented Lagrangian algorithm [5, 6]. The rate of convergence of these algorithms can be bounded in terms of the spectral condition number of the quadratic problem, and therefore the scalability of the resulting algorithm can be established provided that suitable bounds on the condition number of the Hessian of the quadratic cost function exist. We present such estimates in terms of the decomposition parameter H and the discretization parameter h. These bounds are independent

2

Zdenˇek Dost´ al, David Hor´ ak and Dan Stefanica

of both the decomposition of the computational domain and the discretization, provided that we keep the ratio H/h fixed. We report numerical results that are in agreement with the theory and confirm the numerical scalability of our algorithm. Let us recall that an algorithm based on FETI–DP and on active set strategies with additional planning steps, FETI–C, was introduced by Farhat et al. [1]. The scalability of FETI–C was established experimentally.

2 Model problem To simplify our exposition, we restrict our attention to a simple model problem. The computational domain is Ω = Ω 1 ∪ Ω 2 , where Ω 1 = (0, 1) × (0, 1) and Ω 2 = (1, 2) × (0, 1), with boundaries Γ 1 and Γ 2 , respectively. We denote by Γui , Γfi , and Γci the fixed, free, and potential contact parts of Γ i , i = 1, 2. We assume that Γu1 has non-zero measure, i.e., Γu1 6= ∅. For a coercive model problem, Γu2 6= ∅, while for a semicoercive model problem, Γu2 = ∅; see Figure 1a. Let Γc = Γc1 ∪ Γc2 . The Sobolev space of the first order on Ω i is denoted by H 1 (Ω i ) and the space of Lebesgue square integrable functions is denoted by L2 (Ω i ). Let V = V 1 × V 2 , with  V i = v i ∈ H 1 (Ω i ) : v i = 0 on Γui , i = 1, 2. Let K ⊂ V be a closed convex subset of H = H 1 (Ω 1 ) × H 1 (Ω 2 ) defined by  K = (v 1 , v 2 ) ∈ V : v 2 − v 1 ≥ 0 on Γc .

We define the symmetric bilinear form a(·, ·) : H × H → R by a(u, v) =

2 Z X i=1

Ωi



∂ui ∂v i ∂ui ∂v i + ∂x1 ∂x1 ∂x2 ∂x2



dx.

Let f ∈ L2 (Ω) be a given function and f i ∈ L2 (Ω i ), i = 1, 2, be the restrictions of f to Ω i , i = 1, 2. We define the linear form l(·) : H → R by `(v) =

2 Z X i=1

f i v i dx Ωi

and consider the following problem: Find

min

1 a(u, u) − `(u) subject to u ∈ K. 2

(1)

The solution of the model problem may be interpreted as the displacement of two membranes under the traction f . The left membrane Ω 1 is fixed on the left edge as in Figure 1a and the left edge of Ω 2 is not allowed to penetrate below the right edge of Ω 1 . For the model problem to be well defined, we

An Overview of Scalable FETI–DP Algorithms for Variational Inequalities -3

f

h

-1



2

0.75

0.25 0.75

0.25



1

1

1 Γu

1

1 Γf

Γc

3

H Uc

2 Γf

Fig. 1a: Semi–coercive model problem

λ

E

λ

I

Fig. 1b: Decomposition: H = .5, H/h = 3

either require that the right edge of the right membrane Ω 2 is fixed, for the coercive problem, or, for the semicoercive problem, that the traction function f satisfies Z f dx < 0.

Ω2

3 FETI–DP discretization of the problem The first step in our domain decomposition method is to partition each domain Ω i , i = 1, 2, on a rectangular grid into subdomains of diameter of order H. Let W be the finite element space whose restrictions to Ω 1 and Ω 2 are Q1 finite element spaces of comparable mesh sizes of order h, corresponding to the subdomain grids in Ω 1 and Ω 2 . We call a crosspoint either a corner that belongs to four subdomains, or a corner that belongs to two subdomains and is located on ∂Ω 1 \Γu1 or on ∂Ω 2 \Γu2 . The nodes corresponding to the end points of Γc are not crosspoints; see Figure 1b. An important feature for developing FETI–DP type algorithms is that a single degree of freedom is considered at each crosspoint, while two degrees of freedom are introduced at all the other matching nodes across subdomain edges. Let v ∈ W . The continuity of v in Ω 1 and Ω 2 is enforced at every interface node that is not a crosspoint. For simplicity, we also denote by v the nodal values vector of v ∈ W . The discretized version of problem (1) with the auxiliary domain decomposition has the form min

1 T v Kv − v T f 2

subject to BI v ≤ 0 and BE v = 0,

(2)

where the full rank matrices BI and BE describe the non-penetration (inequality) conditions and the gluing (equality) conditions, respectively, and f represents the discrete analog of the linear form `(·). In (2), K = diag(K1 , K2 ) is the block diagonal stiffness matrix corresponding to the model problem (1). The block K1 corresponding to Ω 1 is nonsingular, due to the Dirichlet boundary conditions on Γu1 . The block K2 corresponding to Ω 2 is nonsingular for a coercive problem, and is singular, with the kernel made of a vector e with all

4

Zdenˇek Dost´ al, David Hor´ ak and Dan Stefanica

entries equal to 1, for a semicoercive problem. The kernel of K is spanned by the matrix R defined by   0 R = . e Even though R is a column vector for our model problem, we will regard R as a matrix whose columns span the kernel of K. We partition the nodal values of v ∈ W into crosspoint nodal values, denoted by vc , and remainder nodal values, denoted by vr . The continuity of v at crosspoints is enforced by using a global vector of degrees of freedom vcg and a global-to-local map Lc with one nonzero entry equal to 1 in each row, i.e., we require that vc = Lc vcg . Therefore,     vr vr = v = . Lc vcg vc Let fc and fr be the parts of the right hand side f corresponding to the corner and remainder nodes, respectively. Let BI,r and BI,c be the matrices made of the columns of BI corresponding to vr and vc , respectively; define BE,r and BE,c similarly. Let     BI,r BI,c Br = , Bc = , B = [Br Bc ]. BE,r BE,c Let Krr , Krc , and Kcc denote the blocks of K corresponding to the decomposition of v into vr and vc . Consider the shortened vectors   vr v = ∈ W. vcg Let λI and λE be Lagrange multipliers enforcing the inequality and redundancy conditions. The Lagrangian L(v, λ) = 1/2 v T Kv − v T f + v T B T λ associated with problem (2) can be expressed as follows: L(v, λ) = where   λI λ= , λE

K=



1 T T v Kv − v T f + v T B λ, 2

 Krr Krc Lc , T LTc Krc LTc Kcc Lc

B = [Br Bc Lc ] ,

(3)

f=



 fr . LTc fc

Using duality theory [2], we can eliminate the primal variables v from the mixed formulation of (2). For a coercive problem, K is nonsingular and we obtain the problem of finding 1 min Θ(λ) = min λT F λ − λT de s.t. λI ≥ 0, 2

(4)

−1 T −1 with F = B K B and de = B K f . For an efficient implementation of F it is important to exploit the structure of K; see [8, 10] for more details.

An Overview of Scalable FETI–DP Algorithms for Variational Inequalities

5

For a semicoercive problem, we obtain the problem of finding 1 min Θ(λ) = min λT F λ − λT de s.t. λI ≥ 0 and 2 †

T



T

e = ee, Gλ

(5)



e = RT B , ee = RT f. Here, K denotes a where F = B K B , de = B K f , G † suitable generalized inverse that satisfies K K K = K. Even though problem (5) is much more suitable for computations than (1) and was used for solving discretized variational inequalities efficiently [7], further improvement may be achieved as follows. Let Te denote a nonsingular matrix that defines e such that the matrix G = TeG e has the orthonormalization of the rows of G e orthonormal rows. Let e = T ee. Then, problem (5) reads min

1 T λ F λ − λT de s.t λI ≥ 0 and Gλ = e. 2

(6)

Next, we transform the problem of minimization on the subset of the affine e be an space to a minimization problem on the subset of a vector space. Let λ e = e. We look for the solution λ of (5) arbitrary feasible vector such that Gλ e After returning to the old notation by replacing µ by in the form λ = µ + λ. λ, it is easy to see that (6) is equivalent to min

1 T fI , λ F λ − dT λ s.t Gλ = 0 and λI ≥ −λ 2

(7)

e λ. e Our final step is based on the observation that the augmented with d = d−F Lagrangian for problem (7) may be decomposed by the orthogonal projectors Q = GT G

and

P =I −Q

on the image space of GT and on the kernel of G, respectively. Since P λ = λ for any feasible λ, problem (7) is equivalent to min

1 T fI . λ P F P λ − λT P d s.t Gλ = 0 and λI ≥ −λ 2

(8)

4 Optimality To solve the discretized variational inequality, we use our recently proposed algorithms [8, 10]. To solve the bound constrained quadratic programming problem (4), we use active set based algorithms with proportioning and gradient projections [4, 11]. The rate of convergence of the resulting algorithm can be estimated in terms of bounds on the spectrum of the Hessian of Θ. To solve the bound and equality constrained quadratic programming problem (8), we use semimonotonic augmented Lagrangian algorithms [5, 6]. The equality constraints are enforced by Lagrange multipliers generated in the

6

Zdenˇek Dost´ al, David Hor´ ak and Dan Stefanica

outer loop, while the bound constrained problems are solved in the inner loop by the above mentioned algorithms. The rate of convergence of this algorithm may be again described in terms of bounds on the spectrum of the Hessian of Θ. Summing up, the optimality of our algorithms is guaranteed, provided that we establish optimal bounds on the spectrum of the Hessian of Θ. Such bounds on the spectrum of the operator F , possibly restricted to ImP , are given in the following theorem: Theorem 1. If F denotes the Hessian matrix of Θ in (4), the following spectral bounds hold:  2 H ; λmin (F ) ≥ C. λmax (F ) = ||F || ≤ C h If F denotes the Hessian matrix of Θ in (5), the following spectral bounds hold:  2 H ; λmin (F |ImP ) ≥ C. λmax (F |ImP ) ≤ ||F || ≤ C h Proof: See [8, 10].

5 Numerical experiments We report some results for the numerical solutions of a coercive contact problem and of a semicoercive contact problem, in order to illustrate the performance and numerical scalability of our FETI–DP algorithms. In our experiments, we used a function f vanishing on (0, 1) × [0, 0.75) ∪ (1, 2) × [0.25, 1). For the coercive problem, f was equal to −1 on (0, 1) × [0.75, 1) and to −3 on (1, 2) × [0, 0.25), while for the semicoercive problem, f was equal to −5 on (0, 1) × [0.75, 1) and to −1 on (1, 2) × [0, 0.25). Each domain Ω i was partitioned into identical squares with side H = 1/2, 1/4, 1/8. These squares were then discretized by a regular grid with the stepsize h. For each partition, the number of nodes on each edge, H/h, was taken to be 4, 8, and 16. The meshes matched across the interface for every neighboring subdomains. All experiments were performed in MATLAB. The solution of both the coercive and semicoercive model problems for H = 1/4 and h = 1/4 are presented in Figure 2. Selected results of the computations for varying values of H and H/h are given in Table 1, for the coercive problem, and in Table 2 for the semicoercive problem. The primal dimension/dual dimension/number of corners are recorded in the upper row in each field of the table, while the number of the conjugate gradient iterations required for the convergence of the solution to the given precision is recorded in the lower row. The key point is that the number of the conjugate gradient iterations for a fixed ratio H/h varies very moderately with the increasing number of subdomains.

An Overview of Scalable FETI–DP Algorithms for Variational Inequalities

7

Table 1. Convergence results for the FETI–DP algorithm - coercive problem H 1/2 1/4 1/8 H/h = 16 2312/153/10 9248/785/42 36992/3489/154 33 39 43 H/h = 8 648/73/10 2592/369/42 10365/1633/154 20 32 34 H/h = 4 200/33/10 800/161/42 3200/705/154 19 24 27 Table 2. Convergence results for the FETI–DP algorithm - semicoercive problem H 1/2 1/4 1/8 H/h = 16 2312/153/10 9248/785/42 36992/3489/154 61 51 53 H/h = 8 648/73/10 2592/369/42 10365/1633/154 38 36 46 H/h = 4 200/33/10 800/161/42 3200/705/154 29 28 35

0 0

−0.1

−0.2

−0.2

−0.4

−0.3 −0.6

−0.4 −0.8

−0.5

1 −1

1

−0.6 0.8

−1.2

0.5

0.6 −0.7 0

0.4 0.2

0.4

0.6

0.8

1

0.2 1.2

1.4

1.6

1.8

2

0

−1.4 0

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

1.8

0 2

Fig. 2a: Solution of coercive problem Fig. 2b: Solution of semi-coercive problem

6 Comments and conclusions We applied the FETI–DP methodology to the numerical solution of a variational inequality. Theoretical arguments and results of numerical experiments show that the scalability of the FETI–DP method which was established earlier for linear problems may be preserved even in the presence of nonlinear conditions on the contact boundary. The results are supported by numerical experiments. Similar results were obtained also for non-matching contact interfaces discretized by mortars [9]. Acknowledgments. The work of the first two authors was supported by Grant 101/04/1145 of the GA CR, by Grant S3086102 of GA CAS, and by Projects 1ET400300415 and ME641 of the Ministry of Education of the Czech Republic. The third author was supported by the Research Foundation of the City University of New York Awards PSC-CUNY 665463-00 34 and 66529-00 35.

8

Zdenˇek Dost´ al, David Hor´ ak and Dan Stefanica

References 1. Avery, P., Rebel, G., Lesoinne, M., Farhat, C.: A numerically scalable Dual–Primal substructuring method for the solution of contact problems - part I: the frictionless case. Computer Methods in Applied Mechanics and Engineering, 193, 2403–2426 (2004) 2. Bertsekas, D. P.: Nonlinear Optimization, Athena Scientific, Belmont (1999). 3. Dost´ al, Z.: Box constrained quadratic programming with proportioning and projections. SIAM Journal on Optimization 7, 3, 871–887 (1997) 4. Dost´ al, Z.: A proportioning based algorithm for bound constrained quadratic programming with the rate of convergence. Numerical Algorithms, 34, 2-4, 293-302 (2003) 5. Dost´ al, Z.: Inexact semi-monotonic Augmented Lagrangians with optimal feasibility convergence for quadratic programming with simple bounds and equality constraints. accepted in SIAM Journal on Numerical Analysis 6. Dost´ al, Z.: An optimal algorithm for bound and equality constrained quadratic programming problems with bounded spectrum. submitted to Computing 7. Dost´ al, Z., Friedlander, A., Santos, S.A.: Solution of contact problems of elasticity by FETI domain decomposition. Contemporary Mathematics 218, 82–93 (1998) 8. Dost´ al, Z., Hor´ ak, D., Stefanica, D.: A scalable FETI-DP algorithm for coercive variational inequalities. IMACS Journal Applied Numerical Mathematics, in print 9. Dost´ al, Z., Hor´ ak, D., Stefanica, D.: A Scalable FETI–DP Algorithm with Non–penetration Mortar Conditions on Contact Interface. submitted 10. Dost´ al, Z., Hor´ ak, D., Stefanica, D.: A Scalable FETI–DP Algorithm for Semi-coercive Variational Inequalities. submitted 11. Dost´ al, Z., Sch¨ oberl, J.: Minimizing quadratic functions over non-negative cone with the rate of convergence and finite termination. Computational Optimization and Applications 30, 1, 23–44 (2005) 12. Farhat, C., Lesoinne, M., LeTallec, P., Pierson, K., Rixen, D.: FETI-DP: A dual-prime unified FETI method. I: A faster alternative to the two-level FETI method. International Journal for Numerical Methods Engineering, 50, No.7, 1523–1544 (2001) 13. Farhat, C., Mandel, J., Roux, F.X.: Optimal convergence properties of the FETI domain decomposition method. Computer Methods in Applied Mechanics and Engineering, 115, 365–385 (1994) 14. Farhat, C., Roux, F.X.: An unconventional domain decomposition method for an efficient parallel solution of large-scale finite element systems. SIAM Journal on Scientific Computing 13, 379–396 (1992) 15. Klawonn, A., Widlund, O.B., Dryja, M.: Dual-Primal FETI Methods for Three-dimensional Elliptic Problems with Heterogeneous Coefficients. SIAM Journal on Numerical Analysis, Vol. 40, 159–179 (2002)

Suggest Documents