New Algorithms for Solving Large Sparse Systems of Linear Equations and their Application to Nonlinear Optimization

H. D. Scolnik

Departamento de Computación, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, (1426) Buenos Aires, Argentina. [email protected]

Abstract: Given a system of equations $Ax = b$, where $A$ is a full rank $n \times n$ matrix, usually large and sparse, a starting point $x_0$, and a direction vector $p$, a new result is presented which allows us to find the closest point of the form $x_0 + \lambda p$ to the unknown solution $x^*$ in the metric defined by $A^T A$. From that result several algorithms have been developed, initially for symmetric positive definite matrices. The basic idea is to optimize free parameters determining the search directions in order to approximate Newton's method. The main algorithm is of Cimmino's type, in which the directions given by the projections onto the hyperplanes are combined in an optimal way. The version for positive definite systems is being applied to large-scale nonlinear optimization problems. Finally, a couple of numerical experiments are presented.

Keywords: Sparse systems, projection methods, nonlinear optimization.

1 Introduction

Given a linear system $Ax = b$, where $A \in \mathbb{R}^{n \times n}$ is a full rank matrix, usually large and sparse, the Conjugate Gradient algorithm is in general a very effective method. However, projection methods are an attractive alternative for this kind of problem; the idea goes back to S. Kaczmarz [8] and G. Cimmino [2], who proposed to iterate by means of projections onto the hyperplanes defined by the equations.

2 The Main Result

Given a starting point $x_0$ and a direction $p$, define $r_0 = Ax_0 - b$ and consider points of the form $x_0 + \lambda p$. Since $\|A(x_0 + \lambda p) - b\|^2 = \|x_0 + \lambda p - x^*\|^2_{A^T A}$, finding the closest such point to the unknown solution $x^*$ in the metric defined by $A^T A$ amounts to minimizing

$$f(\lambda) = \|r_0 + \lambda Ap\|^2 = \|r_0\|^2 + 2\lambda\langle r_0, Ap\rangle + \lambda^2\|Ap\|^2;$$

then, from $f'(\lambda) = 0$, it follows that

$$\lambda^* = -\frac{\langle r_0, Ap\rangle}{\|Ap\|^2}. \qquad (1)$$

Note that $f''(\lambda) > 0$. Moreover,

$$f(\lambda^*) = \|r_0\|^2\left(1 - \cos^2(r_0, Ap)\right) = \|r_0\|^2 \sin^2(r_0, Ap). \qquad (2)$$
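As a quick numerical illustration (a sketch added here, not part of the original text), the optimal step $\lambda^*$ and identity (2) can be checked on a small random system; all names are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 6
A = rng.standard_normal((n, n)) + n * np.eye(n)   # full rank, well conditioned
x_star = rng.standard_normal(n)
b = A @ x_star

x0 = np.zeros(n)
p = rng.standard_normal(n)                        # an arbitrary direction
r0 = A @ x0 - b                                   # residual, r = Ax - b
Ap = A @ p

lam = -(r0 @ Ap) / (Ap @ Ap)                      # optimal step lambda*

f_min = np.linalg.norm(r0 + lam * Ap) ** 2        # f(lambda*)
cos2 = (r0 @ Ap) ** 2 / ((r0 @ r0) * (Ap @ Ap))
# identity (2): f(lambda*) = ||r0||^2 * (1 - cos^2(r0, Ap))
residual_gap = abs(f_min - (r0 @ r0) * (1.0 - cos2))
```

Since $f$ is a strictly convex quadratic in $\lambda$, any perturbation of `lam` can only increase $f$, which gives a second easy check of optimality.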

Lemma 2.1: The merit function (2) is zero if and only if $p$ coincides (up to a nonzero scalar factor) with the optimal direction $x_0 - x^*$.

Proof. Elementary.

Suppose now we have two directions $p_1$ and $p_2$. We want to find $\gamma$ in such a way that $p(\gamma) = \gamma p_1 + (1-\gamma)p_2$ minimizes the merit function (2). This result is given in the following

Theorem 2.2: Given the system $Ax = b$, where $A \in \mathbb{R}^{n \times n}$ is a full rank matrix, let $g_0 = Ax_0 - b$. Then the merit function (2) along $p(\gamma) = \gamma p_1 + (1-\gamma)p_2$ is minimized at

$$\gamma^* = \frac{\langle A(p_1-p_2),\ \langle g_0, Ap_2\rangle Ap_2 - \|Ap_2\|^2 g_0\rangle}{\langle A(p_1-p_2),\ \langle A(p_1-p_2), Ap_2\rangle g_0 - \langle g_0, Ap_2\rangle A(p_1-p_2)\rangle},$$

provided the denominator is nonzero.

Investigación Operativa, Volume 7, Number 1-2, January-June 1997. © Investigación Operativa 1997.

Proof. Let us define

$$H(\gamma) = \frac{\langle g_0, Ap(\gamma)\rangle^2}{\|Ap(\gamma)\|^2} \equiv \frac{P(\gamma)}{Q(\gamma)} \qquad (3)$$

and we will find its critical points. We have that

$$\langle g_0, Ap(\gamma)\rangle = \langle g_0, A(\gamma(p_1-p_2) + p_2)\rangle = \gamma\langle g_0, A(p_1-p_2)\rangle + \langle g_0, Ap_2\rangle.$$

Then, defining

$$a_1 = \langle g_0, A(p_1-p_2)\rangle^2, \quad a_2 = 2\langle g_0, A(p_1-p_2)\rangle\langle g_0, Ap_2\rangle, \quad a_3 = \langle g_0, Ap_2\rangle^2, \qquad (4)$$

we get

$$P(\gamma) = \langle g_0, Ap(\gamma)\rangle^2 = a_1\gamma^2 + a_2\gamma + a_3.$$

Also,

$$\|Ap(\gamma)\|^2 = \langle Ap(\gamma), Ap(\gamma)\rangle = \langle \gamma A(p_1-p_2) + Ap_2,\ \gamma A(p_1-p_2) + Ap_2\rangle = \gamma^2\|A(p_1-p_2)\|^2 + 2\gamma\langle A(p_1-p_2), Ap_2\rangle + \|Ap_2\|^2.$$

Therefore, defining

$$b_1 = \|A(p_1-p_2)\|^2, \quad b_2 = 2\langle A(p_1-p_2), Ap_2\rangle, \quad b_3 = \|Ap_2\|^2, \qquad (5)$$

we can write

$$Q(\gamma) = \|Ap(\gamma)\|^2 = b_1\gamma^2 + b_2\gamma + b_3.$$


From (3) it follows that

$$H'(\gamma) = \frac{P'(\gamma)Q(\gamma) - P(\gamma)Q'(\gamma)}{Q(\gamma)^2}. \qquad (6)$$

Also, we can write

$$P(\gamma) = \left(\gamma\langle g_0, A(p_1-p_2)\rangle + \langle g_0, Ap_2\rangle\right)^2,$$

which can be written as

$$P(\gamma) = (c_1\gamma + c_2)^2, \qquad (7)$$

with $a_1 = c_1^2$, $a_2 = 2c_1c_2$, $a_3 = c_2^2$. Thus, from (7) we get

$$P'(\gamma)Q(\gamma) - P(\gamma)Q'(\gamma) = (c_1\gamma + c_2)\left[(b_2c_1 - 2b_1c_2)\gamma + (2c_1b_3 - c_2b_2)\right]. \qquad (8)$$

Therefore, from (7) and (8) we obtain that one root is $\gamma_1 = -c_2/c_1$, but for this value $P(\gamma_1) = 0$. In other words, $\gamma_1$ is a maximizer of the merit function

$$F(\gamma) = \|r_0\|^2\left(1 - \cos^2(r_0, Ap(\gamma))\right).$$

The remaining root is

$$\gamma_2 = \frac{b_2c_2 - 2c_1b_3}{c_1b_2 - 2b_1c_2},$$

which after some straightforward calculations can be written as

$$\gamma_2 = \frac{\langle A(p_1-p_2),\ \langle g_0, Ap_2\rangle Ap_2 - \|Ap_2\|^2 g_0\rangle}{\langle A(p_1-p_2),\ \langle A(p_1-p_2), Ap_2\rangle g_0 - \langle g_0, Ap_2\rangle A(p_1-p_2)\rangle}.$$
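The closed form for $\gamma^*$ can be sanity-checked against a brute-force scan of the merit function; the following sketch (ours, with illustrative names) does exactly that, writing the formula with $u = A(p_1-p_2)$, $v = Ap_2$:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 5
A = rng.standard_normal((n, n)) + n * np.eye(n)   # full rank, generic
x_star = rng.standard_normal(n)
b = A @ x_star
g0 = A @ np.zeros(n) - b                          # g0 = A x0 - b with x0 = 0

p1, p2 = rng.standard_normal(n), rng.standard_normal(n)
u, v = A @ (p1 - p2), A @ p2                      # u = A(p1 - p2), v = A p2

# gamma* from Theorem 2.2, expanded in inner products of u, v, g0
num = (g0 @ v) * (u @ v) - (v @ v) * (g0 @ u)
den = (g0 @ u) * (u @ v) - (u @ u) * (g0 @ v)
gamma_star = num / den

def merit(gamma):
    Ap = gamma * u + v                            # equals A p(gamma)
    cos2 = (g0 @ Ap) ** 2 / ((g0 @ g0) * (Ap @ Ap))
    return (g0 @ g0) * (1.0 - cos2)               # merit function (2)

grid = np.linspace(gamma_star - 5.0, gamma_star + 5.0, 10001)
best = min(merit(g) for g in grid)
```

Since $\gamma^*$ is the global minimizer of the merit function along the family $p(\gamma)$, no grid point should ever beat it.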

3 Algorithms

Given $A \in \mathbb{R}^{n \times n}$, a starting point $x_0$ and a tolerance $\varepsilon_1 > 0$, Algorithm 1 has the following structure. Set $r_0 = Ax_0 - b$ and $k = 0$. At iteration $k$ it computes two candidate directions, combines them into $p_k$ by means of the optimal $\gamma^*$ of Theorem 2.2, and computes the step length, taking in the generic branch

$$\lambda_k = -\frac{\langle r_k, Ap_k\rangle}{\|Ap_k\|^2}.$$

Then

$$x_{k+1} = x_k + \lambda_k p_k, \qquad r_{k+1} = Ax_{k+1} - b,$$

and the iteration stops when $\|r_{k+1}\| < \varepsilon_1\|r_0\|$.
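The following sketch shows one possible instantiation of the scheme just described: the residual direction is combined with the previous search direction through the $\gamma^*$ of Theorem 2.2, and the optimal step is taken. All names are illustrative, and the specific choice of the two candidate directions is our assumption, not necessarily the paper's exact Algorithm 1:

```python
import numpy as np

def combined_direction_solve(A, b, x0, eps1=1e-10, max_iter=2000):
    """Combine the residual direction with the previous search direction
    via the optimal gamma* of Theorem 2.2, then take the optimal step.
    (An illustrative reading of the scheme, not the paper's listing.)"""
    x = x0.copy()
    r = A @ x - b                            # residual, r = Ax - b
    r0_norm = np.linalg.norm(r)
    p_prev = None
    for _ in range(max_iter):
        if np.linalg.norm(r) < eps1 * r0_norm:
            break
        p1 = -r                              # steepest-descent direction
        if p_prev is None:
            p = p1
        else:
            u, v = A @ (p1 - p_prev), A @ p_prev
            den = (r @ u) * (u @ v) - (u @ u) * (r @ v)
            if abs(den) > 1e-14:
                gamma = ((r @ v) * (u @ v) - (v @ v) * (r @ u)) / den
                p = gamma * p1 + (1.0 - gamma) * p_prev
            else:
                p = p1                       # fall back to the residual direction
        Ap = A @ p
        lam = -(r @ Ap) / (Ap @ Ap)          # optimal step along p
        x = x + lam * p
        r = A @ x - b
        p_prev = p
    return x, np.linalg.norm(r)

# Example on a small symmetric positive definite system
rng = np.random.default_rng(2)
n = 8
M = rng.standard_normal((n, n))
A = M @ M.T + 20 * np.eye(n)                 # SPD, moderately conditioned
x_true = rng.standard_normal(n)
x, res = combined_direction_solve(A, A @ x_true, np.zeros(n))
```

Because the combined direction is, per step, at least as good as steepest descent (the merit of $p(\gamma^*)$ cannot exceed the merit at $\gamma = 1$), the residual norm decreases geometrically on SPD systems.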


Now, let us prove that Algorithm 1 is globally convergent for symmetric matrices.

Theorem 3.1: Given the system $Ax = b$, with $A \in \mathbb{R}^{n \times n}$ symmetric positive definite, suppose there exists $\delta_0 > 0$ such that

$$\cos\theta_k = -\frac{\langle g_k, p_k\rangle}{\|g_k\|\,\|p_k\|} \geq \delta_0 > 0 \quad \text{for all } k.$$

Then, for any starting point $x_0 \in \mathbb{R}^n$, the sequence generated by Algorithm 1 satisfies $\lim_{k\to\infty}\|g_k\| = 0$.

Proof. Consider $f(x) = \frac{1}{2}\langle x, Ax\rangle - \langle b, x\rangle$, whose gradient at $x_k$ is $g_k = Ax_k - b$. Since $\lambda_k$ is the exact minimizer of $f$ along $p_k$,

$$\langle g_{k+1}, p_k\rangle = 0. \qquad (13)$$

Hence, from (13) and the update $g_{k+1} = g_k + \lambda_k Ap_k$, we get

$$\langle g_{k+1} - g_k, p_k\rangle = -\langle g_k, p_k\rangle = \lambda_k\langle Ap_k, p_k\rangle. \qquad (14)$$

On the other hand, if $\beta_1$ denotes the largest eigenvalue of $A$,

$$\langle g_{k+1} - g_k, p_k\rangle = \lambda_k\langle Ap_k, p_k\rangle \leq \beta_1\,\lambda_k\,\|p_k\|^2. \qquad (15)$$

Then, using (14) and (15), we obtain

$$\lambda_k \geq -\frac{\langle g_k, p_k\rangle}{\beta_1\|p_k\|^2}. \qquad (16)$$


Also,

$$f(x_{k+1}) = f(x_k) + \lambda_k\langle g_k, p_k\rangle + \frac{\lambda_k^2}{2}\langle p_k, Ap_k\rangle = f(x_k) + \frac{\lambda_k}{2}\langle g_k, p_k\rangle$$

because of (14). Defining $c = -1/(2\beta_1)$ and using (16), we finally get

$$f(x_{k+1}) \leq f(x_k) + c\,\|g_k\|^2\cos^2\theta_k. \qquad (17)$$

Since $f(x)$ is bounded below, it follows from (17) that

$$\sum_{k=1}^{\infty}\|g_k\|^2\cos^2\theta_k < \infty \quad \text{(the Zoutendijk condition [9]).}$$

Taking into account that $\cos^2\theta_k \geq \delta_0^2 > 0$, we conclude that

$$\lim_{k\to\infty}\|g_k\| = 0.$$
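The descent inequality (17) is easy to verify numerically for the steepest-descent choice $p_k = -g_k$ (so that $\cos\theta_k = 1$) on a small SPD quadratic; a sketch with illustrative names, not part of the original text:

```python
import numpy as np

rng = np.random.default_rng(3)
n = 6
M = rng.standard_normal((n, n))
A = M @ M.T + np.eye(n)                  # symmetric positive definite
b = rng.standard_normal(n)
beta1 = np.linalg.eigvalsh(A)[-1]        # largest eigenvalue of A

def f(x):
    return 0.5 * (x @ A @ x) - b @ x

x = rng.standard_normal(n)
ok = True
for _ in range(20):
    g = A @ x - b                        # gradient g_k
    if g @ g < 1e-20:
        break
    p = -g                               # steepest descent: cos(theta_k) = 1
    lam = -(g @ p) / (p @ (A @ p))       # exact line search
    x_new = x + lam * p
    # inequality (17) with c = -1/(2*beta1) and cos^2(theta_k) = 1
    ok = ok and f(x_new) <= f(x) - (g @ g) / (2 * beta1) + 1e-10
    x = x_new
print(ok)
```

Exact line search gives $f(x_{k+1}) = f(x_k) - \|g_k\|^4 / (2\,g_k^T A g_k)$, and $g_k^T A g_k \leq \beta_1\|g_k\|^2$ yields (17), so the check must pass for every iteration.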

4 Numerical Experiments

In order to test Algorithm 1, an experimental Fortran program was written. The following results were obtained using a 486 DX2-66 MHz PC and the Microsoft Fortran 5.1 compiler. We compare Algorithm 1 with PCG (the Preconditioned Conjugate Gradient method, as implemented in the IMSL library) and GMRES(n).

MAT1: Let $A_n = (a_{ij})$ be the nonsymmetric matrix defined by

$$a_{ij} = \begin{cases} 1, & j \geq i \\ a_j, & j < i. \end{cases}$$

The determinant of $A_n$ is given by

$$\det(A_n) = (1 - a_1)(1 - a_2)\cdots(1 - a_{n-1}).$$

c Investigacion Operativa 1997

112

H. D. Scolnik 

New Algorithms for Solving Large Sparse Systems ...

We chose $n = 50$ and $a_j = t$ for $j = 1, \dots, n-1$. The system has solution $x_i = 1$ and starting point $x_i^0 = 0.5$ for $i = 1, \dots, n$.

Method   t     CPU    ||r||        ||x - x*||
New      0.8   0.17   0.130e-9     0.991e-10
PCG      0.8   2.09   0.134e-9     0.251e-9
GMRES    0.8   0.33   0.772e-8     0.672e-7
New      0.9   0.16   0.501e-9     0.787e-9
PCG      0.9   1.53   0.108e-8     0.130e-6
GMRES    0.9   0.28   0.765e-9     0.186e-6

MAT2: Let us consider the Hilbert matrix $A_n = (a_{ij}) = \left(\frac{1}{i+j-1}\right)$, $i, j = 1, \dots, n$, and the linear system $A_n x = b$ for $n = 7$, with the same starting point and solution as before. The condition number is approximately $e^{3.5n}$, and $\det(A_7) = 4.8358 \cdot 10^{-25}$.

Method   CPU    ||r||        ||x - x*||
New      0.03   0.533e-6     0.381e-2
PCG      0.04   0.307e-5     0.175e-2
GMRES    0.03   0.268e-6     0.746e-2

Remark 4.1: PCG cannot reach lower values of $\|r\|$ because of numerical failure. Note that iterates closer to $x^*$ do not necessarily imply lower values of $\|r\|$. Reference: [5].
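Both test matrices are straightforward to reproduce. A short sketch (ours, not from the paper) builds them and checks the stated determinant formula and the severe ill-conditioning of the Hilbert matrix:

```python
import numpy as np

def mat1(n, t):
    """MAT1: a_ij = 1 for j >= i and a_ij = a_j = t for j < i."""
    A = np.triu(np.ones((n, n)))
    A[np.tril_indices(n, -1)] = t
    return A

def hilbert(n):
    """MAT2: the Hilbert matrix a_ij = 1/(i + j - 1)."""
    i, j = np.indices((n, n))
    return 1.0 / (i + j + 1)

# det(A_n) = (1 - a_1)...(1 - a_{n-1}) = (1 - t)^(n-1) when all a_j = t
det_ok = np.isclose(np.linalg.det(mat1(5, 0.8)), (1 - 0.8) ** 4)
cond_h7 = np.linalg.cond(hilbert(7))       # on the order of 10^8 for n = 7
```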

Conclusion: The results presented in this paper can be applied to a variety of possible row-action algorithms. Here we just compared an experimental program implementing one possible algorithm against two professionally developed products. These first results are very encouraging because they show that the new algorithms can deal efficiently with ill-conditioned symmetric and nonsymmetric matrices.

5 Applications to Nonlinear Optimization

Let us consider the unconstrained minimization problem

$$\min_{x \in \mathbb{R}^n} f(x).$$

The results of the previous sections can be combined with quasi-Newton updates of an approximation $H$ to the inverse Hessian. Define the scalars

$$\tau_2 = a\langle v, g_+\rangle, \quad \tau_3 = \|A_+ Z g_+\|^2, \quad \tau_4 = \langle A_+ Z g_+, A_+ v\rangle, \quad \tau_5 = \langle g_+, A_+ v\rangle, \quad \tau_6 = \|A_+ v\|^2;$$

the approximation $H$ is then updated by a rank-correction formula expressed in terms of these quantities.


Another way of trying to approximate Newton's direction in some sense (due to M. F. Marazzi) could be obtained by solving

$$\min_\tau \|H(\tau)\nabla^2 f(x) - I\| \qquad (21)$$

for some matrix norm, where $H(\tau)$ denotes the parameterized update of the inverse Hessian approximation. If the Hessian of $f$ is not available, we may replace it by the current approximation $B \approx H^{-1}$, which is at hand, and solve

$$\min_\tau \|H(\tau)B - I\| \qquad (22)$$

instead of solving (21).

Lemma 5.2: The unique solution to (22) in the weighted Frobenius norm $\|\cdot\|_H \equiv \|H^{-1/2}(\cdot)H^{-1/2}\|_F$ is available in closed form as a quotient of inner products involving $H$.
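Once a parameterization of $H$ is fixed, problem (22) is a linear least-squares problem in the free scalar. As a toy illustration (entirely our construction: the one-parameter family $H(\tau) = H + \tau vv^T$, the matrices, and all names are hypothetical, and the paper's actual parameterization and weighted norm are not used here), minimizing $\|H(\tau)B - I\|_F$ over $\tau$ reduces to a one-dimensional projection:

```python
import numpy as np

rng = np.random.default_rng(4)
n = 5
H = np.eye(n) + 0.1 * rng.standard_normal((n, n))  # current inverse-Hessian approx.
B = np.eye(n) + 0.1 * rng.standard_normal((n, n))  # stand-in for B ~ H^{-1}
v = rng.standard_normal(n)

# Hypothetical one-parameter family H(tau) = H + tau * v v^T (our assumption)
R = H @ B - np.eye(n)                  # residual of (22) at tau = 0
C = np.outer(v, v) @ B                 # derivative of H(tau) @ B with respect to tau
tau = -np.sum(R * C) / np.sum(C * C)   # least-squares minimizer of ||R + tau*C||_F

def obj(t):
    return np.linalg.norm((H + t * np.outer(v, v)) @ B - np.eye(n), "fro")
```

Because the objective is a convex quadratic in $\tau$ (the Frobenius norm of an affine function of $\tau$), the computed `tau` is its global minimizer.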

Remark 5.1: These results can be used with automatic differentiation.

Acknowledgment: The author acknowledges very insightful discussions with N. Echebest, M.T. Guardarucci, M. Marazzi, C. Ruscitti and M.C. Vacchino.

References

[1] R. Bramley and A. Sameh, "Row Projection Methods for Large Nonsymmetric Linear Systems". SIAM J. Sci. Stat. Comput., 13, 1992, pp. 168-193.

[2] G. Cimmino, "Calcolo approssimato per le soluzioni dei sistemi di equazioni lineari". La Ricerca Scientifica ed il Progresso Tecnico nell'Economia Nazionale (Roma), Consiglio Nazionale delle Ricerche, Ministero dell'Educazione Nazionale, 9, 1, 1938, pp. 326-333.

[3] H. Elman, Iterative Methods for Large, Sparse, Nonsymmetric Systems of Linear Equations. Ph.D. Thesis and Research Report No. 229, Dept. of Computer Science, Yale University, 1982.

[4] U. García Palomares, "Parallel Projected Aggregation Methods for Solving the Convex Feasibility Problem". SIAM J. Opt., 3, 1993, pp. 882-900.

[5] R. Gregory and D. Karney, A Collection of Matrices for Testing Computational Algorithms. J. Wiley, 1969.

[6] G. H. Golub and C. F. Van Loan, Matrix Computations. Third Edition, The Johns Hopkins University Press, 1996.

[7] L. Hageman and D. Young, Applied Iterative Methods. Academic Press, 1981.


[8] S. Kaczmarz, "Angenäherte Auflösung von Systemen linearer Gleichungen". Bulletin International de l'Académie Polonaise des Sciences et des Lettres, Classe des Sciences Mathématiques et Naturelles, Série A: Sciences Mathématiques, 1937, pp. 355-357.

[9] G. Zoutendijk, "Nonlinear Programming, Computational Methods", in Integer and Nonlinear Programming, J. Abadie, ed., North-Holland (Amsterdam), 1970, pp. 37-86.
