Bounds for the Smallest and the Largest ... - Semantic Scholar

International Journal of Algebra, Vol. 9, 2015, no. 8, 379 - 394 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ija.2015.5954

Bounds for the Smallest and the Largest Eigenvalues of Hermitian Matrices1 Rachid Marsli Mathematics and Statistics Department King Fahd University of Petroleum and Minerals Dhahran, 31261, Kingdom of Saudi Arabia c 2015 Rachid Marsli. This article is distributed under the Creative ComCopyright mons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract In this paper the author derives a lower bound for the largest eigenvalue and an upper bound for the smallest eigenvalue of Hermitian matrices, based on Weyl’s inequalities. Some related results, consequences, applications, and examples are provided. In the fifth section, the main result is applied to some integer matrices.

Mathematics Subject Classification: 15A18 Keywords: Hermitian matrix; smallest eigenvalue; largest eigenvalue; spread

1

Introduction

In matrix theory, some of the most useful inequalities are Weyl’s inequalities, named after Hermann Weyl, and which compare the eigenvalues of the sum A1 + A2 of n × n Hermitian matrices with the sum of the eigenvalues of A1 and A2 . These inequalities are as follows, for 1 ≤ i ≤ n: λi (A1 + A2 ) ≤ λi+j (A1 ) + λn−j (A2 ) for j = 0, 1, ..., n − i ; λi (A1 + A2 ) ≥ λi−j+1 (A1 ) + λj (A2 ) for j = 1, 2, ..., i, 1

This work was partially supported by NSA grant, H98230-14-1-0152.

380

Rachid Marsli

where λ1 (A1 ) ≤ λ2 (A1 ) ≤ ... ≤ λn (A1 ) are the eigenvalues of A1 in nondecreasing order, and similarly for A2 and A1 + A2 . In particular we have λi (A1 ) + λ1 (A2 ) ≤ λi (A1 + A2 ) ≤ λi (A1 ) + λn (A2 ),

(1)

for 1 ≤ i ≤ n. Letting i = 1 and i = n we then have, respectively: λn (A1 + A2 ) ≤ λn (A1 ) + λn (A2 ), and λ1 (A1 + A2 ) ≥ λ1 (A1 ) + λ1 (A2 ). For t n × n Hermitian matrices A1 , A2 , ..., At , we get λn (

t X

Ak ) ≤

t X

k=1

k=1

t X

t X

(λn (Ak )),

(2)

(λ1 (Ak )).

(3)

and λ1 (

k=1

Ak ) ≥

k=1

In the following sections, based on the above inequalities, we derive some other inequalities and bounds for the eigenvalues of Hermitian matrices. We also provide related results, consequences, applications, and examples. For these purposes we use the following notation. Let S = {P1 , . . . , Pn! } be the set of all n × n permutation matrices, let w be any vector in Cn , and let A be an n × n matrix. Then we define: L(A) =

n! X

PkT APk ;

k=1

for n ≥ 2, n

n

n

n

n

n

X 1 XX 1X 1 aij , aii − aij }; γ1 (A) = min{ n i=1 j=1 n i=1 n(n − 1) i6=j X 1 XX 1X 1 γ2 (A) = max{ aij , aii − aij }; n i=1 j=1 n i=1 n(n − 1) i6=j for n = 1, γ1 (A) = γ2 (A) = a11 . As usual, ρ(A) is the spectral radius of A, kAk1 and kAk∞ are the respective induced matrix norms of A, and kwk2 is the Euclidean norm of vector w . As before, if A is Hermitian, then we use the ordering λ1 (A) ≤ λ2 (A) ≤ ... ≤ λn (A).

Bounds for the smallest and the largest eigenvalues of Hermitian matrices

2

381

The main result

Lemma 2.1. Let A = [aij ] be an n × n Hermitian matrix. Then (L(A))ii = (n − 1)!

n X

app ,

for i = 1, 2 · · · n;

p=1

and (L(A))ij = (n − 2)!

X

apq ,

for i 6= j.

p6=q

Proof. Let e1 , e2 , ..., en be the n canonical vectors in Cn . In general, for 1 ≤ i ≤ n and 1 ≤ k ≤ n, ei is the k th column in (n − 1)! elements of S. Also, for i 6= j and k 6= m, ei and ej are respectively the k th and mth columns in (n − 2)! elements of S. Let ek,i be the ith column of the permutation matrix Pk . Then  T  ek,1  eTk,2    PkT A Pk =  .  A ek,1 ek,2 · · · ek,n ,  ..  eTk,n so that (PkT A Pk )ii = eTk,i A ek,i which is equal to a11 for (n − 1)! terms of L(A)ii of the form (PkT A Pk )ii since ek,i = e1 for (n − 1)! permutation matrices, and it is equal to a2,2 for another (n − 1)! terms and so on. Hence, for 1 ≤ i ≤ n, (L(A))ii =

n! X

eTk,i A ek,i = (n − 1)!(a11 ) + (n − 1)!(a22 ) + ... + (n − 1)!(ann )

k=1

= (n − 1)!

n X

app .

p=1

(PkT A Pk )ij = eTk,i A ek,j ,

Also, for i 6= j,

which is equal to a12 for (n − 2)! terms of (L(A))ij , since ek,i and ek,j are respectively, the first and the second columns in (n − 2)! elements of S. In an analogous way, eTk,i A ek,j is equal to a1,3 for another (n − 2)! terms and so on. Hence, for i 6= j, (L(A))ij =

n! X k=1

eTk,i A ek,j = (n − 2)!(a12 ) + (n − 2)!(a13 ) + ...

382

Rachid Marsli

= (n − 2)!

n X

apq .

p6=q

Applying inequalities (2) and (3) to L(A), and using the fact that all the Hermitian matrices (Pk A Pk ) are similar to A, and therefore have the same eigenvalues, we obtain: Lemma 2.2. Let A be an n × n Hermitian matrix. Then λ1 (A) ≤ λ1 (L(A))/n! ≤ λn (L(A))/n! ≤ λn (A). Lemma 2.3. Let α and β be complex numbers, and let M be an n × n matrix with α on all diagonal positions and β on all off-diagonal positions. Then M has eigenvalues α−β with multiplicity n−1 and α+(n−1)β with multiplicity 1. Proof. Observe that M = (α − β)I + βJ where I is the identity matrix and J is the all 10 s matrix. Now J has eigenvalues n , 0 ... 0, so that M has eigenvalues (α − β) + nβ, ie α + (n − 1)β, with multiplicity 1 and α − β with multiplicity n − 1. We now come to the main result of the paper. Theorem 2.4. Let A be an n × n Hermitian matrix. Then λ1 (A) ≤ γ1 (A) ≤ γ2 (A) ≤ λn (A). Proof. By Lemma 2.1, L(A) has one distinct diagonal entry which is equal n X to (n − 1)! aii and one distinct off-diagonal entry which is equal to (n − i=1 X aij . Hence, by Lemma 2.3 2)! i6=j

λ1 (L(A)) = min{(n − 1)!

n X

aii − (n − 2)!

i=1

X

aij ,

(n − 1)!

n XX

aij }

i=1 j=1

i6=j

= n!γ1 (A), and λn (L(A)) = max{(n − 1)!

n X

aii − (n − 2)!

i=1

= n!γ2 (A). Then the theorem follows from Lemma (2.2).

X i6=j

aij , (n − 1)!

n XX i=1 j=1

aij }


383

Remarks 1. Let B be an n × n Hermitian matrix with one distinct diagonal element α and one distinct off-diagonal elements β. Then, by Lemma 2.3 and the definition of γ1 (A) and γ2 (A), γ1 (B) = λ1 (B) = min{α − β , α + (n − 1)(β)}, and γ2 (B) = λ2 (B) = max{α − β , α + (n − 1)(β)}. Since γ1 (B), γ2 (B), λ1 (B), and λ2 (B) are continuous functions of the entries of B, a relatively small perturbation applied to B and leading to a Hermitian matrix C will result into small differences (γ1 (C) − λ1 (C)) and (λn (C) − γ2 (C)). As an example, let     200 100 100 201 101 102 B = 100 200 100 and C = 101 202 103 . 100 100 200 102 103 203 Then ( λ1 (B) = γ1 (B) = 100 λ3 (B) = γ2 (B) = 400

( λ1 (C) ≈ 99.4183 ≤ 100 = γ1 (C) and λ3 (C) ≈ 406.0087 ≥ 406 = γ2 (C)

2. For a real n × n matrix A = [aij ] with real eigenvalues, which is not necessary Hermitian, we have n 1 X aii ≤ λn (A). λ1 (A) ≤ n i=1

This follows from the fact that

n X i=1

aii =

n X

λi . If A is Hermitian, then

i=1

we also have n

λ1 (A) ≤ γ1 (A) ≤

1X aii ≤ γ2 (A) ≤ λn (A). n i=1

Hence, for every Hermitian matrix A, γ1 (A) and γ2 (A) give equal or trace(A) better bounds for, respectively, λ1 (A) and λn (A) than . n 3. If a matrix is not Hermitian then Theorem 2.4 may apply to it and may not. For example, Theorem 2.4 applies to 1 −1 C = −2 1

384

Rachid Marsli

and does not apply to

0 −1 0 −1 A = and B = 0 0 −2 0 4. For the eigenvalues of a given n × n Hermitian matrix A = [aij ] other than λ1 (A) and λn (A), we can derive an upper bound that is dependent on the entries of A and λn (A), and a lower bound dependent on the entries of A and λ1 (A). In fact for 1 < i < n, inequalities in (1) applied to L(A), with induction and argument similar to the proof of Lemma 2.2, imply λi (L(A)) − (n! − 1) λn (A) ≤ λi (A) ≤ λi (L(A)) − (n! − 1) λ1 (A), which leads to λn (A) − n!h1 ≤ λi (A) ≤ λ1 (A) + n!h2 , with h1 = λn (A) − γ1 (A) ; h2 = γ1 (A) − λ1 (A) if

X

aij ≥ 0,

i6=j

and h1 = λn (A) − γ2 (A) ; h2 = γ2 (A) − λ1 (A) if

X

aij ≤ 0.

i6=j

3

Consequences and applications of Theorem 2.4

The following corollary of Theorem 2.4 gives a lower bound for the spread of a Hermitian matrix in terms of its entries. Corollary 3.1. Let A = [aij ] be an n×n Hermitian matrix, with n ≥ 2. Then P | i6=j aij | . λn (A) − λ1 (A) ≥ n−1 Proof. From Theorem 2.4, we have λ1 (A) ≤ γ1 (A) ≤ γ2 (A) ≤ λn (A), which implies: P | i6=j aij | λn (A) − λ1 (A) ≥ γ2 (A) − γ1 (A) = . n−1 The next result also follows easily from Theorem 2.4, and gives necessary conditions for Hermitian matrices to be similar to each other.


385

Corollary 3.2. Let A and C be two Hermitian matrices that are similar to each other, and let λ1 = λ1 (A) = λ1 (C) and λn = λn (A) = λn (C). Then, in addition to the inequalities γ2 (A)−γ1 (A) ≤ λn − λ1 and γ2 (C)−γ1 (C) ≤ λn − λ1 , we also have |γ2 (C) − γ2 (A)| ≤ λn − λ1 ; |γ2 (C) − γ1 (A)| ≤ λn − λ1 ; |γ1 (C) − γ1 (A)| ≤ λn − λ1 ; |γ1 (C) − γ2 (A)| ≤ λn − λ1 . Example 3.3. Let   1 0 0 0 A = 0 3 0 0 −2



 1 −2 −4 and C = −2 2 −1 . −4 −1 3

Then λ3 (A) − λ1 (A) = 5 and γ2 (C) − γ1 (C) = 7. This implies, according to Corollary 3.2, that A and C are not similar. Corollary 3.4. Let A be a Hermitian matrix, and kAk be a matrix norm of A. Then max {|γ1 (A)| , |γ2 (A)|} ≤ ρ(A) ≤ kAk. and ρ(Ak ) ≥ max {|γ1 (A)|k , |γ2 (A)|k }. Proof. Clearly, −ρ(A) ≤ λ1 ≤ γ1 (A) ≤ γ2 (A) ≤ λn (A) ≤ ρ(A) ≤ kAk. The second part of the corollary follows from the first part and the fact that ρ(Ak ) = (ρk (A)). Theorem 3.5. Let A and B be n × n commuting Hermitian matrices. Let α and β be, respectively, eigenvalues of A and B, such that |α| = ρ(A), |β| = ρ(B), Av = αv and Bv = βv for some nonzero vector v. Then for every positive integers k and m we have ρ(Ak B m ) ≥ max {(|γ1 (A))|k , |γ2 (A)|k } max {(|γ1 (B))|m , |γ2 (B)|m } . Proof. αβ is an eigenvalue of A B, so that ρ(A B) = αβ = ρ(A)ρ(B) and more generally ρ(Ak B m ) = ρ(A)k ρ(B)m . Then we apply Corollary 3.4. Theorem 3.6. Let N be an n × n normal matrix and let β1 , β2 · · · βn be its eigenvalues with |β1 | ≤ |β2 | ≤ · · · ≤ |βn |. Then p p |β1 | ≤ γ1 (N N ∗ ) ≤ γ2 (N N ∗ ) ≤ |βn |. In particular, if γ1 (N N ∗ ) = 0, then N is singular.

386

Rachid Marsli

Proof. Apply Theorem 2.4 to the matrix N N ∗ which is hermitian, and its eigenvalues are |β1 |2 , |β2 |2 , · · · , |βn |2 . Hermitian matrices are normal. Therefore we have: Corollary 3.7. Let A be Hermitian matrix. If γ1 (A2 ) = 0 then A is singular. Lemma 3.8. Let A be an n × n matrix, and let C2 (A) be the matrix with the same entries as A, except one off-diagonal entry of smallest absolute value in each row of A is replaced by 0 in C2 (A). Let λ be an eigenvalue of A with geometric multiplicity greater or equals to 2. Then n o |λ| ≤ min kC2 (A)k1 , kC2 (A)k∞ . The above result and its proof can be found in [3]. Theorem 3.9. Let A be an n × n Hermitian matrix and let C2 (A) be as in Lemma 3.8. n o If min kC2 (A)k1 , kC2 (A)k∞ < γ2 (A), then λn (A) is simple, and o n if min kC2 (A)k1 , kC2 (A)k∞ < −γ1 (A), then λ1 (A) is simple. Proof. The first part of the theorem follows from Theorem 2.4 and Lemma 2.3. In the second part, use the facts: λ1 (A) = −λn (−A), γ2 (A) = −γ1 (−A), kC2 (A)k1 = kC2 (−A)k1 and kC2 (A)k∞ = kC2 (−A)k∞ Then apply Theorem 2.4 and Lemma 2.3 to (−A). Example 3.10. Let 

 3 3 4 A = 3 2 3 , 4 3 0

kC2 (A)k1 = 7
0, and define r1 = min{

1 1 , }, −1 |γ1 (A )| γ2 (A−1 )

r2 = max{

1 1 , }. −1 |γ1 (A )| γ2 (A−1 )

and

Then Theorem 3.11 implies that min{α , |β|} ≤ r1 , and max{α , |β|} ≤ r2 . Based on the discussion above, we state the following corollary. Corollary 3.12. Let A be a nonsingular Hermitian matrix such that γ1 (A) < 0 and γ2 (A) > 0. Let rˆ1 = min

1 1 1 1 , , and rˆ2 = max |γ1 (A)| γ2 (A) |γ1 (A)| γ2 (A)

. Then • A−1 has both positive and negative eigenvalues, • A−1 has at least one eigenvalue within the interval [−ˆ r1 , rˆ1 ], • A−1 hast at least two distinct eigenvalues within the the interval [−ˆ r2 , rˆ2 ]. Remark 3.13. Let A be a Hermitian matrix for which γ1 (A) < 0 and γ2 (A) > 0. Then A has both positive and negative eigenvalues. If we interchange the positions of some diagonal entries of A, or if we interchange the positions of some off-diagonal entries of A such that the obtained matrix B is Hermitian, then B also has negative and positive eigenvalues. This is because γ1 (A) = γ1 (B) and γ2 (A) = γ2 (B).

388

Rachid Marsli

Example 3.14. Consider the Hermitian matrix     0 3 2 λ1 (A) ≈ −6, 4058 1  with A = 3 −5 λ2 (A) ≈ −2.3351   2 1 −1 λ3 (A) ≈ 2.7409 Since A has both positive and negative eigenvalues, then according to Remark 3.13, matrices such as     0 3 1 −5 3 2 2  and C =  3 −1 1 B = 3 −5 1 2 −1 2 1 0 has both positive and negative eigenvalues.

4

Some other results related to positive definite matrices

An argument similar to the proof of Theorem 3.11 yields the following. Theorem 4.1. Let A be a positive definite or a negative definite matrix. Then 1 1 ≥ γ2 (A−1 ) ≥ γ1 (A−1 ) ≥ . λ1 (A) λn (A) Example 4.2. The following example illustrates the case of a positive definite 1 gives a better upper-bound for λ1 (A) than γ1 (A). matrix where γ2 (A−1 )     2 0 1 3 1 −2 1 3 −2 If A = 0 2 1 , then A−1 =  1 4 1 1 2 −2 −2 4 and λ1 (A) ≈ 0.5858 ≤

1 ≈ 0.9231 ≤ γ1 (A) ≈ 1.3333. γ2 (A−1 )

Theorem 4.3. Let A be a Hermitian matrix. If γ1 (A) γ2 (A) ≤ 0, then A is neither positive definite nor negative definite. This is also the case if A is nonsingular and γ1 (A−1 ) γ2 (A−1 ) ≤ 0. In other words, for A to be positive definite, both γ1 (A) and γ2 (A) need to be positive, and for A to be negative definite, both γ1 (A) and γ2 (A) need to be negative.


389

Proof. The first part follows directly from Theorem 2.4, and the second follows from Theorem 4.1

Next, we give lower bound for the condition number of positive definite and negative definite matrices. Theorem 4.4. Let A and B be, respectively, positive definite and negative definite. Then λn (A) γ2 (A) ≥ > 0 and λ1 (A) γ1 (A)

5

λ1 (B) γ1 (B) ≥ > 0. λ2 (B) γ2 (B)

Application of Theorem 2.4 to some integer matrices

We first give a result that follows directly from Theorem 2.4. Corollary 5.1. Let {An } be a sequence of n × n Hermitian matrices. If

lim γ2 (An ) = ∞,

n→∞

and if

5.1

lim λn (An ) = ∞,

then

n→∞

lim γ1 (An ) = −∞,

then

n→∞

lim λ1 (An ) = −∞.

n→∞

Toeplitz matrices

Consider the set of Toeplitz matrices of the form: 

1

     Tn =      

2

2 1

3 2 ..

. 3 2 .. . . . . . . . .. .. . . n ... ...

... .. . .. . .. . .. . 3

 ... n ..  .  . . ..  . . ,  .. . 3   .. . 2 2

1

390

Rachid Marsli

where n ranges over all positive integers greater or equal to 2. Then n−2

n

n−1 n X n }| { z }| { z X z }| { (Tn )ij = (1 + · · · + 1) + 2(2 + · · · + 2 + 2 (3 + · · · + 3) + · · · + 2n i=1 j=1

= n + 2(2(n − 1) + 2(3(n − 2) + · · · + 2(n(n − (n − 1))) = n + 2 [2n + 3n + · · · + n(n)] − [2 + 3 × 2 + · · · + n × (n − 1)] n−1 X n(n + 1) − 1)] − k(k + 1) = n + 2 [n( 2 k=1

=

n3 + 3 n2 − n . 3

Hence γ2 (Tn ) =

n2 + 3n − 1 , 3

and n X

γ1 (Tn ) =

i=1

X

(Tn )ii −

n X n X (Tn )ij

(Tn )ij

i6=j

n n(n − 1) Now by applying Theorem 2.4, we obtain:

=

i=1 j=1

n(n − 1)

= −

n+1 . 3

Theorem 5.2. Let Tn be defined as above with n ≥ 2. Then Tn is neither positive definite nor positive semidefinite, and we have n+1 λ1 (Tn ) ≤ − so that lim λ1 (Tn ) = −∞, n→∞ 3 and n2 + n − 1 λn (Tn ) ≥ so that lim λn (Tn ) = ∞. n→∞ 3

5.2

Hankel matrices

Consider the set of Hankel matrices Hn of the form:   1 2 3 ... ... n   . . . 2 .. .. 3 .. n+1    ..  . . . . . . . . 3 . . . . .     ..  . . . . Hn =  .. . . . . . . . . .  .  .  . . . . . . . . . . . . . . 2n − 2     n n + 1 . . . . . . 2n − 2 2n − 1


391

where n ranges over all positive integers greater or equal to 2. Then n X

(Hn )ii = n2 ;

i=1

and n X n X (Hn )ij = 1 + 2 × 2 + · · · + n × n + (n + 1) × (n − 1) + i=1 j=1

(n + 2) × (n − 1) + · · · + 2n − 1 = 1 + 4 + · · · + n2 + (n2 − 1) + (n2 − 4) + · · · + (n2 − (n − 1)2 ) = n2 + n2 + · · · + n2 = n3 Hence γ2 (Hn ) = n2

and γ1 (Hn ) = −1.

Therefore, by applying Theorem 2.4, we obtain: Theorem 5.3. Let Hn be defined as above, with n ≥ 2. Then Hn is neither positive definite nor semi-positive definite, and we have λ1 (Hn ) ≤ −1 and λn (Hn ) ≥ n2 .

5.3

GCD matrices

Let Bn = [bij ] be the n × n symmetric matrix such that bij = gcd(i, j) = the greatest common divisor of i and j. Since bii = i, n X n X

γ2 (Bn ) =

n X

bij

i=1 j=1

n

=

i=1

X

bij

n

X

bii +

bij

i6=j

n

2 n+1 = + 2

X

gcd(i, j)

i

Bounds for the Smallest and the Largest ... - Semantic Scholar

Bounds for the Smallest and the Largest ... - Semantic Scholar

Suggest Documents

Largest and Smallest q-Singular Values - Semantic Scholar

Largest and Smallest q-Singular Values - Semantic Scholar

UPPER BOUNDS FOR THE SECOND LARGEST

On Ostrowski's disk theorem and lower bounds for the smallest ...

On the smallest possible dimension and the largest ... - Science Direct

The Smallest Transistor-Based Nonautonomous ... - Semantic Scholar

On Approximating the Smallest Enclosing ... - Semantic Scholar

Solution Methodologies for the Smallest Enclosing ... - Semantic Scholar

Improved bounds for the largest eigenvalue of trees - Science Direct

Consistency and Generalization Bounds for ... - Semantic Scholar

The Bounds Test Approach for Cointegration and ... - Semantic Scholar

Lower Bounds and Exact Algorithms for the Graph ... - Semantic Scholar

BOUNDS AND CONSTRUCTIONS FOR NE.C ... - Semantic Scholar

generalized confidence intervals for the largest ... - Semantic Scholar

An Algorithm for Finding the Largest Approximately ... - Semantic Scholar

The Smallest Pair of Noncrossing Paths in a ... - Semantic Scholar

On Maximizing the Second Smallest Eigenvalue of ... - Semantic Scholar

Molecular delineation of the smallest commonly ... - Semantic Scholar

Energetics of the smallest: do bacteria breathe at ... - Semantic Scholar

The industry's smallest and lightest

Lower bounds for the scheduling problem with ... - Semantic Scholar

Tight Bounds for the Probability of Overfitting - Semantic Scholar

The Expressiveness of Silence: Tight Bounds for ... - Semantic Scholar

BEST BOUNDS FOR THE HILBERT TRANSFORM ... - Semantic Scholar