The Finite Dimensional Normed Linear Space Theorem ... - Oswego

9 downloads 176 Views 283KB Size Report
The claim that follows, which I have called the finite-dimensional normed linear space theorem, essen- tially says that all such spaces are topologically R.
The Finite Dimensional Normed Linear Space Theorem Richard DiSalvo Dr. Elmer Mathematical Foundations of Economics Fall/Spring, 2011-2012

The claim that follows, which I have called the nite-dimensional normed linear space theorem, essentially says that all such spaces are topologically cases the intuition we obtain in

2

R, R ,

and

carry over into not only higher dimension

R

Rn

3

Rn

with the Euclidean norm. This means that in many

by imagining intervals, circles, and spheres, respectively, will

but also any vector space that has nite dimension. Through-

out this discussion I assume that all scalars for forming linear combinations in this nite dimensional linear space are real numbers (I have heard of complex vector spaces, but in this mathematical foundations for economics independent study we are solely concerned with real vector spaces). By nite-dimensional normed linear space I mean a set

X

of vectors along with addition of vectors and

X is closed under arbitrary linear || · || : X → R satisfying the usual axioms of a norm (e.g., the triangle inequality), with the induced distance function on X dened in the usual way: ρX (x, y) = ||x−y||. Finally, by nite dimensional, I mean that there exists a nite set of vectors {x1 , x2 , ..., xn } ⊆ X that are linearly independent and span X (every vector in X can be written as a linear combination of vectors in {x1 , x2 , ..., xn }, and none of the xi are redundant). A standard example of such a vector space is X = R3 with Euclidean norm and basis {e1 , e2 , e3 } = {(1, 0, 0), (0, 1, 0), (0, 0, 1)}. These assumptions force X to be highly structured. In particular, it can be shown that every open ball in X is simply a translation and rescaling of the unit open ball centered around the origin. So the metric on X is, in this context, determined entirely by the shape of its unit ball. For example, the Euclidean norm

scalar multiplication satisfying the usual axioms for a vector space (e.g., combinations of its elements), and a norm function

has this unit ball be the normal sphere; the taxicab norm leads to a diamond-shape; the sup norm has its unit ball a cube. Within this context, we have the following.

Claim (The Finite-Dimensional Normed Linear Space Theorem ):1 1. Any nite-dimensional normed linear space is complete (Cauchy

⇐⇒

convergent for sequences in that

space). 2. In any nite-dimensional normed linear space, a set is compact if and only if it is closed and bounded (Compact

⇐⇒

Closed & Bounded.

space will always be

Rn )

Interestingly, in some applied contexts (when e.g.

the linear

this is used as the denition for compact set.).

3. All norms on a nite dimensional normed linear space induce the same topology on that space; that is, all norms dene (through their induced metric) the same open sets on that space.

(I.e., it does

not matter what norms are used for purposes of answering the question what are the open sets, and consequently answering questions of what are the closed sets, bounded sets, what sequences convergent and to what limiting vectors, and all other questions that are dened in terms of open sets.)

1I

followed the exercises on pages 122 through 125 of the following in order to develop these arguments:

Carter, Michael.

Foundations of Mathematical Economics.

Cambridge: MIT Press, 2001.

1

Lemma (the ratio of the total representation magnitude and the magnitude of any vector in a normed linear space is bounded):2 Let

S = {x1 , x2 , ..., xn } be a linearly independent c > 0 such that for every x ∈linS

set of vectors in a normed linear space. Then there is

a real number

||x|| ≥ c (|α1 | + |α2 | + · · · + |αn |) where

x

Proof:

is expressed as

x = α1 x1 + α2 x2 + · · · + αn xn .

∀c > 0, ∃w ∈lin{x1 , x2 , ..., xn } such that ||w|| < w = α1 x1 + α2 x2 + · · · + αn xn . 1 Step 1. Since the above is true for all positive c, it is true for all c in the descending set { m |m ∈ N}. 1 ∞ So let us consider the sequence { }m=1 . For every term in this aforementioned sequence we are guaranteed m m m |αm m 1 |+|α2 |+···+|αn | where the αi s are taken from (by the supposition) a wm ∈linS such that ||wm || < m 3 the basis representation for wm . Choose any particular list of these wm to dene the sequence {wm }. I seek to push the sequence of norms {||wm ||} to tend to 0, but it is not clear that I can do that with m m m the sum |α1 | + |α2 | + · · · + |αn | in the numerator of each norm. But I can dene another sequence {ym }, to be: given the chosen {wm } sequence, let ym = |αm |+|αm1|+···+|αm | wm (∀m), where the αi s are n 1 2 taken from the basis representation for wm . Then the norm of each term in the sequence {ym } is ||ym || = |+···+|αn | 1 1 1 m · |α1 |+|α2m = m , again where the ai s are from the basis ||wm || < |α1 |+|α2 |+···+|α |α1 |+|α2 |+···+|α n| n| representation for wm . Consequently, {yn } denes a sequence of vectors with corresponding sequence of 1 norms ||ym || < m → 0 (as m → ∞). m m m Step 2. My expression for {ym } can be written as ym = β1 x1 + β2 x2 + · · · + βn xn where the sum of Pn m 4 the absolute values of the betas satises For each m, for each i = 1, 2, 3, ..., n, we must i=1 |βi | = 1. m then have |βi | ≤ 1 (simply because the sum of these numbers is 1 and they are all positive, so none of Suppose for the sake of contradiction that

c (|α1 | + |α2 | + · · · + |αn |),

2 Comment:

where

My statement here is an expression of a rewriting the claim as:

the claim) such that for any

∃M > 0

(where

M =

x 6= 0, |α1 | + |α2 | + · · · + |αn | ||x|| there is a real number c > 0

1 in the statement of c

M ≥ Of course another way to express the thing is that

c≤

small enough so that for any

x 6= 0

||x|| |α1 | + |α2 | + · · · + |αn |

it is clear that there must be a small enough choice of

c

(or large enough choice of

M)

that works

globally

in a particular

given linear space using a particular basis. -

{(1, 0), (0, 1)} for p R2 . Then (x, y) ∈ (x, y) = x(1, 0)+y(0, 1) by this basis. I used the Euclidean norm of (x, y), that is ||(x, y)|| = x2 + y 2 . Given any xed k = ||(x, y)||, then it is intuitively clear that to maximize |x| + |y| we must choose (x, y) to be on a 45 degree ray emanating from the origin. Since the same situation occurs in all four directions, we might as well suppose that x and y √ are on the positive 45 degree ray, and so x = y ≥ 0. In such a case then ||(x, y)|| = x 2 and x + y = 2x. We want to choose c so as to ensure that ||(x, y)|| ≥ c(|x| + |y|), and this argument shows that in the worst case this expression will look like (for a √ √ given (x, y)) ||(x, y)|| = x 2 ≥ c(2x) =⇒ 2 ≥ 2c (the inequality is denitely true if x = 0), and so it would suce to take c √ Comment: To build my intuition, I tried nding such a

c

in

R2 .

Take, for example, the basis

R2 is expressed as

to be

2 1 (or √ ). 2 2

In three space (but same context), my intuition leads me to believe that a worst case would be again all coordinates being the same, resting upon the 45 degree ray in the positive octant. In such a case the expression boils down to

c=

√1 n



3 ≥ 3c,

that is, we can take

c=

3 3

√ x 3 ≥ 3cx,

which

√1 . The pattern is interesting enough to make a conjecturethat 3 will work in the statement of the lemma in Rn normed with Euclidean measurement using the standard basis

would be satised if taking



=

vectors as our linearly independent spanning set. The lemma, of course, is much more general, since it deals with arbitrary normed linear spaces and arbitrary spanning sets.

3 That

is,

{xm }

1 {m } in that each term xm in the {xm } sequence lists vectors 1 c is taken to be m . term wm by dividing wm by the exact necessary scalar for this

is a sequence that corresponds to the sequence

that are assumed to contradict the lemma we're trying to show when

4 They

satisfy this sum because I dened each

ym

from the

to happen. To be more precise, let

m m wm = αm 1 x1 + α 2 x2 + · · · + α n xn

be the basis representation for the

2

mth

term in the

w

sequence.

{ym }

them can be bigger than 1). The weights on the basis vectors used to write each term in the sequence individually dene sequences of real numbers, which we may denote coordinate sequences is bounded (by

1

{βik }∞ k=1 , i = 1, 2, 3, ..., n.

Each of these

in particular).

{β1k }, is bounded, the Bolzano-Weierstrass Then let p(m) be the strictly increasing sequence

We proceed as follows. Since the rst coordinate sequence, theorem guarantees that it has a convergence subsequence. of natural numbers such that the subsequence the limit of this sequence by

β¯1 .

Apply

p(·)

p(m)

{β1

}

of the rst coordinate sequence converges. Denote

to form sequence:

p(m)

yp(m) = β1

p(m)

x 1 + β2

x2 + · · · + βnp(m) xn

we know that the rst coordinate sequence of this new sequence converges to

β¯1 , but we know little about

the others so far. Yet, they are subsequences of their original coordinate sequences, and so they are bounded sequences of real numbers! So we are now guaranteed, again by the Bolzano-Weierstrass theorem, that the

p(m)

β2

q be the function from the range of p(m) q(p(m)) ¯ into the positive integers such that β2 converges to β2 . Now, apply this this function q to each of the coordinate sequences of yp(m) 's basis representation, obtaining a new sequence yq(p(m)) dened by: sequence of real numbers

has a convergent subsequence. Let

q(p(m))

yq(p(m)) = β1 q(p(m))

β1

Note that now

q(p(m))

x1 + β2

x2 + · · · + βnq(p(m)) xn

, being a subsequence of the convergent sequence

p(m)

β1

Also, now have have pinned down the second coordinate, so that it converges to Continue this process. That is, the third coordinate sequence

q(p(m))

{β3

, must still converge to

β¯2 .

β¯1 .

} in the above sequence of vectors

is a bounded sequence of real numbers and therefore has a convergent subsequence. Use the mapping which gives this convergent subsequence, which is from the range of

q(p(m))

into

N,

in order to restrict every

coordinate sequence so that the third coordinate sequence converges. Note that rst and second coordinate sequences still converge.

We can repeat this argument,

n

times in total, until every coordinate sequence

converges. Let

∗ ym = β1sm x1 + β2sm x2 + · · · + βnsm xn be the sequence of vectors thus obtained.

∗ ym

ym had the property that the 1, then every term in the sequence sm sm ∗ s has this property as well. The sequence {|β1 | + |β2 | + · · · + |βnm |} = {1} converges to 1. We know ym sm sm ¯ that if the sequence {β1 } converges to β1 , then the sequence {|β1 |} converges to |β¯1 |. We also know that Step 3. Now

is a subsequence of

ym .

Since every term in the sequence

sum of the absolute values of the weights in the basis representation was

if we have any set of sequences all of which converge individually, then their sum converges to the sum of the

sm {|β1sm | + |β2s m | + · · · + |β |} must converge to β¯1 + β¯2 + · · · + β¯n . n ∗ Since the limit of a sequence is unique, then in fact β¯1 + β¯2 + · · · + β¯n = 1. So the sequence ym converges ∗ ¯ ¯ ¯ ¯ to the vector y = β1 x1 + β2 x2 + · · · + βn xn where not all of the βi can be zero, since the sum of the absolute values of these coecients is 1. ∗ ∗ ∗ Now we have it. We know ||ym || → 0. Therefore, ||ym || → 0. Hence ym → 0. Yet, {ym } converges to

individual limits. So then the sequence

a vector which is a weighing of the basis vectors such that at least one weight must be nonzero. Therefore,

∗ {ym }

actually does not converge to the zero vector

claimed.

Then we dened

ym

So there must always exist such a constant

c,

as

by

ym = In the

⇒⇐.



n=2

1 1 m m w = m (αm 1 x1 + α2 x2 + · · · + αn xn ) m | + · · · + |αm | m m | + · · · + |αm | |αm | + |α |α | + |α n n 1 2 1 2

case, saying that the absolute values of the betas sum to 1 is the same as saying in the expression for

It's clear that this argument can be

α1 α2 |α1 | + |α2 | |α | + |α | + |α | + |α | = ||α | + |α || = 1 1 2 1 2 1 2 generalized for arbitrary n ∈ N.

3

ym

above:

Claim (1):5 Every nite-dimensional normed linear space is complete. Proof: Let X be a nite dimensional normed linear space of dimension n.

X converges X . Then each

arbitrary Cauchy sequence in

{x1 , x2 , ..., xn }

be a basis for

We must show that any

X . Let {xm } be an arbitrary Cauchy m term x has a unique representation: in

sequence in

X.

Let

xm = α1m x1 + α2m x2 + · · · + αnm xn Let

x ∈ X,

c

be the constant guaranteed by the previous lemma. That is, let

the product of

c

c

be that constant so that for any

times the sum of the absolute value of the weights in the basis representation of

(by the basis I specied in the rst paragraph above) is no greater than the norm of Since

{xm }

is a Cauchy sequence in

X,

c > 0

for any

there exists an

N ∈N

x

x.

such that, for all integers

r, m ≥ N , ||xm − xr || < c Substituting the representations for

xm

xr

and

by the basis vectors, we obtain:

|| (α1m x1 + α2m x2 + · · · + αnm xn ) − (α1r x1 + α2r x2 + · · · + αnr xn ) || < c some manipulation, and we obtain:

|| [α1m − α1r ] x1 + [α2m − α2r ] x2 + · · · + [αnm − αnr ] xn || < c By the lemma we know that the constant

c

satises:

c (|α1m − α1r | + |α2m − α2r | + · · · + |αnm − αnr |) < || [α1m − α1r ] x1 + [α2m − α2r ] x2 + · · · + [αnm − αnr ] xn || < c so that

|α1m − α1r | + |α2m − α2r | + · · · + |αnm − αnr | < c/c =  Now consider

|αim − αir |

for any

i = 1, 2, ..., n.

It is clear that this, being a term in the sum of all positive

terms of the left hand side, is at least as small as the right hand side of the above inequality. Therefore, we can say for any

i, |αim − αir | ≤ |α1m − α1r | + |α2m − α2r | + · · · + |αnm − αnr | < 

This shows that given any between the

xr

ith

>0

N so that for all m, r ≥ N the distance xm and the ith weight in the basis representation for any  > 0, ∃N ∈ N such that, for all m, r ≥ N ,

we can nd a positive integer

weight in the basis representation for

is strictly smaller than

.

That is, for each

i,

for

|αim − αir | <  Therefore each sequences of weights

{αim }∞ m=1 , i = 1, 2, 3, ..., n,

(which we called the coordinate sequences

R is complete, for each i, each such {αim } converges in R to some αi ∈ R. m Dene x = α1 x1 + α2 x2 + · · · + αn xn . Then x ∈ X . It is clear that x → x, since each of the coordinates m in the basis representation for x converges to the corresponding coordinate of x. This shows that every Cauchy sequence in X converges. Therefore, X is complete. ♦ in the lemma) are themselves Cauchy sequences of real numbers. Since sequence

5 Comment:

This argument rests on the fact that

that we've been taking our scalars from

R.

R

is complete. In order to get this into the game, we heavily rely on

Given that we can target any point in our linear space by weighting a nite basis

for that space, we can in fact think of our points as, essentially, these weights. Which is to say, our points are eectively in The argument just expresses these thoughts more formally.

4

Rn .

Claim (2):

In a nite dimensional normed linear space, a set is compact if and only if it is closed and

bounded.

Proof:

The proof that any compact set is both closed and bounded is on page 61 of Carter; I do not

repeat the argument. That the converse holds in any nite dimensional normed linear space is the result I seek to show here. Let S be a closed and bounded subset of a nite-dimensional normed linear space X . Let X have the basis {x1 , x2 , ..., xn }. Let {xm } be a sequence in S . We want to show that {xm } has a convergent subsequence m (sequential compactness). Let x = α1m x1 + α2m x2 + · · · + αnm xn denote the basis representation of xm . Since S is bounded, then it is contained in an open ball. But in a normed linear space, every open ball is merely a translation and rescaling of the unit ball. So say S is contained in the open ball x0 + rB , where r > 0 is a real number representing the scaling of the unit ball B , and x0 is the translation of the unit ball. m Then the magnitude of any vector in S can be no greater than M = ||x0 || + |r|. Therefore any term x ∈S m has ||x || ≤ M , and so by the lemma there is a constant c such that

c|αim | ≤ c (|α1m | + |α2m | + · · · + |αnm |) < ||xm || ≤ M i = 1, 2, 3, ..., n; that is, |αim | < M/c. Since M/c {αim } of the sequence {xm } is bounded.

for any sequence

is a constant, this shows that each coordinate

We are going to apply the same argument as in the lemma. Since the coordinate sequence

p(m) bounded sequence of real numbers it has a convergent subsequence {αi } that converges to the subsequence formed by selecting coordinates: p(m)

xp(m) = α1 such that

p(m)

→ α1 ,

α1

p(m)

x1 + α 2

α1 .

{αim }

is a

Construct

x2 + · · · + αnp(m) xn

and who knows what the other coordinate sequences doexcept that they are

bounded sequences of real numbers and therefore have convergent subsequences. So repeat the argument

p(m)

q(p(m))

{α2 }, so that for some function q(·), α2 → α2 . Continuing in this way we obtain m m a subsequence of {x }, call it {x∗ } whose coordinate scalars converge to the vector (α1 , ..., αn ). Let x∗ = α1 x1 + α2 x2 + · · · + αn xn . Then xm ∗ → x∗ because every coordinate in the former converges to the corresponding coordinate of the latter. Since S is closed, every convergent sequence whose terms all m come from S converges to an element of S , so x∗ ∈ S . Therefore the arbitrary sequence {x } in S has a 6 subsequence which converges in S . Hence, S is compact. ♦

for the sequence

6 Speaking

of convergence, the following conjecture arose when I was experimenting with some norms on

R2 .

I have not

proven it. Conjecture: the sequence of norms

{||x||2n }∞ n=1

converges to

||x||p =

q p

||x||∞ ,

where

xp1 + xp2 + · · · + xpn

||x||∞ = max{|x1 |, |x2 |, ..., |xn |} Comment: Looking at this conjecture a half of a year later, it seems even better to conjecture that

lim

p→∞

p p |x1 |p + |x2 |p + · · · + |xn |p = max{|x1 |, |x2 |, ..., |xn |}

Well, the CES production function looks like:

q p α1 xp1 + α2 xρ2 + · · · + αn xρn where the

xi ≥ 0

αi > 0. As p → −∞, this function converges to min{x1 , x2 , ..., xn } n = 2). But the p-norm is just this production function with p > 1.

and the

in February for the case

5

(this was proven by Dr. Elmer

Lemma (exercise 1.212):7 are positive constants

A

and

B

Let

|| · ||a

and

so that for any

|| · ||b be x ∈ X,

two norms on a given linear space

X.

Assume there

||x||a ≤ A||x||b ||x||b ≤ B||x||a A as a number that will always stretch the vector x to greater magnitude when measured b than it is when measured by norm A; and similarly for the number B .) Then the two norms || · ||a and || · ||b are topologically equivalent, by which I mean (not having studied any formal topology) that any open ball around a vector x that we construct by using norm a contains within it an open ball around x constructed using norm b, and the converse. Consequently every open set in X under || · ||a is open when X is analyzed under || · ||b , and the converse. Hence, all things dened in terms of the open sets on X , will (I'm thinking of

in norm

result in the same notions, regardless of which norm we use.

Proof:

a.

Let

Bra (x)

be an arbitrary open ball of radius

r

around the point

x∈X

constructed using norm

Since every open ball in a normed linear space is the same as a rescaling and translation of the unit open

Bra (x) = x + rB a for unit ball B a = {x| ||x||a < 1}. 1 1 b 1 b Consider A · B = {x| ||x||b < A } = {x| A||x||b < 1}. Now whenever x ∈ A B , then A||x||b < 1, and 1 b a a we know by the given equations that ||x||a ≤ A||x||b < 1, hence x ∈ B . Therefore A B ⊆ B . Then 1 b a x + r A B ⊆ x + rB . Therefore any open ball under norm a has within it an open ball under norm b. 1 b a On the other hand, if Br (x) is an arbitrary open ball around x using norm b, then B · B = {x| ||x||a < 1 1 a b b B } = {x| B||x||a < 1} ⊆ B , and so x + r B B works as a suitable open ball contained within Br (x), as desired. ♦ ball, we may write

7 Comment:

I recall once seeing the hypotheses of this lemma taken as the

sometimes phrased as:

∃C, D,

positive constants, so that

denition

of two norms being equivalent. It is

∀x ∈ X :

C||x||a ≤ ||x||b ≤ D||x||a but dividing over and breaking these inequality into two parts leads to the assumption I have made.

x, if you measure the magnitude of x using b, then, D you can stretch x by to make it at least as large as its magnitude under b, under norm a; and there is a constant quantity C you can squash x by in order to make its magnitude smaller under norm a than it is under norm b. Because C and D are constants, we can do this globally in the linear space; and so, any open ball under b will both be contained in, and contain within it, and open ball under norm a. This is enough for us to conclude that the norms are topologically equivalent, since then all the open sets under norm a will be the same under norm b, and that is all we need to dene the topological properties of X . [Note that these intuitive characterizations rest on the norm axiom: ||cx|| = |c|||x|| = c||x|| for c > 0, and cx is a rescaling of x.] Intuitively, the phrasing above leads to this characterization: given any

there is a constant quantity

6

Claim (exercise 1.213):

In a nite dimensional linear space, any two norms are topologically equivalent.

Proof:

be two arbitrary norms in a nite dimensional linear space

Let

||·||a

and

||·||b

X

with

dim X = n.

I will show that the previous lemma holds between these two norms using the lemma proven at the beginning of this paper, thus showing that these two norms are in fact topologically equivalent (as discussed). Let unique

x ∈ X be arbitrary. Let {x1 , x2 , x3 , ..., xn } be a basis for X . Then x = α1 x1 + α2 x2 + · · · + αn xn αi 's.8 We know by our initial lemma that there are constants ca and cb such that

for

||x||a ≥ ca [|α1 | + |α2 | + · · · + |αn |] ||x||b ≥ cb [|α1 | + |α2 | + · · · + |αn |] on the other hand by the triangle inequality

||x||a = ||α1 x1 + α2 x2 + · · · + αn xn ||a ≤ |α1 |||x1 ||a + |α2 |||x2 ||a + · · · + |αn |||xn ||a ≤ max {||xi ||a } 1≤i≤n

1≤i≤n

ka = max1≤i≤n {||xi ||a }

and

kb = max1≤i≤n {||xi ||b }.9

For simplicity of notation, let

|αi |

i=1

||x||b = ||α1 x1 + α2 x2 + · · · + αn xn ||b ≤ |α1 |||x1 ||b + |α2 |||x2 ||b + · · · + |αn |||xn ||b ≤ max {||xi ||b } Let

n X

n X

|αi |

i=1

Pn

i=1

|αi | = M .

Then the above four statements say:

ka M ≥ ||x||a ≥ ca M kb M ≥ ||x||b ≥ cb M By using the diagonals of these inequalities we obtain the result we need. Since the above implies:

ka M ≥ ||x||a ||x||b ≥ cb M then rearranging we obtain:

ka ||x||b ≥ ||x||a cb through similar logic we may obtain:

kb ||x||a ≥ ||x||b ca Let

ka cb

=A

and

kb ca

= B.

Then we have:

||x||a ≤ A||x||b ||x||b ≤ B||x||a these

A, B

are the constants we sought.

8 Comment:



x and basis set for X , the (1, 2) = 1(1, 0) + 2(0, 1) in R2 , regardless of the constants ca and cb are clearly not (we

A hurdle (maybe) for understanding this argument is to realize that, given any

representation-by-basis for

x

is completely unrelated to the norm on

whether our norm is Euclidean, taxicab, sup, etc.

This

X.

The point

is invariant to norm, but

xed the norm in the lemma where we obtained these constants, and we showed that they must exist for any norm, but they are denitely functions of the norm we choose).

9 Comment:

The maximum norms in the basis set plays a pivotal role in this proof. They exist because we are in

dimensional space.

7

nite

Suggest Documents