Proving Newton's Propositio Kepleriana using Geometry and ...

7 downloads 177 Views 343KB Size Report
then present in Section 4, as a case study, the proof of the Kepler Proposition. This is a key proof of the ... Euclid's proposition 29 of Book I 10], can be easily proved using the full-angle method. ...... University of California Press, 1995. 4. H.-B. Li ...
Proving Newton's Propositio Kepleriana using Geometry and Nonstandard Analysis in Isabelle Jacques D. Fleuriot and Lawrence C. Paulson Computer Laboratory { University of Cambridge New Museums Site, Pembroke Street Cambridge CB2 3QG

fjdf21,[email protected]

Abstract. The approach previously used to mechanise lemmas and Kepler's Law of Equal Areas from Newton's Principia 13] is here used

to mechanically reproduce the famous Propositio Kepleriana or Kepler Problem. This is one of the key results of the Principia in which Newton demonstrates that the centripetal force acting on a body moving in an ellipse obeys an inverse square law. As with the previous work, the mechanisation is carried out through a combination of techniques from geometry theorem proving (GTP) and Nonstandard Analysis (NSA) using the theorem prover Isabelle. This work demonstrates the challenge of reproducing mechanically Newton's reasoning and how the combination of methods works together to reveal what we believe to be aw in Newton's reasoning.

1 Introduction The reasoning of Newton's Philosophi Naturalis Principia Mathematica (the Principia 14]), as it was originally published, is a mixture of geometric and algebraic arguments together with Newton's own proof techniques. These combine to produce a complex mathematical reasoning that is used to explain the physical world. The demonstrations of Lemmas and Propositions in the Principia are, in fact, proof sketches that require a lot of work on the part of the reader for a detailed understanding. There are several reasons that make the Principia a very di cult text to master. First of all, the proofs are very involved and one requires an adequate knowledge of geometry to be able to understand many of the steps made by Newton. Secondly, Newton's exposition can be tedious and di cult to grasp in places. Many mathematicians contemporary to Newton, despite their grounding in ancient Greek geometry and familiarity with the style of the exposition, had di culties understanding Newton's mathematical reasoning. This gives an indication of the demands that a thorough study of the Principia has on the modern reader. As we mentioned in a previous paper, Newton's geometry is also notable for his use of limit or ultimate arguments in his proofs 13]. These are implicit notions of dierential calculus that are at the core of Newton's treatment and give Newton's geometry an innitesimal nature. Newton further adds motion to

his reasoning and enriches the geometry with various kinematics concepts that enable points to move towards points for example. Thus, Newton's geometry consists in studying the relations, such as ratios, between various parts of the constructed diagrams as certain of its elements tend towards limiting positions or become innitely small. In this paper, we build on the tools and techniques that we presented before 13]. In section 2, we review the geometric methods and concepts that we have formalised in this work. We also give examples of theorems proved in Isabelle using these techniques. Section 3 is a brief introduction to the concepts from NSA that we use it also outlines the innitesimal aspects of our geometry. We then present in Section 4, as a case study, the proof of the Kepler Proposition. This is a key proof of the Book 1 of the Principia and our work follows, in its steps, the analysis made by Densmore 9]. This extended case study shows our combination of techniques from geometry and NSA at work to provide a formal proof of a major proposition. The challenge inherent to the mechanisation of Newton's reasoning{ especially in an interactive environment such as Isabelle where the user guides the proof{ will become obvious as we highlight the steps and di culties encountered. Section 5 oers our comments and conclusions.

2 Geometry We use methods that are based on geometric invariants 5, 6] and high level geometry lemmas about these invariants. A particular property is ideal as an invariant if it ensures that the proofs generated are short. This enables some of the proofs to be derived automatically using the powerful tactics of Isabelle's classical reasoner. Also, the methods should be powerful enough to prove many properties without adding auxiliary points or lines. The other important aspect is to achieve diagram independence for the proofs, that is, the same proof can be applied to several diagrams.

2.1 The Signed Area Method

In this method, there are basic rules about geometric properties called signed areas. These can be used to express various geometric concepts such as collinearity (coll), parallelism (k) and so on. Moreover, the basic rules can be combined to prove more complex theorems which deal with frequently-used cases and help simplify the search process. We represent the line from point A to point B by A|B, its length by len(A ; B ), and the signed area SdeltaABC of a triangle is the usual notion of area with its sign depending on how the vertices are ordered. We follow the usual approach of having SdeltaABC as positive if A|B|C is in anti-clockwise direction and negative otherwise. Some of the rules and denitions used are:

a ; b k c ; d  (Sdelta a b c = Sdelta a b d) coll a b c =) len(a ; b)  Sdelta p b c = len(b ; c)  Sdelta p a b

We can also introduce new points using the following property and dene the signed area of a quadrilateral Squad a b c d in terms of signed areas of triangles:

a b c = Sdelta a b d + Sdelta a d c + Sdelta d b c Squad a b c d  Sdelta a b c + Sdelta a c d Sdelta

We have proved a number of theorems about the sign of Squad a b c d that depend on the ordering of the vertices, for example Squad a b c d = ;Squad a d c b. When dealing with geometry proofs, we often take for granted conditions that need to be stated explicitly for machine proofs: for example, two points making up a line should not coincide. The machine proofs are valid only if these conditions are met. These are known as non-degenerate conditions and are required in many cases to prevent the denominators of fractions from becoming zero in the various algebraic statements.

2.2 The Full-Angle Method A full-angle hu vi is the angle from line u to line v measured anti-clockwise. We dene the equivalence relation of angular equality as follows:

x =a y  9n 2 IN: jx ; yj = n and can use it to express that two lines are perpendicular

a ; b ? c ; d  ha ; b c ; di =a 2

Other properties of full-angles concern their sign and how they can be split or joined. The same rule therefore either introduces a new line or eliminates a common one from the full angles depending on the direction in which it is used.

hu vi =a ;hv ui hu vi =a hu xi + hx vi Full-angles are used instead of traditional angles because their use simplies many proofs by eliminating case-splits. Moreover, as we have already mentioned, these methods are useful to us since they relate closely to the geometric properties used by Newton 13]. They preserve the intuitive nature of his geometry and can easily be extended with innitesimal notions, as we will see shortly.

2.3 A Simple Example: Euclid I.29 Euclid's proposition 29 of Book I 10], can be easily proved using the full-angle method. The proposition states that if A ; B k C ; D and the transversal P ; Q intersects A ; B and C ; D then hA ; B P ; Qi = hC ; D P ; Qi.

D

A

Q

P

C

B

Fig. 1. Euclid Proposition I.29 We prove this theorem easily by using the rules about full-angles given in Section 2.2 and the fact the angle between two parallel lines is zero.

Proof:

A ; B k C ; D =) hA ; B C ; Di =a 0 hA ; B P ; Qi + hP ; Q C ; Di =a 0 hA ; B P ; Qi =a ;hP ; Q C ; Di hA ; B P ; Qi =a hC ; D P ; Qi:

2.4 Extending the Geometric Theory The main aim of the Principia is to investigate mathematically the motion of bodies such as planets. Thus, we need to have denitions for geometric gures such as the circle, the ellipse and their tangents. The ellipse is especially important for this work since Kepler's Problem is concerned with elliptical motion. The circle can be viewed as a special case of the ellipse where the foci coincide.

f1 f2 r  fp: jlen(f1 ; p)j + jlen(f2 ; p)j = rg circle x r  ellipse x x (2 r) arc len x a b  jlen(x ; a)j  ha ; x x ; bi e tangent (a ; b) f1 f2 E  (is ellipse f1 f2 E ^ a 2 E ^ hf1 ; a a ; bi =a hb ; a a ; f2 i) ellipse

We need to prove a number of properties relating to the ellipse such as the one stating that all parallelograms described around a given ellipse are equal to each other (Figure 2). This relationship appears (in slightly dierent wording) as Lemma 12 of the Principia where it is employed in the solution of Proposition 11, the famous Propositio Kepleriana or Kepler problem. Newton refers us to the \writers on the conics sections" for a proof of the lemma. This lemma is demonstrated in

P

P D

D

K C

C

F K

Fig. 2. Circumscribed Parallelograms Book 7, Proposition 31 in the Conics of Apollonius of Perga 1]. Of course, unlike Newton, we have to prove this result explicitly in Isabelle to make it available to any other proof that might use it.

3 Innitesimal Geometry In this section, we give a brief overview of our geometry containing innitesimals. We rst give formal denitions for the various types of numbers that exist and which can be used to describe geometric quantities.

3.1 The Nonstandard Universe IR De nition 1. In an ordered eld extension IR IR, an element x 2 IR is said to be an innitesimal if jxj < r for all positive r 2 IR nite if jxj < r for some r 2 IR innite if jxj > r for all r 2 IR. The extended, richer number system IR is known as the hyperreals. It has been developed in Isabelle through purely denitional means using an ultrapower construction. We will not give more details of this substantial construction in the present paper so as to concentrate on the geometric aspects only. De nition 2. x y 2 IR are said to be innitely close, x y if jx ; yj is innitesimal. This is an important equivalence relation that will enable us to reason about innitesimal quantities. For example, we can formalise the notion of two points coinciding by saying that the distance between them is innitely close to zero. Two geometric quantities that become ultimately equal can also be modelled using it. The relation and its properties are used to formalise ultimate situations that might be considered degenerate by ordinary GTP methods 13]. Using the relation, we can also dene the concept of two full-angles being innitely close:

a1 a a2  9n 2 IN: ja1 ; a2 j n

Various other new properties, such as ultimately similar and ultimately congruent triangles, can then be dened as we showed previously 13]. These are then used to prove various Lemmas that are needed to demonstrate the important Propositions of the Principia.

4 An Overview of the Proof This is Proposition 11 of Book 1 of the Principia. This Proposition is important for both mathematical and historical reasons as it lays the foundations for Kepler's rst law of Gravitation. It provides the mathematical analysis that could explain and conrm Kepler's guess that planets travelled in ellipses round the sun 15]. The proof of this proposition will be studied in detail as it gives a good overview of the mixture of geometry, algebra and limit reasoning that is so characteristic of Newton's Principia. It also gives an idea of the depth and amount of mathematical expertise involved in Newton's proof. The proof that Newton describes, though relatively short on paper, becomes a major demonstration when expanded and reproduced using Isabelle. The elegance of many of the constructions, which could be glossed over, are revealed through the detailed analysis. We give formal justications of the steps made by Newton in ultimate situations through our formal and logical use of innitesimals. Innitesimal reasoning is notorious for leading to contradictions. However, nonstandard analysis is generally believed to be consistent and hence ensure that our mechanisation is rigorous. We will give the enunciation of the Proposition and the proof (sketch) provided by Newton. We will then expand on the sketch and provide detailed proofs of the steps that are made by Newton. This will require the use of the rules from the geometric and NSA theories developed in Isabelle.

4.1 Proposition 11 and Newton's Proof Proposition 11 is in fact stated as a problem by Newton at the start of Section 3 of the Principia. This section deals with \the motion of bodies in eccentric conic section ". Particular orbits and laws governing forces that are relevant to the universe are investigated. The mathematical tools are developed for later use in Book III of the Principia when natural phenomena of our world are investigated. Our task consists in expressing Newton's result as a goal which is then proved. Figure 3 shows Newton's original diagram used for this Proposition. Proposition 11 If a body revolves in an ellipse it is required to nd the law of the centripetal force tending to the focus of the ellipse

Newton's Proof:

Let S be the focus of the ellipse. Draw SP cutting the diameter DK of the ellipse in E , and the ordinate Qv in x and complete the parallelogram QxPR. It is evident that EP is equal to the greater semiaxis AC : for drawing HI from the other focus H of the ellipse parallel to EC , because CS , CH are equal, ES ,

Fig. 3. Newton's Original Diagram for Proposition 11 EI will be also equal so that EP is the half-sum of PS , PI , that is (because of the parallels HI , PR, and the equal angles IPR, HPZ ), of PS , PH , which taken together are equal to the whole axis 2AC . Draw QT perpendicular to SP , 2 and putting L for the principal latus rectum of the ellipse (or for 2BC ), we shall AC have

L QR : L Pv = QR : Pv = PE : PC = AC : PC , also, L Pv : Gv Pv = L : Gv, and, Gv Pv : Qv2 = PC 2 : CD2 By Corollary 2, Lemma 7, when the points P and Q coincide, Qv2 = Qx2 , and Qx2 or Qv2 : QT 2 = EP 2 : PF 2 = CA2 : PF 2 , and (by Lemma 12) = CD2 : CB 2 . Multiplying together corresponding terms of the four proportions, and by simplifying, we shall have

L QR : QT 2 = AC L PC 2 CD2 : PC Gv CD2 CB 2 = 2PC : Gv, since AC L = 2BC 2 . But the points Q and P coinciding, 2PC and Gv are 2 , proportional to these, will equal. And therefore the quantities L QR and QT 2 2 also be equal. Let those equals be multiplied by SP QR , and L SP will become equal 2 2 QT to SPQR . And therefore (by Corollary 1 and 5, Proposition 6) the centripetal force is inversely as L SP 2 , that is, inversely as the square of the distance SP . Q.E.I. Newton's derivation concludes that the centripetal force, for a body moving in an ellipse, is inversely proportional to the square of the distance.

Our proof proceeds in several steps where we set up various relationships that we will need for the conclusion. This involves proving Newton's intermediate results and storing them as intermediate theorems (we avoid calling them lemmas so as not to confuse them with Newton's own Lemmas).

4.2 A Geometric Representation for the Force

An investigation of the Proposition and Newton's result indicates that our goal is to prove that 9 k 2 IR: force k  SP1 2 (i.e. force / SP1 2 ultimately). We now demonstrate through a combination of geometric and innitesimal procedures how to prove the theorem. Our combination of methods was previously used to prove Kepler's Law of Equal Areas 13]. This is an important result which states that a body moving under the inuence of a centripetal force describes equal areas in equal times. Using this result we can now derive a completely geometric representation for the force acting on the orbiting body. R P

Q T Force S

Fig. 4. Diagram for Geometric Representation of Force Consider Figure 4 in which a point P is moving along an arc of nite curvature under the inuence of a centripetal force acting towards S . Let Q be a point innitely close to P , that is, the length of the arc from P to Q is innitesimal. QR, parallel to SP , represents the displacement from the rectilinear motion (along the tangent) due to the force acting on P . QT is the perpendicular dropped to SP . From Newton's Lemma 10, Corollary 3, we have that displacement \in the very beginning of motion" is proportional to the force and the square of the time, and hence (for some real proportionality constant k1 ) that force k  len(Q ; R) (1) 1

Time2

Since the distance between P and Q is innitesimal, the angle hP ; S S ; Qi is innitely small, and hence the area of the sector SPQ (Sarc S P Q) is innitely close to that of the triangle SPQ:

hP ; S S ; Qi a 0 =) Sarc S P Q Sdelta S P Q =) Sarc S P Q 1=2  len(Q ; T)  len(S ; P)

(2)

From Kepler's Law of Equal Areas, we can replace Time by Sarc SPQ 13] and, hence, using (1) and (2), we have the following geometric representation for the force (for some new proportionality constant k) (Q ; R) force k  len(Q ;len (3) T)2  len(S ; P)2 This is a general result (Proposition 6 of the Principia ) that applies to any motion along an arc under the inuence of a central force. We justify the use of a circular arc for the general arc by the fact that it is possible to construct a circle at the point P that represents the best approximation to the curvature there. This circle, sometimes known as the osculating circle1 , has the same rst and second derivative as the curve at the given point P . Thus, the osculating circle has the same curvature and tangent at P as the general curve and an innitesimal arc will also be same. We refer the reader to Brackenridge for more details on the technique 2, 3]. With this result set up, to prove the Kepler Problem, we need to show that the ratio involving the innitesimal quantities QR and QT is equal or innitely close to some constant (nite) quantity. Thus, the proof of Proposition 11 involves, in essence, eliminating the innitesimals from relation (3) above. This relation is transformed using the geometry of the ellipse to one involving only macroscopic (i.e. non-innitesimals) aspects of the orbit. We show next how the various GTP and NSA techniques are applied to the analysis of an elliptical orbit to determine the nature of the centripetal force.

4.3 Expanding Newton's Proof A detailed account of our mechanisation of Newton's argument for Proposition 11 would take several pages since the proof sketch given by Newton is complex and we would have to present a large number of derivations. We will highlight the main results that were proved and, in some cases, details of the properties that needed to be set up rst. We will also mention the constraints that needed to be satised within our framework before the various ratios that were proved could be combined. Our mechanisation was broken down into several steps that roughly followed from Newton's original proof. The main results that are set up are as follows (see Fig. 3):

{ len(E ; P) = len(A ; C) { len(A ; C)=len(P ; C) = L  len(Q ; R)=L  len(P ; v) { L  len(P ; v)=(len(G ; v)  len(P ; v)) = L=len(G ; v) { len(G ; v)  len(P ; v)=len(Q ; v)2 = len(P ; C)2=len(C ; D)2 { len(Q ; v)2=len(Q ; T)2 len(C ; D)2=len(C ; B)2 Step 1: Proving len(E ; P) = len(A ; C) 1

from the Latin osculare meaning to kiss{ the term was rst used by Leibniz

This result shows that the length of EP is independent of P and Newton's proof uses several properties of the ellipse. We will give a rather detailed overview of this particular proof as it gives an idea of the amount of work involved in mechanising Newton's geometric reasoning. Moreover, the reader can then compare Newton's proof style and prose with our own proof and see the GTP methods we have formalised in action. R D

P I

Z

E S G

A C

H K

Fig. 5. Construction for Step 1 of Proposition 11 In Figure 5, the following holds { C is the centre of the ellipse with S and H the foci { P is a point of the curve { RZ is the tangent at P { the conjugate diameter D ; K k P ; Z { P ; S intersects D ; K at E { H ; I k E ; C and H ; I intersects P ; S at I Since H ; I k E ; C, the following theorem holds, H ; I k E ; C =) Sdelta C E I = Sdelta C E H (4) But the foci are collinear with and (by Apollonius III.45 1]) equidistant from the centre of the ellipse so the following can be derived using the signed-area method, coll S C H =) len(S ; C)  Sdelta C E H = len(C ; H)  Sdelta C S E =) Sdelta C E H = Sdelta C S E (5) Also, points S , E and I are collinear and therefore combining with (4) and (5) above, we verify Newton's \ES ,EI will also be equal" coll S E I =) len(S ; E)  Sdelta C E I = len(E ; I)  Sdelta C S E =) len(S ; E) = len(E ; I) (6)

Next, the following derivations can be made, with the help of the last result proving Newton's \EP is the half-sum of PS , PI " coll E I P =) len(E ; P) = len(E ; I) + len(I ; P) =) len(E ; P) = len(S ; E) + len(I ; P) =) 2  len(E ; P) = len(E ; P) + len(S ; E) + len(I ; P) =) 2  len(E ; P) = len(S ; P) + len(I ; P) (7) =) len(E ; P) = len(S ; P) + len(I ; P) 2

Note the use of the following theorem in the derivation above coll S E P =) len(S ; E) + len(E ; P) = len(S ; P) Next, Newton argues that in fact (7) can be written as len(S ; P) + len(H ; P) len(E ; P) = 2

(8)

So, a proof of len(I ; P) = len(H ; P) is needed to progress further. This will follow if it can be shown that 4PHI is an isosceles, that is hP ; H H ; Ii = hH ; I I ; Pi (9) To prove (9), both H ; I k P ; Z and H ; I k P ; R are derived rst using H ; I k E ; C ^ E ; C k P ; Z =) H ; I k P ; Z (10) H ; I k P ; Z ^ coll P Z R =) H ; I k P ; R (11) From (10), (11), and the proof of Euclid I.29 given in Section 2.3 H ; I k P ; Z =) hP ; H H ; Ii = hH ; P P ; Zi H ; I k P ; R =) hH ; I I ; Pi = hR ; P P ; Ii =) hH ; I I ; Pi = hR ; P P ; Si

(12) (13)

From the denition of the tangent to an ellipse and the collinearity of P , I , and S (also recall that full-angles are angles between lines rather than rays and are measured anti-clockwise), e tangent (P

; Z) S H Ellipse =) hH ; P P ; Zi = hR ; P P ; Ii =) hH ; P P ; Zi = hR ; P P ; Si (14)

From (12), (13) and (14), the following is deduced as required

hP ; H H ; Ii = hH ; I I ; Pi Thus, we have len(I ; P) = len(H ; P) (Euclid I.6 10]), and hence (8) is proved, that is, Newton's assertion that \EP is the half sum of] PS PH ". Next, it follows from the denition of an ellipse that the sum of len(S ; P) and len(P ; H) is equal to the length of the major axis, that is, P 2 Ellipse =) len(S ; P) + len(P ; H) = 2  len(A ; C) (15) From (15) and (8), we can nally derive the property that Newton states as being evident: \EP is equal to the greater semiaxis AC " len(E

; P) = len(A ; C)

(16)

The rst step has shown Newton's geometric reasoning in action. For the next steps, as the various ratios are derived, we will not always show the detailed derivations of the geometric theorems. We will concentrate on the setting up of the proportions and how everything is put together to get the nal result. We will state Newton's Lemmas when they are used and theorems about innitesimals that we use. QR PE AC Step 2: Showing LLQR Pv = Pv = PC = PC R Q D

P x E S

G

T C

Z

v A

H K

Fig. 6. Construction for Steps 2|4 of Proposition 11

{ { { {

In Figure 6, in addition to properties already mentioned, the following holds

QT ? SP QxPR is a parallelogram Q, x, and v are collinear Q is innitely close to P

It is easily proved that v ; x k C ; E and so the following theorem follows v

; x k C ; E =) hP ; v v ; xi = hP ; C C ; Ei

(17)

From (17) and the fact that 4Pvx and 4PCE share P as a common vertex, it follows that they are similar. Also, since QxPR is a parallelogram, we have len(Q ; R) = len(P ; x). Thus, the following derivations follow len(P ; x) len(Q ; R) len(A ; C) len(P ; E) = = = ) len (P ; C) len(P ; v) len(P ; v) len(P ; C) (18)

SIM P V x P C E =

One of the substitution used in (18) follows from (16) proved in the previous step. The equations above verify Newton's ratios. L Step 3: Showing GvLPv Pv = Gv The proof of the ratio L  len(P ; v) = len(GL ; v) (19) len(G ; v)  len(P ; v) is trivial and we will not expand on it. We only note that the constant L is known as the latus rectum 2 of the ellipse at A. PC 22 Step 4: Showing GvQvPv2 = CD By Apollonius I.21 1], if the lines DC and Qv are dropped ordinatewise to the diameter PG, the squares on them DC 2 and Qv2 will be to each other as the areas contained by the straight lines cut o GC , CP , and Gv, vP on diameter PG. Algebraically, we proved the following property of the ellipse,

; C)2 = len(G ; C)  len(C ; P) len(Q ; v)2 len(G ; v)  len(P ; v) (P ; C)2 = len(G len ; v)  len(P ; v) len(D

Rearranging the terms, we get the required ratio, len(G

Step 5: Showing

Qv2 QT 2

; v)  len(P ; v) = len(P ; C)2 len(Q ; v)2 len(D ; C)2

2 CD CB 2 and intermediate ratios

In Figure 7, we have the additional property, 2 The latus rectum is dened as L = 2  len(B ; C)2 =len(A ; C)

(20)

R Q

D

E S

G

B

P x

T

Z

v C

A

H F K

Fig. 7. Construction for Step 5 of Proposition 11

{ PF ? DK Again, it can be easily proved that Qx k EF. The following theorem then follows from Euclid I.29 as given in Section 2.3 Q

; x k E ; F =) hQ ; x x ; Ei = hF ; E E ; xi =) hQ ; x x ; Ti = hF ; E E ; Pi

(21)

Since hP ; F F ; E i = hx ; T T ; Qi = =2 and (21), it follows that 4PEF and 4QxT are similar. The next theorems (using (16) where needed) then hold and verify Newton's intermediate results for the current Step. (Q ; x)2 = len(P ; E)2 = len(C ; A)2 ) len len(Q ; T)2 len(P ; F)2 len(P ; F)2

SIM P E F Q x T =

(22)

Newton's Lemma 12 (See Figure 2) is now needed for the next result. According to the Lemma, the parallelogram circumscribed about DK and PG is equal to the parallelogram circumscribed about the major and minor axes of the ellipse. Thence, we have the following theorem len(C

; A)  len(C ; B) = len(C ; D)  len(P ; F)

(23)

Rearranging (23), we have len(C ; A)=len(P ; F) = len(C ; D)=len(C ; B) and substituting in (22), leads to

; x)2 = len(C ; D)2 len(Q ; T)2 len(C ; B)2 len(Q

(24)

By Newton's Lemma 7, Corollary 2, when the distance between Q and P becomes innitesimal as they coincide, we have the following result 13]: len(Q ; v) 1 (25) len(Q ; x) Now, to reach the nal result for this step, we need to substitute len(Q ; v) for len(Q ; x) in (22). However, we cannot simply carry out the substitution even though the quantities are innitely close. Indeed, one has to be careful when multiplying the quantities on both sides of the relation because they might no longer be innitely close after the multiplication. Consider, the nonzero innitesimal ,

2 but  1= 6 2  1= It is possible, however, to multiply two innitely close quantities by any nite quantity the results are still innitely close. This follows from the theorem:

x y ^ u 2 Finite =) x  u y  u

(26)

Now, assuming that len(C ; D) and len(C ; B) are both nite but not innitesimal (for example, len(C ; D) len(C ; B) 2 IR), then len(C ; D)=len(C ; B) is Finite. Hence, the ratio of innitesimals len(Q ; x)=len(Q ; T) is Finite. Therefore, from (25), (26), and using (24) the following theorem is derived: len(C ; D)2 len(Q ; v)2 (27) len(Q ; T)2 len(C ; B)2 This gives the result that we wanted for the fth step of the proof of Proposition 11. We are now ready for putting all the various results together in the next and nal step. This will then conclude the formal proof of the Proposition.

Step 6: Putting the ratios together

Combining (20) and (27), with the help of theorem (26) and some algebra yields,

; v)2 len(C ; D)2 ^ len(G ; v)  len(P ; v) 2 Finite len(Q ; T)2 len(C ; B)2 len(Q ; v)2 len(P ; C)2 v)  len(P ; v) (28) =) len(G ; len(Q ; T)2 len(C ; B)2 len(Q

which is combined with (19) to derive the next relation between ratios. The reader can check that both sides of the relation are multiplied by nite quantities ensuring the results are innitely close:

L

 len(P ; v) len(P ; C)2  L 2 len(Q ; T) len(C ; B)2  len(G ; v)

(29)

The next task is to combine the last result (29) with (18) to yield the following ratio which is equivalent to Newton's \L  QR : QT 2 = AC  L  PC 2  CD2 : PC  Gv  CD2  CB 2 " len(P ; C)  L  len(A ; C) L  len(Q ; R) (30) 2 len(Q ; T) len(C ; B)2  len(G ; v) But, we know that L = 2  len(B ; C)2 =len(A ; C), so (30) can be further simplied to give Newton's other ratio \L  QR : QT 2 = 2PC : Gv"

 len(Q ; R) 2  len(P ; C) (31) len(Q ; T)2 len(G ; v) Once these ratios have been derived, Newton says \But the points Q and P coinciding, 2PC and Gv are equal. And therefore the quantities L QR and QT 2, proportional to these are also equal." We formalise this by showing that len(P ; v) 0 as the distance between Q and P becomes innitesimal thus, it follows that 2  len(P ; C)=len(G ; v) 1 and so, using (31) and the transitivity of , we have the result L  len(Q ; R) 1 (32) len(Q ; T)2 The nal step in Newton's derivation is \Let those equals be multiplied 2 2 will become equal to SP 2 QT 2 ". This nal ratio gives by SP and L SP QR QR L

the geometric representation for the force, as we showed in Section 4.2, and hence enables Newton to deduce immediately that the centripetal force obeys an inverse square law. We would like to derive Newton's result in the same way, but remark that len(S ; P) 2 Finite ; Infinitesimal ^ len(Q ; R) 2 Infinitesimal (S ; P)2 2 Infinite (33) =) len len(Q ; R) as Q and P become coincident. So, there seems to be a problem with simply multiplying (32) by Newton's ratio SP 2=QR since we cannot ensure that the results are innitely close. Our formal framework forbids the multiplication that Newton does as the result is not necessarily a theorem! Therefore, we need to nd an alternative way of arriving at the same result as Newton. Recall from Section 4.2, that we have proved the following geometric representation for the centripetal force: force k 

; R) 1 ; T)2  len(S ; P)2

len(Q

len(Q

(34)

Now from (32), we can deduce that since L 2 Finite ; Infinitesimal, the following theorems hold

; R) ; T)2 2 Finite ; Infinitesimal len(Q ; T)2 L len(Q ; R) len(Q ; T)2 2 Finite len(Q ; R) len(Q

len(Q

(35) (36) (37)

Since (35) holds and 1=len(S ; P)2 2 Finite, it follows that force 2 Finite and so we can now use the following theorem about the product of nite, innitely close quantities

a b ^ c d ^ a 2 Finite ^ c 2 Finite =) a  c b  d with (34), (36), and (37) to yield force  L k 

; R)  1 len(Q ; T)2  2 2 len(Q ; T) len(S ; P) len(Q ; R) len(Q

k  len(S1; P)2

(38)

Note that we also used the fact that is symmetric in the derivation above. Finally from (38), we get to the celebrated result since L is nite (real) and constant for a given ellipse, force



force /ultimate

k L

1

len(S 1

len(S

; P)2

; P)2

(39)

5 Final Comments We would like to conclude by mentioning some important aspects of this mechanisation and possible changes to the geometry theory that could improve automation. We also briey review what we have achieved.

5.1 On Finite Geometric Witnesses We have made an important remark about steps involving innitesimals, ratios of innitesimals and the innitely close relation. Whenever we are dealing with such ratios, care needs to be exercised as we cannot be sure what the result of dividing two innitesimals is: it can be innitesimal, nite or innite. We notice, when carrying out our formalisation, that whenever Newton is manipulating the ratio of vanishing quantities, he usually makes sure that this can be expressed in terms of some nite quantity as in the proof for Step 5 of Section 4.3. Thus, the ratio of innitesimals is shown to be innitely close or even equal to some nite quantity. This ensures that such a nite ratio can be used safely and soundly within our framework. The importance of setting up such nite geometric witnesses cannot be under-stated since the rigour of NSA might prevent steps involving ratios of innitesimals from being carried out otherwise. Indeed, we have seen that the lack of a nite witness in the last step of Newton's original argument prevents us from deriving the nal result in the same way as he does. The alternative way we went about deriving the result, however, is sound and follows from rules that have been proved within our framework.

5.2 Further Work In our previous work 13], we mentioned the existence of other methods, such as the Cliord algebra, that provide short and readable proofs 4, 11]. Although these algebraic techniques are more di cult to relate with the geometric concepts that are actually used in Newton's reasoning, interesting work done by Wang et al. has come to our attention in which powerful sets of rewrite rules have been derived to carry out proofs in Euclidean geometry 12, 16]. It would be interesting to see how these could be integrated with Isabelle's powerful simplier to provide a greater degree of automation in some of our proofs. In a sense, such an approach would match in the level of details some of the results that Newton states (as obvious) and does not prove in depth. As an interesting observation, it is worth noting that the Kepler Problem can be proved, or even discovered, using algebraic computations. This has been demonstrated through the work on mechanics done by Wu 17], and also by Chou and Gao 7, 8], in the early nineties.

5.3 Conclusions We have described in detail the machine proof of Proposition 11 of the Principia and shown how the theories developed in Isabelle can be used to derive Newton's geometric representations for physical concepts. We have used the same combination of geometry and NSA rules introduced in our previous work to conrm, through a study of one of the most important Propositions of the Principia, that Newton's geometric and ultimate procedures can be cast within the rigour of our formal framework. The discovery of a step in Newton's reasoning that could not be justied formally{ in contrast with other ones where Newton explicitly sets

up nite witnesses{ is an important one. The alternative derivation presented in this work shows how to use our rules to deduce the same result soundly. Once again, the mechanisation of results from the Principia has been an interesting and challenging exercise. Newton's original reasoning, though complex and often hard to follow, displays the impressive deductive power of geometry. The addition of innitesimal notions results in a richer, more powerful geometry in which new properties can emerge in ultimate situation. Moreover, we now have new, powerful tools to study the model built on Newton's exposition of the physical world.

Acknowledgements Support from the ORS and Cambridge Commonwealth Trusts is gratefully acknowledged. This research was also funded by EPSRC grant GR/K57381 `Mechanizing Temporal Reasoning'. We also wish to thank Data Connection for supporting us with a Postgraduate Travel Award.

References 1. Apollonius. Conics. Translation by R. Catesby Taliaferro. In Great Books of the Western World, Vol. 11, Chicago, Encyclopedia Britannica (1939) 2. J. B. Brackenridge. Newton's mature dynamics and the Principia: A simplied solution to the Kepler Problem. Historia Mathematica 16 (1989), 36|45. 3. J. B. Brackenridge. The Key to Newton's Dynamics: The Kepler Problem and Newton's Principia. University of California Press, 1995. 4. H.-B. Li and M.-T. Cheng. Cliord algebraic reduction method for mechanical theorem proving in dierential geometry. J. Automated Reasoning 21 (1998), 1{ 21. 5. S. C. Chou, X. S. Gao, and J. Z. Zhang. Automated generation of readable proofs with geometric invariants, I. Multiple and shortest proof generation. J. Automated Reasoning 17 (1996), 325{347. 6. S. C. Chou, X. S. Gao, and J. Z. Zhang. Automated generation of readable proofs with geometric invariants, II. Theorem proving with full-angles. J. Automated Reasoning 17 (1996), 349{370. 7. S. C. Chou, X. S. Gao. Automated reasoning in dierential geometry and mechanics: II. Mechanical theorem proving. J. Automated Reasoning 10 (1993), 173{189. 8. S. C. Chou, X. S. Gao. Automated reasoning in dierential geometry and mechanics: III. Mechanical formula derivation. IFIP Transaction on Automated Reasoning 1{12, North-Holland, 1993. 9. D. Densmore. Newton's Principia: The Central Argument. Green Lion Press, Santa Fe, New Mexico, 1996. 10. Euclid. Elements. Translation by Thomas L. Heath. Dover Publications Inc., 1956. 11. D. Wang. Geometry Machines: From AI to SMC. 3rd International Conference on Articial Intelligence and Symbolic Mathematical Computation. (Stey, Austria, September 1996), LNCS 1138, 213{239. 12. S. Fevre and D. Wang. Proving geometric theorems using Cliord algebra and rewrite rules. Proceedings of the 15th International Conference on Automated Deduction (CADE-15) (Lindau, Germany, July 5-10, 1998), LNAI 1421, 17{32.

13. J. D. Fleuriot and L. C. Paulson. A combination of nonstandard analysis and geometry theorem proving, with application to Newton's Principia. Proceedings of the 15th International Conference on Automated Deduction (CADE-15) (Lindau, Germany, July 5-10, 1998), LNAI 1421, 3{16. 14. I. Newton. The Mathematical Principles of Natural Philosophy. Third edition, 1726. Translation by A. Motte (1729). Revised by F. Cajory 1934. University of California Press. 15. D. T. Whiteside. The mathematical principles underlying Newton's Principia Mathematica. Glasgow University Publication 138, 1970. 16. D. Wang. Cliord algebraic calculus for geometric reasoning with application to computer vision. Automated Deduction in Geometry (D. Wang et al., Eds.), LNAI 1360, Springer-Verlag, Berlin Heidelberg, 1997, 115{140. 17. W.-t. Wu. Mechanical theorem proving of dierential geometries and some of its applications in mechanics. J. Automated Reasoning 7 (1991), 171{191.

Suggest Documents