PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY Volume 124, Number 5, May 1996
IDENTIFYING CONGRUENCE SUBGROUPS OF THE MODULAR GROUP TIM HSU (Communicated by Ronald M. Solomon)
Abstract. We exhibit a simple test (Theorem 2.4) for determining if a given (classical) modular subgroup is a congruence subgroup, and give a detailed description of its implementation (Theorem 3.1). In an appendix, we also describe a more “invariant” and arithmetic congruence test.
1. Notation We describe (conjugacy classes of) subgroups Γ ⊂ PSL2 (Z) in terms of permutation representations of PSL2 (Z), following Millington [11, 12] and Atkin and Swinnerton-Dyer [1]. We recall that a conjugacy class of subgroups of PSL2 (Z) is equivalent to a transitive permutation represention of PSL2 (Z). Such a representation can be defined by transitive permutations E and V which satisfy the relations 1 = E2 = V 3.
(1.1) The relations (1.1) are fulfilled by 0 1 (1.2) E= , −1 0
V =
1 −1
1 . 0
Alternately, such a representation can be defined by transitive permutations L and R which satisfy (1.3)
1 = (LR−1 L)2 = (R−1 L)3 ,
with the relations being fulfilled by 1 1 (1.4) L= , 0 1
1 0 R= . 1 1
One can also use permuations E and L such that (1.5)
1 = E 2 = (L−1 E)3 ,
with E and L corresponding to the indicated matrices in (1.2) and (1.4), respectively. Received by the editors September 1, 1994. 1991 Mathematics Subject Classification. Primary 20H05; Secondary 20F05. Key words and phrases. Congruence subgroups, classical modular group. The author was supported by an NSF graduate fellowship and DOE GAANN grant #P200A10022.A03. c
1996 American Mathematical Society
1351
1352
T. HSU
The various notations can be translated using the following conversion table: (1.6)
E = LR−1 L,
(1.7) (1.8)
L = EV −1 ,
V = R−1 L, R = EV −2 , R=E
−1
L
−1
E.
Example 1.1. The permutations (1.9)
E = (1 2)(3 4)(5 6)(7 8)(9 10), V = (1 3 5)(2 7 4)(6 8 9),
or, alternately, (1.10)
L = (1 4)(2 5 9 10 8)(3 7 6), R = (1 7 9 10 6)(2 3)(4 5 8),
describe a conjugacy class of subgroups of index 10 in PSL2 (Z). Remark 1.2. Note that any concrete method of specifying a modular subgroup can easily be converted to permutation form. For instance, one way in which a modular subgroup Γ might be specified is by a list of generators. Such a list can be converted into permutations as follows: First, use the Euclidean algorithm to express each generator matrix as a product of L’s and R’s, where L and R are the elements in (1.4). Then enumerate the cosets of Γ in terms of these generators and presentation (1.3). This coset enumeration is easily converted into appropriate permutations L and R. Similarly, any reasonable membership test for Γ can be used to enumerate the cosets of Γ, with the same results as before. 2. Congruence subgroups and the level We recall the following definitions. Definition 2.1. Γ(N ) is defined to be the group (2.1)
{γ ∈ PSL2 (Z) | γ ≡ ±I
(mod N )}.
Γ(N ) is the kernel of the natural projection from PSL2 (Z) to SL2 (Z/N )/{±I}. We say that a modular subgroup Γ is a congruence subgroup if Γ contains Γ(N ) for some integer N . Otherwise, we say Γ is a non-congruence subgroup. An important invariant of (conjugacy classes of) modular subgroups is the following. Definition 2.2. The level of a modular subgroup Γ, as specified by permutations L and R, is defined to be the order of L (or the order of R, since L is conjugate to R−1 ). We need the following result, sometimes known as Wohlfahrt’s Theorem (Wohlfahrt [13]). Theorem 2.3. Let N be the level of a modular subgroup Γ. Γ is a congruence subgroup if and only if it contains Γ(N ). Proof. This amounts to proving that, for congruence subgroups, our definition of the level is the same as the classical definition of the level. See Wohlfahrt [13].
IDENTIFYING CONGRUENCE SUBGROUPS
1353
Theorem 2.4. Let Γ be a modular subgroup of level N , and let hL, R |r1 , r2 , . . . i
(2.2)
be a presentation for SL2 (Z/N )/{±I} which is compatible with (1.4). Then Γ is a congruence subgroup if and only if the representation of PSL2 (Z) induced by Γ respects the relations {ri }. Proof. From Theorem 2.3, we only need to check if Γ contains Γ(N ). Now, since Γ(N ) is normal in PSL2 (Z), Γ contains Γ(N ) if and only if the normal kernel of Γ contains Γ(N ). However, the normal kernel of Γ is exactly the kernel of the representation induced by Γ, and since the relations {ri } generate Γ(N ) as their normal closure, the theorem follows. Compare Magnus [9, Ch. III], Britto [4], Wohlfahrt [13], and Larcher [8]. Lang, Lim, and Tan [7] have also developed a congruence test; see the related paper Chan, Lang, Lim, and Tan [5]. Example 2.5. Suppose Γ is the conjugacy class of subgroups specified by (1.10). Since L has order 30, we need to use a presentation for SL2 (Z/30)/{±I}. We find that SL2 (Z/30)/{±I} has a presentation with defining relations (2.3)
1 = L30 ,
(2.4)
1 = [L2 , R15 ] = [L3 , R10 ] = [L5 , R6 ]
in addition to the relations in (1.3). (The commutator [x, y] is defined to be x−1 y −1 xy, so 1 = [x, y] means “x commutes with y”.) Only the commutator relations (2.4) need to be checked. However, (2.5)
L2 = (2 9 8 5 10)(3 6 7),
which does not commute with (2.6)
R15 = (2 3),
so Γ is a non-congruence subgroup. (It is worth mentioning that Larcher’s results also imply that Γ is non-congruence, since L does not contain a 30-cycle.) Remark 2.6. The results in this section extend essentially verbatim to the Bianchi groups √ SL2 (Od ), where Od is the ring of algebraic integers of an imaginary quadratic field Q −d with class number 1. (See Fine [6] for more on the Bianchi groups.) However, for practical use, one needs a uniform presentation of SL2 (Od ) /A for A any ideal of Od . 3. Implementation To assure the reader that the procedure described by Theorem 2.4 is practical, we provide the following detailed algorithm. Suppose we are given a subgroup Γ of finite index in PSL2 (Z). 1. Describe Γ in terms of permutations L and R. If necessary, use conversion (1.7), conversion (1.8), or another similar conversion. (See also Remark 1.2.) 2. Let N be the order of L, and let N = em, where e is a power of 2 and m is odd. 3. We have three cases: (a) N is odd: Γ is a congruence subgroup if and only if the relation (A)
1 = (R2 L− 2 )3 1
1354
T. HSU
is satisfied, where 12 is the multiplicative inverse of 2 mod N . 1 (b) N is a power of 2: Let S = L20 R 5 L−4 R−1 , where 15 is the multiplicative inverse of 5 mod N . Γ is a congruence subgroup if and only if the relations (LR−1 L)−1 S(LR−1 L) = S −1 , S −1 RS = R25 ,
(B)
1 = (SR5 LR−1 L)3 are satisfied. (c) Both e and m are greater than 1: (i) Let 12 be the multiplicative inverse of 2 mod m, and let 15 be the multiplicative inverse of 5 mod e. (ii) Let c be the unique integer mod N such that c ≡ 0 (mod e) and c ≡ 1 (mod m), and let d be the unique integer mod N such that d ≡ 0 (mod m) and d ≡ 1 (mod e). 1 (iii) Let a = Lc , b = Rc , l = Ld , r = Rd , and let s = l20 r 5 l−4 r−1 . (iv) Γ is a congruence subgroup if and only if the relations 1 = [a, r], 1 = (ab−1 a)4 , (ab−1 a)2 = (b−1 a)3 , (ab−1 a)2 = (b2 a− 2 )3 , 1
(C)
(lr−1 l)−1 s(lr−1 l) = s−1 , s−1 rs = r25 , (lr−1 l)2 = (sr5 lr−1 l)3 are satisfied. Theorem 3.1. The above procedure determines if Γ is a congruence subgroup. Before proving Theorem 3.1, we need an algebraic trick (Lemma 3.2) and some known results (Lemma 3.3, due to Behr and Mennicke [2]; and Lemma 3.4, due to Mennicke [10]). Lemma 3.2 (Braid trick). Let x and y be elements which generate a group G and satisfy the relation (3.1)
(xyx)2 = (yx)3 .
Then the element (xyx)2 = (yx)3 is central in G. Furthermore, (3.2)
xyx = yxy
and (3.3)
(xyx)−1 x(xyx) = y.
We call this the “braid trick” because (3.2) is the defining relation for the 3-string braid group. Proof. The elements X = xyx and Y = yx also generate G, and the element Z = (xyx)2 = (yx)3 = X 2 = Y 3 commutes with both X and Y , so Z is central. (3.2) and (3.3) follow from cancellation in xyxxyx = yxyxyx.
IDENTIFYING CONGRUENCE SUBGROUPS
Lemma 3.3. Let m be an odd integer, and let mod m. SL2 (Z/m) is isomorphic to D G = a, b
1 2
1355
be the multiplicative inverse of 2
(3.4)
1 = am ,
(3.5)
1 = (ab−1 a)4 ,
(3.6)
(ab−1 a)2 = (b−1 a)3 ,
E 1 (ab−1 a)2 = (b2 a− 2 )3 . 1 1 1 Relations (3.4)–(3.7) are fulfilled by a = and b = 0 1 1 (3.7)
0 in SL2 (Z/m). 1
Proof. G is equivalent to Behr and Mennicke’s presentation [2, (2.12)] by the following Tietze transformations. Add generators A = b and B = ab−1 a. Applying the braid trick to (3.6), we get that B 2 is central, and from (3.2), we also get that BA = b−1 a.
(3.8)
(3.8) implies that a = ABA, which means that we can eliminate a and b. Using (3.3), (3.8), and the centrality of B 2 , we see that (3.4)–(3.6) become (3.9)
1 = Am = B 4 ,
(3.10)
B 2 = (AB)3 ,
so it remains to convert (3.7) to Behr and Mennicke’s form. However, applying (3.3), we have (3.11)
B 2 = (b2 a− 2 )3 = (A2 B −1 A 2 B)3 , 1
1
so, using 1 = B 8 and the centrality of B 2 , (3.12)
1 = (A2 B −1 A 2 B)3 B 6 = (A2 BA 2 B)3 . 1
1
Lemma 3.4. Let e = 2n , let 15 be the multiplicative inverse of 5 mod e, and let 1 s = l20 r 5 l−4 r−1 . SL2 (Z/e) is isomorphic to D G = l, r (3.13)
1 = le ,
(3.14)
1 = (lr−1 l)4 ,
(3.15)
(lr−1 l)2 = (r−1 l)3 ,
(3.16)
(lr−1 l)−1 s(lr−1 l) = s−1 ,
(3.17)
s−1 rs = r25 ,
E (lr−1 l)2 = (sr5 lr−1 l)3 . 1 1 1 Relations (3.13)–(3.18) are fulfilled by l = ,r= 0 1 1 in SL2 (Z/e). (3.18)
5 0 , and s = 1 0
0 1 5
1356
T. HSU
Proof. As the reader may verify, the relations (3.13)–(3.18) and s = l20 r 5 l−4 r−1 are satisfied in SL2 (Z/e), so it suffices to show that G is a homomorphic image of Mennicke’s presentation [10, p. 210]. Add generators A = r, B = lr−1 l, and T = s. Applying the braid trick to (3.15), we get that B 2 is central, BA = r−1 l, and l is conjugate to A−1 . As in the proof of the previous lemma, we can then eliminate generators l and r. Then (3.13), (3.14), (3.15), (3.16), (3.17), and (3.18) become Mennicke’s relations (X), (Y), (P), (Z), (Q), and (R), respectively. 1
For Lemma 3.5, we consider the following relations: (3.19)
1 = LN ,
(3.20)
1 = [a, r],
(3.21)
1 = [b, l],
(3.22)
1 = (ab−1 a)4 ,
(3.23)
(ab−1 a)2 = (b−1 a)3 ,
(3.24)
(ab−1 a)2 = (b2 a− 2 )3 ,
(3.25)
1 = (lr−1 l)4 ,
(3.26)
(lr−1 l)2 = (r−1 l)3 ,
(3.27)
(lr−1 l)−1 s(lr−1 l) = s−1 ,
(3.28)
s−1 rs = r25 ,
(3.29)
(lr−1 l)2 = (sr5 lr−1 l)3 .
1
All notation is as described in (2) and (3c)(i–iii) of the algorithm. Note that 1 = LN implies that L = al and R = br. defining Lemma 3.5. SL2 (Z/N ) has a presentation with generators L and R, and 1 1 1 0 relations (3.19)–(3.29). The relations are fulfilled by L = and R = 0 1 1 1 in SL2 (Z/N ). Proof. The Chinese Remainder Theorem implies that ∼ SL2 (Z/m) × SL2 (Z/e) . (3.30) SL2 (Z/N ) =
1 1 It also follows from the Chinese Remainder Theorem that, if L = and R = 0 1 1 0 in SL2 (Z/N ), the SL2 (Z/m) factor is precisely ha, bi and the SL2 (Z/e) 1 1 factor is precisely hl, ri. Therefore, the above relations are satisfied in SL2 (Z/N ). On the other hand, since (3.19) implies (3.4) and (3.13), comparison with Lemmas 3.3 and 3.4 shows that the above presentation is the direct product of SL2 (Z/m) and SL2 (Z/e). The lemma follows. Proof of Theorem 3.1. After steps 1 and 2 of the procedure, we know that the relations (3.31)
1 = LN ,
(3.32)
1 = (LR−1 L)2 ,
(3.33)
1 = (R−1 L)3
IDENTIFYING CONGRUENCE SUBGROUPS
1357
must be satisfied. From Theorem 2.4, we see that if (3.31)–(3.33) and (A) (resp. (B), (C)) are defining relations for SL2 (Z/N )/{±I} when N is odd (resp. N is a power of 2, and e and m are greater than 1), then Theorem 3.1 follows. Comparing (A) and Lemma 3.3, with a = L and b = R, and comparing (B) and Lemma 3.4, with l = L and r = R, the first two cases follow easily, so it remains to check the third. Comparing (C) and (3.19)–(3.29), we see that it is enough to show that given (3.31)–(3.33) and (3.19)–(3.29), the relations (3.21), (3.25), and (3.26) are redundant. First, (3.31), (3.32), (3.20), and (3.21) give us 1 = (LR−1 L)4 = (alr−1 b−1 al)4
(3.34)
= (ab−1 a)4 (lr−1 l)4 , which means that (3.22) implies (3.25). Similarly, (3.31), (3.32), (3.33), (3.20), and (3.21) imply (LR−1 L)2 = (R−1 L)3 ,
(3.35)
(ab−1 a)2 (lr−1 l)2 = (b−1 a)3 (r−1 l)3 ,
which means that (3.23) implies (3.26). Finally, since (3.32), (3.33), and the braid trick (3.3) imply that L is conjugate to R−1 , we can eliminate (3.21), since it is implied by (3.20). For hand calculations, and for further study, we note the following relations which occur in SL2 (Z/N ): (SL2 )
Z = (LR−1 L)2 = (R−1 L)3 , 1 = Z 2 ,
(level)
1 = LN = R N ,
(ab ≡ 0
(mod N )) 1 = [La , Rb ],
(ab ≡ −1 (mod N )) (La Rb )3 = Z, (ab ≡ −2 (mod N )) (La Rb )2 = Z. It has been verified by coset enumeration that the relations (SL2 ), (level), and (ab ≡ 0 (mod N )) are defining relations when N | 360. This means that if the level N divides 360, the congruence test reduces to checking that the relations (ab ≡ 0 (mod N )) are satisfied. Acknowledgements The author would like to thank J. H. Conway and the referee for many helpful comments and suggestions. Appendix A. An arithmetic congruence test In this appendix, we present an arithmetic h i and “invariant” congruence test which uses the Ihara modular group SL2 Z p1 .
1358
T. HSU
We begin by quoting the following result (Theorem A.1) of J. Mennicke [10]. (Note that Mennicke’s Schur multiplier calculation and subsequent argument require the repairs described in F.R. Beyl [3, §5], but the main result still holds.) Let N beh an i integer, let p be a prime not dividing N , let RN be the kernel 1 in SL2 Z p resulting from reduction mod N , and let QN be the normal clo h i sure of LN in SL2 Z p1 . Theorem A.1. RN = QN . Let Γ be a modular subgroup of level N and index m in SL2 (Z). Consider the commutative diagram in Figure A.1. SL2(Z[
– p
r
]) i
f1
SL2(Z/N) r
SL2(Z)
f2
ρ
Sm Figure A.1. Commutative diagram for Theorem A.2 Here, Sm is the symmetric group on m objects (the cosets of Γ in SL2 (Z)), r is reduction mod N , i is inclusion, and ρ is the permutation representation of SL2 (Z) induced by Γ. Note that f2 exists if and only if Γ is a congruence subgroup, and that such an f2 is uniquely determined. The setup in Figure A.1 provides us with an invariant congruence test. Theorem A.2. In the notation of Figure A.1, a map f1 exists if and only if f2 exists. In other words, h i Γ is congruence if and only if ρ can be factored through inclusion in SL2 Z 1p . Proof. If f2 exists, let f1 = f2 r. Conversely, if f1 exists, since LN is in the kernel of ρ, LN must be in the kernel of f1 , so in fact, f1 is well defined on h i. h i. (A.1) SL2 Z p1 QN = SL2 Z p1 RN ∼ = SL2 (Z/N ) , which means that f1 defines an appropriate map f2 . Corollary A.3. In Figure A.1, f1 is determined uniquely if it exists. One curious feature of Theorem A.2 is that if we know a given family of modular subgroups all have levels relatively prime to p, then we can handle all of them in a uniform manner. This is the principle behind Behr and Mennicke’s presentation of SL2 (Z/N ) for N odd, as these cases can be handled in SL2 Z 12 . We also note that if we fix the level N , then we can choose any p not dividing N to use in Theorem A.2. This leads to the following idea: For a given family of modular subgroups of level N , it seems plausible that one might be able to reduce the extensibility of ρ to the question of whether there exists a p which satisfies certain congruences mod N . Dirichlet’s theorem might then be used to find a p which satisfies those congruences.
IDENTIFYING CONGRUENCE SUBGROUPS
1359
References [1] A. O. L. Atkin and H. P. F. Swinnerton-Dyer, Modular forms on noncongruence subgroups, Proc. Symp. Pure Math., Combinatorics (Providence) (T. S. Motzkin, ed.), vol. 19, AMS, Providence, 1971, pp. 1–26. MR 49:2550 [2] H. Behr and J. Mennicke, A presentation of the groups P SL(2, p), Can. J. Math. 20 (1968), 1432–1438. MR 38:4566 [3] F. R. Beyl, The Schur multiplicator of SL(2, Z/mZ) and the congruence subgroup property, Math. Z. 191 (1986), 23–42. MR 87b:20071 [4] J. Britto, On the construction of non-congruence subgroups, Acta Arith. XXXIII (1977), 261–267. MR 56:12142 [5] S.-P. Chan, M.-L. Lang, C.-H. Lim, and S.-P. Tan, Special polygons for subgroups of the modular group and applications, Internat. J. Math. 4 (1993), no. 1, 11–34. MR 94j:11045 [6] B. Fine, Algebraic theory of the Bianchi groups, Marcel Dekker, Inc., New York, 1989. MR 90h:20002 [7] M.-L. Lang, C.-H. Lim, and S.-P. Tan, An algorithm for determining if a subgroup of the modular group is congruence, preprint, 1992. [8] H. Larcher, The cusp amplitudes of the congruence subgroups of the classical modular group, Ill. J. Math. 26 (1982), no. 1, 164–172. MR 83a:10040 [9] W. Magnus, Noneuclidean tesselations and their groups, Academic Press, 1974. MR 50:4774 [10] J. Mennicke, On Ihara’s modular group, Invent. Math. 4 (1967), 202–228. MR 37:1485 [11] M. H. Millington, On cycloidal subgroups of the modular group, Proc. Lon. Math. Soc. 19 (1969), 164–176. MR 40:1484 [12] , Subgroups of the classical modular group, J. Lon. Math. Soc. 1 (1969), 351–357. MR 39:5477 [13] K. Wohlfahrt, An extension of F. Klein’s level concept, Ill. J. Math. 8 (1964), 529–535. MR 29:4805 Department of Mathematics, Princeton University, Princeton, New Jersey 08544 E-mail address:
[email protected] Current address: Department of Mathematics, University of Michigan, Ann Arbor, Michigan 48109 E-mail address:
[email protected]