For almost 50 years, Paul Sally was a towering figure in mathematics education at the University of Chicago. Although he was 80 years old, and had two prosthetic legs and an eyepatch (associated with the Type 1 diabetes he had his whole life), it was nevertheless a complete shock to our department when he passed away last December, and we struggled just to cover his undergraduate teaching load this winter and spring. As my contribution, I have been teaching an upper-division undergraduate class on “topics in geometry”, which I have appropriated and repurposed as an introduction to the classical geometry and topology of surfaces.
I have tried to include at least one problem in each homework assignment which builds a connection between classical geometry and some other part of mathematics, frequently elementary number theory. For last week’s assignment I thought I would include a problem on the well-known connection between Pythagorean triples and the modular group, perhaps touching on the Euclidean algorithm, continued fractions, etc. But I have introduced the hyperbolic plane in my class mainly in the hyperboloid model, in order to stress an analogy with spherical geometry, and in order to make it easy to derive the identities for hyperbolic triangles (i.e. hyperbolic laws of sines and cosines) from linear algebra, so it made sense to try to set up the problem in the language of the orthogonal group , and the subgroup preserving the integral lattice in
.
First, let’s recall the definition of the hyperboloid model of the hyperbolic plane. In we consider the quadratic form
, and let
denote the group of real
matrices preserving this form. The vectors with
are those lying on a 2-sheeted hyperboloid; the positive sheet H is the one consisting of vectors whose z coefficient is positive, and
is the subgroup preserving this sheet. For each vector v in H, the tangent space
is naturally isomorphic to the set of vectors
with
; i.e. the subspace of vectors “perpendicular” to v with respect to the form. The restriction of the quadratic form to the tangent space is positive definite, so it makes H into a Riemannian manifold, in such a way that
acts by isometries. This group acts transitively, and the stabilizer of a point is conjugate to
; thus H with this metric is homogeneous and isotropic, and is a model for the hyperbolic plane.
Another model is the upper half-space model of the hyperbolic plane. In this model, we define H to be the subspace of complex numbers with positive imaginary part, and let denote the group of real
matrices, which acts on H by fractional linear transformations:
This action is not faithful; the subgroup acts trivially, so the action descends to the quotient
. The group acts transitively, and the stabilizer of a point is conjugate to
; thus (again) H is homogeneous and isotropic, and is a model for the hyperbolic plane. This reflects the exceptional isomorphism of groups
.
The subgroup acts discretely with finite covolume (i.e. it is a lattice in the Lie group
); the quotient is the modular surface — an orbifold with underlying surface a sphere with one puncture, and two cone points with order 2 and 3 respectively; one sometimes calls this the
-triangle orbifold, since it is made from two semi-ideal hyperbolic triangles with angles
at the vertices (the third “ideal” vertex is at infinity, and corresponds to the puncture). There is an associated tessellation of the hyperbolic plane by such triangles whose symmetry group is
in which the ideal vertices lie exactly at the rational numbers (plus infinity) on the boundary of hyperbolic space. Thus
acts in a natural way on the set of rational numbers union infinity, which can be thought of as the projective line over
. As an abstract group,
is the free product of two cyclic groups of order 2 and 3 respectively, corresponding to the matrices
and
and all torsion elements in are conjugate to these elements or their inverse (note that these matrices have orders 4 and 6 respectively in
; it is only in
that they have orders 2 and 3).
The group is an example of what is known as an arithmetic lattice; roughly speaking, the arithmetic lattices in semisimple Lie groups G are those with “integer entries”, in a suitable sense. Arithmetic lattices are characterized by the existence of many hidden symmetries — i.e. their finite index subgroups have surprisingly large normalizers in G. More formally, for a subgroup
in G, we define the commensurator of
to be the subgroup of G consisting of elements g such that the conjugate of
by g intersects
in a finite index subgroup. With this definition, Margulis famously proved that the arithmetic lattices are precisely those whose commensurators are dense, and that all other lattices (i.e. the non-arithmetic ones) have a commensurator which is discrete (and hence contains the lattice itself with finite index). In
, all the arithmetic lattices are derived from quaternion algebras over totally real number fields. Roughly speaking, if
is a totally real number field — i.e. a finite extension of
obtained by adjoining some root of an integer polynomial with all real roots — and if
is a quaternion algebra over
, then we can find a group
consisting of “integer” elements of
of norm 1. Each real embedding of
embeds
in a quaternion algebra over
; this is either the Hamiltonian quaternions (which is a division algebra), or the algebra of
real matrices (which has zero divisors). Then
embeds as a lattice in a product of copies of
and
, one for each real embedding in the Hamiltonian quaternions and in
respectively. The
factors are compact, so if there is exactly one
factor,
embeds as a lattice in it, and projects to a lattice in
; these are exactly the arithmetic lattices.
It is a theorem of Borel that the only way to get an arithmetic lattice in which is not cocompact is to take
— in other words,
.
OK, now — how to reproduce this picture in the hyperboloid model? The most natural guess is to look at — the group of
matrices with integer entries preserving the quadratic form
and the positive sheet of the hyperboloid. So, what exactly is this group? Let’s let A be a matrix in this group, and denote its column vectors by u,v,w. One obvious matrix to take is the identity matrix; for that matrix, the vector w is
which lies on the hyperboloid H, whereas the vectors u and v are orthonormal vectors in
. But this property of a triple of vectors is preserved by the action of any element of
, and therefore in general there is a bijection between such matrices and triples u,v,w where w lies on H, and u,v are orthonormal vectors in
.
Now consider the condition that the entries of the matrix be integers. Let’s abstract the discussion slightly. Suppose V is a real vector space of dimension n, with a symmetric nondegenerate quadratic form Q. Let L be a lattice in V; this is a slightly different use of the word “lattice” than above (at least in flavor) — it means a discrete cocompact additive subgroup, isomorphic as a group to . We suppose that the lattice L is integral and unimodular; the first condition means that
is an integer for all
in L, and the second means that the
matrix with entries
has determinant 1 or -1 for any basis
of L. Now, for any nonzero vector
the linear function
has image of finite index (because Q is nondegenerate and L has full rank) and therefore the kernel
has rank (n-1). If
has norm 1 or -1, then
is itself an integral unimodular lattice in the vector space
with respect to the quadratic form which is the restriction of Q.
In with the quadratic form Q as above, suppose we can find an integer vector w on the hyperboloid H. Then the intersection of
with the lattice of integer vectors has rank 2, and since the form Q is positive definite there, we can find an orthonormal basis u,v of integer vectors for
. Hence there is a matrix A in
taking
to w, and
acts transitively on such vectors, with stabilizer isomorphic to
, the group of symmetries of the square. If we want to restrict attention to orientation-preserving symmetries, then
is cyclic of order 4, generated by
Let’s find another matrix. An integral vector w on the hyperbolic H is a triple of integers x,y,z so that ; one simple example is
, and then it is straightforward to find vectors
and
for u and v. This gives the matrix
Actually, it is pretty easy to see that no other integral vector on H is closer to than
, since
is not a sum of two squares. Let’s let
be the group generated by R and T. Some experimentation with fundamental domains confirms that this group is a lattice, and that the quotient is a sphere with one puncture and two orbifold points of orders 2 and 4; in particular, this is the entire group
, and its quotient is the
triangle orbifold.
So, this group is certainly not . In fact, a rotation of order 4 realized as an element of
necessarily has a trace of
, so it can’t even have rational entries. But wait — this is surely an arithmetic lattice (for any conceivable definition of arithmetic), and therefore corresponds to some lattice derived from a quaternion algebra over a totally real number field. Since it is not cocompact, the only possibility is that the number field is
, so that this lattice is commensurable with
. At this point I vaguely recall something from a course on arithmetic lattices I took from Walter Neumann over 20 years ago in Melbourne, in which he stressed that the trace field of an arithmetic lattice (i.e. the field generated by the traces of the elements, thought of as a subgroup of
) is not by itself a commensurability invariant — rather the trace field generated by the squares of the elements is invariant; and the squares of the elements in this group all have integer trace after conjugating into
. So mathematics is consistent after all, and I learn the surprising (to me) fact that the
and
triangle orbifolds are commensurable. Hard to believe I have been working with Kleinian groups for 20 years without noticing that before . . .
Here’s a picture of the tiling of the hyperbolic plane whose symmetry group is :
The center is the projection of and the adjacent 8-valent vertices are the projection of
.
(Update May 20, 2014): As galoisrepresentations points out, the fact that the field generated by traces of squares of elements is a commensurability invariant is a theorem of Alan Reid.
A nice example! I also remember that course (assuming you mean the one that ran during summer of 93-94). It was my first job; I got $250 a week to learn math, watch the cricket while drinking milkshakes at the student union, and have two hour lunches with Matt which usually involved a stop for gelati on Lygon street.
I think the invariant trace field was first defined in a paper by Walter with Alan (Reid). There a further 7 non-compact arithmetic triangle groups (classified by Takeuchi), some of which are more transparently commensurable to the modular group than others. For example, if you take a fundamental domain for
to consist of the ideal triangle with vertices
then the corresponding group is the normalizer of the congruence subgroup
, if I computed correctly.
Ahh, I miss the old Lygon street gelati . . .