You are currently browsing the monthly archive for September 2009.
Let be a polynomial in two variables; i.e.
where each
is non-negative, and the coefficients
are complex numbers which are nonzero for only finitely many pairs
. For a generic choice of coefficients, the equation
determines a smooth complex curve
in
(i.e. a Riemann surface). How can one see the geometry of the curve directly in the expression for
? It turns out that there are several ways to do it, some very old, and some more recent.
The most important geometric invariant of the curve is the genus. To a topologist, this is the number of “handles”; to an algebraic geometer, this is the dimension of the space of holomorphic -forms. One well-known way to calculate the genus is by means of the Newton polygon. In the (real) plane
, consider the finite set consisting of the points with integer coordinates
for which the coefficient
of
is nonzero. The convex hull of this finite set is a convex integral polygon, called the Newton polygon of
. It turns out that the genus of
is the number of integer lattice points in the interior of the Newton polygon. In fact, one can find a basis for the space of holomorphic
-forms directly from this formulation. Let
denote the partial derivative of
with respect to
. Then for each lattice point
in the interior of the Newton polygon, the
-form
is a holomorphic
-form on
, and the set of all such forms is a basis for the space of all holomorphic
-forms.
This is direct but a bit unsatisfying to a topologist, since the connection between the dimension of the space of -forms and the topological idea of handles is somewhat indirect. In some special cases, it is a bit easier to see things. Two important examples are:
- Hyperelliptic surfaces, i.e equations of the form
for some polynomial
of degree
. The Newton polygon in this case is the triangle with vertices
and it has
interior lattice points. Geometrically one can “see” the surface by projecting to the
plane. For each generic value of
, the complex number
has two distinct square roots, so the map is 2 to 1. However, at the
roots of
, there is only 1 preimage. So the map is a double cover, branched over
points, and one can “see” the topology of the surface by cutting open two copies of the complex line along slits joining pairs of points, and gluing.
- A generic surface of degree
. The Newton polygon in this case is the triangle with vertices
and it has
interior lattice points. One way to “see” the surface in this case is to first imagine
lines in general position (a quite special degree
curve). Each pair of lines intersect in a point, so there are
points of intersection. After deforming the curve, these points of intersection are resolved into tubes, so one obtains
complex lines joined by
tubes. The first
tubes are needed to tube the lines together into a (multiply)-punctured plane, and the remaining
tubes each add one to the genus.
It turns out that there is a nice way to directly see the topology of in the Newton polygon, via tropical geometry. I recently learned about this idea from Mohammed Abouzaid in one of his Clay lectures; this point of view was pioneered by Grisha Mikhalkin. The idea is as follows. First consider the restriction of
to the product
; i.e. remove the intersection with the coordinate axes. For generic
, this amounts to removing a finite number of points from
, which will not change the genus. Then on this punctured curve
, consider the real valued function
. The image is a subset of
, called an amoeba. If one varies the (nonzero) coefficients of
generically, the complex geometry of the curve
will change, but its topology will not. Hence to see the topology of
one should deform the coefficients in such a way that the topology of the amoeba can be read off from combinatorial information, encoded in the Newton polygon. The terms in
corresponding to lattice points in a boundary edge of the Newton polygon sum to a polynomial which is homogeneous after a suitable change of coordinates. In the region in which these terms dominate,
looks more and more like a collection of cylinders, each asymptotic to a cone on some points at infinity. The image in the amoeba is a collection of asymptotically straight rays. If the polynomial were genuinely homogeneous, the preimage of each point in the amoeba would be a circle, parameterized by a choice of argument of (a certain root of) either
or
. So the amoeba looks like a compact blob with a collection of spikes coming off. As one deforms the coefficients in a suitable way, the compact blob degenerates into a piecewise linear graph which can be read off from purely combinatorial data, and the topology of
can be recovered by taking the boundary of a thickened tubular neighborhood of this graph.
More explicitly, one chooses a certain triangulation of the Newton polygon into triangles of area and with vertices at integer lattice points (by Pick’s theorem this is equivalent to the condition that each triangle and each edge has no lattice points in the interior). This triangulation must satisfy an additional combinatorial condition, namely that there must exist a convex piecewise linear function on the Newton polygon whose domains of linearity are precisely the triangles. This convex function is used to deform the coefficients of
; roughly, if
is the function, choose the coefficient
and take the limit as
gets very big. The convexity of
guarantees that in the preimage of each triangle of the Newton polygon, the terms of
that contribute the most are those corresponding to the vertices of the triangle. In particular, as
goes to infinity, the amoeba degenerates to the dual spine of the triangle (i.e. a tripod). The preimage of this tripod is a pair of pants; after a change of coordinates, any given triangle can be taken to have vertices
corresponding to a linear equation
whose solution set in
(for generic
) is a line minus two points — i.e. a pair of pants.
One therefore has a concrete combinatorial description of the degenerate amoeba: pick a triangulation of the Newton polygon satisfying the combinatorial conditions above. Let be the graph dual to the triangulation, with edges dual to boundary edges of the triangulation extended indefinitely. The surface
is obtained by taking the boundary of a thickened neighborhood of
. The genus of
is equal to the rank of the first homology of the graph
; this is evidently equal to the number of lattice points in the interior of the polygon.
As a really concrete example, consider a polynomial like
(the exact coefficients are irrelevant; the only issue is to choose them generically enough that the resulting curve is smooth (actually I did not check in this case – please pretend that I did!)). The Newton polygon is a rectangle with vertices . This can be subdivided into twelve triangles of area
as in the following figure:
The dual spine is then the following:

which evidently has rank of equal to
, equal on the one hand to the number of interior points in the Newton polygon, and on the other hand to the genus of
.
A geometric structure on a manifold is an atlas of charts with values in some kind of “model space”, and transformation functions taken from some pseudogroup of transformations on the model space. If is the model space, and
is the pseudo-group, one talks about a
-structure on a manifold
. One usually (but not always) wants
to be homogeneous with respect to
. So, for instance, one talks about smooth structures, conformal structures, projective structures, bilipschitz structures, piecewise linear structures, symplectic structures, and so on, and so on. Riemannian geometry does not easily fit into this picture, because there are so few (germs of) isometries of a typical Riemannian metric, and so many local invariants; but Riemannian metrics modeled on a locally symmetric space, with
a Lie group of symmetries of
, are a very significant example.
Sometimes the abstract details of a theory are hard to grasp before looking at some fundamental examples. The case of geometric structures on -manifolds is a nice example, which is surprisingly rich in some ways.
One of the most important ways in which geometric structures arise is in the theory of ODE’s. Consider a first order ODE in one variable, e.g. an equation like
- Indefinite integral
. The group in this case is
, acting on
by translation. The equation is solved by integrating:
.
- Linear homogeneous ODE
. The group in this case is
, acting on
by multiplication (notice that this group action is not transitive; the point
is preserved; this corresponds to the fact that
is always a solution of a homogeneous linear ODE). The Lie algebra is
, and the ODE is “solved” by exponentiating the vector field, and integrating. Hence
is the general solution. In fact, in the previous example, the Lie algebra of the group of translations is also identified with
, and “exponentiating” is the identity map.
- Linear inhomogeneous ODE
. The group in this case is the affine group
where the first factor acts by dilations and the second by translation. The affine group is not abelian, so one cannot “integrate” a vector field directly, but it is solvable: there is a short exact sequence
. The image in the Lie algebra of the group of dilations is the term
, which can be integrated as before to give an integrating factor
. Setting
gives
which is an indefinite integral, and can be solved by a further integration. In other words, we do one integration to change the structure group from
to
(“integrating out” the group of dilations) and then what is left is an abelian structure group, in which we can do “ordinary” integration. This procedure works whenever the structure group is solvable; i.e. whenever there is a finite sequence
where each
surjects onto an abelian group, with kernel
, and after finitely many steps, the last kernel is trivial.
- Ricatti equation
. In this case, it is well-known that the equation can blow up in finite time, and one does not obtain a group of transformations of
, but rather a group of transformations of the projective line
; another point of view says that one obtains a pseudogroup of transformations of subsets of
. The group in this case is the projective group
, acting by projective linear transformations. Let
be a
-parameter family of matrices in
, say
, with
. Matrices act on
by fractional linear maps; that is,
for
. Differentiating
at
one obtains
which is the general form of the Ricatti equation. Since the group
is not solvable, the Ricatti equation cannot be solved in terms of elementary functions and integrals. However, if one knows one solution
, one can find all other solutions as follows. Do a change of co-ordinates, by sending the solution
“to infinity”; i.e. define
. Then as a function of
, the Ricatti equation reduces to a linear inhomogeneous ODE. In other words, the structure group reduces to the subgroup of
fixing the point at infinity (i.e. the solution
), which is the affine group
. One can therefore solve for
, and by substituting back, for
.
The Ricatti equation is important for the solution of second order linear equations, since any second order linear equation can be transformed into a system of two first order linear equations in the variables
and
. A system of first order ODEs in
variables can be described in terms of pseudogroups of transformations of (subsets of)
. A system of linear equations corresponds to the structure group
, hence in the case of a
system, to
. The determinant map is a homomorphism from
to
with kernel
; hence, after multiplication by a suitable integrating factor, one can reduce to a system which is (equivalent to) the Ricatti equation.
Having seen these examples, one naturally wonders whether there are any other interesting families of equations and corresponding Lie groups acting on -manifolds. In fact, there are (essentially) no other examples: if one insists on (finite dimensional) simple Lie groups, then
is more or less the only example. Perhaps this is one of the reasons why the theory of ODEs tends to appear to undergraduates (and others) as an unstructured collection of rules and tricks. Nevertheless, recasting the theory in terms of geometric structures has the effect of clearing the air to some extent.
Geometric structures on
A third significant way in which geometric structures arise on circles is in the theory of conformal welding. Let
This list of examples of geometric structures on
I am in Kyoto right now, attending the twenty-first Nevanlinna colloquium (update: took a while to write this post – now I’m in Sydney for the Clay lectures). Yesterday, Junjiro Noguchi gave a plenary talk on Nevanlinna theory in higher dimensions and related Diophantine problems. The talk was quite technical, and I did not understand it very well; however, he said a few suggestive things early on which struck a chord.
The talk started quite accessibly, being concerned with the fundamental equation
where are coprime positive integers. The abc conjecture, formulated by Oesterlé and Masser, says that for any positive real number
, there is a constant
so that
where is the product of the distinct primes appearing in the product
. Informally, this conjecture says that for triples
satisfying the fundamental equation, the numbers
are not divisible by “too high” powers of a prime. The abc conjecture is known to imply many interesting number theoretic statements, including (famously) Fermat’s Last Theorem (for sufficiently large exponents), and Roth’s theorem on diophantine approximation (as observed by Bombieri).
Roth’s theorem is the following statement:
Theorem(Roth, 1955): Let be a real algebraic number. Then for any
, the inequality
has only finitely many solutions in coprime integers
.
This inequality is best possible, in the sense that every irrational number can be approximated by infinitely many rationals to within
. In fact, the rationals appearing in the continued fraction approximation to
have this property. There is a very short and illuminating geometric proof of this fact.
In the plane, construct a circle packing with a circle of radius with center
for each coprime pair
of integers.
This circle packing nests down on the -axis, and any vertical line (with irrational
-co-ordinate) intersects infinitely many circles. If the
co-ordinate of a vertical line is
, every circle the line intersects gives a rational
which approximates
to within
. qed.
On the other hand, consider the corresponding collection of circles with radius . Some “space” appears between neighboring circles, and they no longer pack tightly (the following picture shows
).
The total cross-sectional width of these circles, restricted to pairs in the interval
, can be estimated as follows. Each
contributes a width of
. Ignoring the coprime condition, there are
fractions of the form
in the interval
, so the total width is less than
which converges for positive
. In other words, the total cross-sectional width of all circles is finite. It follows that almost every vertical line intersects only finitely many circles.
Some vertical lines do, in fact, intersect infinitely many circles; i.e. some real numbers are approximated by infinitely many rationals to better than quadratic accuracy; for example, a Liouville number like .
Some special cases of Roth’s theorem are much easier than others. For instance, it is very easy to give a proof when is a quadratic irrational; i.e. an element of
for some integer
. Quadratic irrationals are characterized by the fact that their continued fraction expansions are eventually periodic. One can think of this geometrically as follows. The group
acts on the upper half-plane, which we think of now as the complex numbers with non-negative imaginary part, by fractional linear transformations
. The quotient is a hyperbolic triangle orbifold, with a cusp. A vertical line in the plane ending at a point
on the
-axis projects to a geodesic ray in the triangle orbifold. A rational number
approximating
to within
is detected by the geodesic entering a horoball centered at the cusp. If
is a quadratic irrational, the corresponding geodesic ray eventually winds around a periodic geodesic (this is the periodicity of the continued fraction expansion), so it never gets too deep into the cusp, and the rational approximations to
never get better than
for some constant
depending on
, as required. A different vertical line intersecting the
-axis at some
corresponds to a different geodesic ray; the existence of good rational approximations to
corresponds to the condition that the corresponding geodesic goes deeper and deeper into the cusp infinitely often at a definite rate (i.e. at a distance which is at least some fixed (fractional) power of time). A “random” geodesic on a cusped hyperbolic surface takes time
to go distance
out the cusp (this is a kind of equidistribution fact – the thickness of the cusp goes to zero like
, so if one chooses a sequence of points in a hyperbolic surface at random with respect to the uniform (area) measure, it takes about
points to find one that is distance
out the cusp). If one expects that every geodesic ray corresponding to an algebraic number looks like a “typical” random geodesic, one would conjecture (and in fact, Lang did conjecture) that there are only finitely many
for which
for any
.
A slightly different (though related) geometric way to see the periodicity of the continued fraction expansion of a quadratic irrational is to use diophantine geometry. This is best illustrated with an example. Consider the golden number . The matrix
has
and
as eigenvectors (here
denotes the “conjugate”
), and thus preserves a “wedge” in
bounded by lines with slopes
and
. The set of integer lattice points in this wedge is permuted by
, and therefore so is the boundary of the convex hull of this set (the sail of the cone). Lattice points on the sail correspond to rational approximations to the boundary slopes; the fact that
permutes this set corresponds to the periodicity of the continued fraction expansion of
(and certifies the fact that
cannot be approximated better than quadratically by rational numbers).
There is an analogue of this construction in higher dimensions: let be an
integer matrix whose eigenvalues are all real, positive, irrational and distinct. A collection of
suitable eigenvectors spans a polyhedral cone which is invariant under
. The convex hull of the set of integer lattice points in this cone is a polyhedron, and the vertices of this polyhedron (the vertices on the sail) are the “best” integral approximations to the eigenvectors. In fact, there is a
subgroup of
consisting of matrices with the same set of eigenvectors (this is a consequence of Dirichlet’s theorem on the structure of the group of units in the integers in a number field). Hence there is a group that acts discretely and co-compactly on the vertices of the sail, and one gets a priori estimates on how well the eigenvectors can be approximated by integral vectors. It is interesting to ask whether one can give a proof of Roth’s theorem along these lines, at least for algebraic numbers in totally real fields, but I don’t know the answer.
I was in Stony Brook last week, visiting Moira Chas and Dennis Sullivan, and have been away from blogging for a while; this week I plan to write a few posts about some of the things I discussed with Moira and Dennis. This is an introductory post about the Goldman bracket, an extraordinary mathematical object made out of the combinatorics of immersed curves on surfaces. I don’t have anything original to say about this object, but for my own benefit I thought I would try to explain what it is, and why Goldman was interested in it.
In his study of symplectic structures on character varieties , where
is the fundamental group of a closed oriented surface and
is a Lie group satisfying certain (quite general) conditions, Bill Goldman discovered a remarkable Lie algebra structure on the free abelian group generated by conjugacy classes in
. Let
denote the set of homotopy classes of closed oriented curves on
, where
is itself a compact oriented surface, and let
denote the free abelian group with generating set
. If
are immersed oriented closed curves which intersect transversely (i.e. in double points), define the formal sum
In this formula, are
thought of as based loops at the point
,
represents their product in
, and
represents the resulting conjugacy class in
. Moreover,
is the oriented intersection number of
and
at
.
This operation turns out to depend only on the free homotopy classes of and
, and extends by linearity to a bilinear map
. Goldman shows that this bracket makes
into a Lie algebra over
, and that there are natural Lie algebra homomorphisms from
to the Lie algebra of functions on
with its Poisson bracket.
The connection with character varieties can be summarized as follows. Let be a (smooth) class function (i.e. a function which is constant on conjugacy classes) on a Lie group
. Define the variation function
by the formula
where is some (fixed)
-invariant orthogonal structure on the Lie algebra
(for example, if
is reductive (eg if
is semisimple), one can take
). The tangent space to the character variety
at
is the first cohomology group of
with coefficients in
, thought of as a
module with the
action, and then as a
module by the representation
. Cup product and the pairing
determine a pairing
where the last equality uses the fact that is a closed surface group; this pairing defines the symplectic structure on
.
Every element determines a function
by sending a (conjugacy class of) representation
to
. Note that
only depends on the conjugacy class of
in
. It is natural to ask: what is the Hamiltonian flow on
generated by the function
? It turns out that when
is a simple closed curve, it is very easy to describe this Hamiltonian flow. If
is nonseparating, then define a flow
by
when
is represented by a curve disjoint from
, and
if
intersects
exactly once with a positive orientation (there is a similar formula when
is separating). In other words, the representation is constant on the fundamental group of the surface “cut open” along the curve
, and only deforms in the way the two conjugacy classes of
in the cut open surface are identified in
.
In the important motivating case that , so that one component of
is the Teichmüller space of hyperbolic structures on the surface
, one can take
, and then
is just the length of the geodesic in the free homotopy class of
, in the hyperbolic structure on
associated to a representation. In this case, the symplectic structure on the character variety restricts to the Weil-Petersson symplectic structure on Teichmüller space, and the Hamiltonian flow associated to the length function
is a family of Fenchel-Nielsen twists, i.e. the deformations of the hyperbolic structure obtained by cutting along the geodesic
, rotating through some angle, and regluing. This latter observation recovers a famous theorem of Wolpert, connected in an obvious way to his formula for the symplectic form
where
is angle and
is length, and the sum is taken over a maximal system of disjoint essential simple curves
for the surface
.
The combinatorial nature of the Goldman bracket suggests that it might have applications in combinatorial group theory. Turaev discovered a Lie cobracket on , and showed that together with the Goldman bracket, one obtains a Lie bialgebra. Motivated by Stallings’ reformulation of the Poincaré conjecture in terms of group theory, Turaev asked whether a free homotopy class contains a power of a simple curve if and only if the cobracket of the class is zero. The answer to this question is negative, as shown by Chas; on the other hand, Chas and Krongold showed that a class
is simple if and only if
is zero. Nevertheless, the full geometric meaning of the Goldman bracket remains mysterious, and a topic worthy of investigation.

Recent Comments