You are currently browsing the category archive for the ‘Surfaces’ category.
The “header image” for this blog is an example of an interesting construction in 2-dimensional conformal geometry, due to Richard Kenyon, that I learned of some time ago; I thought it might be fun to try to explain where it comes from.
The example comes from the idea of a Riemann surface lamination. This is an object that geometrizes some ideas in 1-dimensional complex analysis. The basic idea is simple: given a noncompact infinite Riemannian -manifold , one gives it a new topology by declaring that two points on the surface are “close” in the new topology if there are balls of big radius in the surface centered at the two points which are “almost isometric”. Points that were close in the old topology are close in the new topology, but points that might have been far away in the old topology can become close in the new. For example, if is a covering space of some other Riemannian surface , then points in the orbit of the deck group are “infinitely close” in the new topology. This means that the resulting topological space is not Hausdorff; one “Hausdorffifies” by identifying pairs of points that are not contained in disjoint open sets, and the quotient recovers the surface (assuming that the metric on is sufficiently generic; otherwise, it recovers modulo its group of isometries). Morally what one is doing is mapping into the space of pointed locally compact metric spaces (which is itself a locally compact topological space), and giving it the subspace topology. In more detail, a point in is a pair where is a locally compact metric space, and is a point. A sequence converges to if there are metric balls around of diameter going to infinity, metric balls around also of diameter going to infinity, and isometric inclusions of into metric spaces in such a way that the Hausdorff distance between the images of and in goes to zero as . Any locally compact metric space has a tautological map to , where each point is sent to the point . Gromov showed (see section 6 of this paper) that the space itself is locally compact; in fact, this follows in an obvious way from the Arzela-Ascoli theorem.
If has bounded geometry — i.e. if the injectivity radius is uniformly bounded below, and the curvature is bounded above and below — then the image of in is precompact, and its closure is a compact metric space . The path components of are exactly the Riemann surfaces which are arbitrarily well approximated (in the metric sense) on every compact subset by compact subsets of . If you were wandering around on such a component , and you wandered over a compact region, and were only able to measure the geometry up to some (arbitrarily fine) definite precision, you could never rule out the possibility that you were actually wandering around on . Topologically, is a Riemann surface lamination; i.e. a locally compact topological space covered by open charts of the form where is an open two-dimensional disk, where is totally disconnected, and where the transition between charts preserves the decomposition into pieces , and is smooth (in fact, preserves the Riemann surface structure) on the slices, in the overlaps. The unions of “surface” slices — i.e. the path components of — piece together to make the leaves of the lamination, which are (complete) Riemann surfaces. In our case, the leaves have Riemannian metrics, which vary continuously in the direction transverse to the leaves. (Surface) laminations occur in other areas of mathematics, for example as inverse limits of sequences of finite covers of a fixed compact surface, or as objects obtained by inductively splitting open sheets in a branched surface (the latter can easily occur as attractors of certain kinds of partially hyperbolic dynamical systems). One well-known example is sometimes called the (punctured) solenoid; its Teichmüller theory is studied by Penner and Šarić (question: does anyone know how to do a “\acute c” in wordpress? update 11/6: thanks Ian for the unicode hint).
A lamination is said to be minimal if every leaf is dense. In our context this means that for every compact region in and every there is a so that every ball in of radius contains a subset which is -close to in the Gromov-Hausdorff metric. In other words, every “local feature” of that appears somewhere, appears with definite density to within any desired degree of accuracy. Consequently, such features will “almost” appear, with the same definite density, in every other leaf of , and therefore is in the closure of each . Since is (in) the closure of , this implies that every leaf is dense, as claimed.
In a Riemann surface lamination, the conformal type of every leaf is well-defined. If some leaf is elliptic, then necessarily that leaf is a sphere. So if the lamination is minimal, it is equal to a single closed surface. If every leaf is hyperbolic, then each leaf admits a unique hyperbolic metric in its conformal class (i.e. each leaf can be uniformized), and Candel showed that this family of hyperbolic metrics varies continuously in . Étienne Ghys asked whether there is an example of a minimal Riemann surface lamination in which some leaves are conformally parabolic, and others are conformally hyperbolic. It turns out that the answer to this question is yes; Richard Kenyon found an example, which I will now describe.
The lamination in question has exactly one hyperbolic leaf, which is topologically a -times punctured sphere. Every other leaf is an infinite cylinder — i.e. it is conformally the punctured plane . Since the lamination is minimal, to describe the lamination, one just needs to describe one leaf. This leaf will be obtained as the boundary of a thickened neighborhood of an infinite planar graph, which is defined inductively, as follows.
Let be the planar “Greek cross” as in the following figure:
Inductively, if we have defined , define by attaching four copies of to the extremities of . The first few examples are illustrated in the following figure:
The limit is a planar tree with exactly four ends; the boundary of a thickened tubular neighborhood is conformally equivalent to a sphere with four points removed, which is hyperbolic. Every unbounded sequence of points in has a subsequence which escapes out one of the ends. Hence every other leaf in the lamination this defines has exactly two ends, and is conformally equivalent to a punctured plane, which is parabolic.
The header image is a very similar construction in -dimensional space, where the initial seed has six legs along the coordinate axes instead of four; some (quite large) approximation was then rendered in povray.
When I was in graduate school, I was very interested in the (complex) geometry of Riemann surface laminations, and wanted to understand their deformation theory, perhaps with the aim of using structures like taut foliations and essential laminations to hyperbolize -manifolds, as an intermediate step in an approach to the geometrization conjecture (now a theorem of Perelman). I know that at one point Sullivan was quite interested in such objects, as a tool in the study of Julia sets of rational functions. I have the impression that they are not studied so much these days, but I would be happy to be corrected.
Martin Bridgeman gave a nice talk at Caltech recently on his discovery of a beautiful identity concerning orthospectra of hyperbolic surfaces (and manifolds of higher dimension) with totally geodesic boundary. The -dimensional case is (in my opinion) the most beautiful, and I would like to take a post to explain the identity, and give a derivation which is slightly different from the one Martin gives in his paper. There are many other things one could say about this identity, and its relation to other identities that turn up in the theory of hyperbolic manifolds (and elsewhere); I hope to get to this in a later post.
Let be a hyperbolic surface with totally geodesic boundary. An orthogeodesic is a geodesic segment properly immersed in , which is perpendicular to at its endpoints. The set of orthogeodesics is countable, and their lengths are proper. Denote these lengths by (with multiplicity). The identity is:
where is the Rogers’ dilogarithm function (to be defined in a minute). Treating this function as a black box for the moment, the identity has the form a term depending only on the topology of . The proof is very, very short and elegant. By the Gauss-Bonnet theorem, the term on the right is equal to of the volume of the unit tangent bundle of . Almost every tangent vector on can be exponentiated to a geodesic on which intersects the boundary in finite forward and backward time (eg. by ergodicity of the geodesic flow on a closed hyperbolic surface obtained by doubling). If is such a tangent vector, and is the associated geodesic arc, then is homotopic keeping endpoints on to a unique orthogeodesic (which is the unique length minimizer in this relative homotopy class). The volume of the set of associated to a given orthogeodesic can be computed as follows. Lift to the universal cover, where it is the crossbar of a letter “H” whose vertical lines are lifts of the geodesics it ends on. Any lifts to a unique geodesic segment in the universal cover with endpoints on the edges of the H. So the volume of the set of such depends only on , giving rise to the explicit formula for . qed.
That’s it — that’s the whole proof! . . . modulo some calculations, which we now discuss.
The “ordinary” polylogarithms are defined by Taylor series
which converges for , and extends by analytic continuation. Taking derivatives, one sees that they satisfy , thereby giving rising to integral formulae. is the familiar geometric series , so and
The Rogers dilogarithm is then given by the formula for real . One sees that the Rogers dilogarithm is obtained by symmetrizing the integrand for the integral expression for under the involution :
Martin derives his identity by direct calculation, but in fact this calculation can be simplified a bit by some hyperbolic geometry. Consider an ideal quadrilateral (whose unit tangent bundle has area ) with one pair of opposite sides that are distance apart. Join opposite vertices in pairs to decompose the quadrilateral into four triangles, each with one non-ideal point:
In the (schematic) picture, suppose the two edges of the H are the left and right side (call them and ) and the other two edges are and . Similarly, call the four triangles depending on which edge of the quadrilateral they bound. The triangle is colored gray in the figure. We secretly identify this figure with the upper half-plane, in such a way that the ideal vertices are (in circular order) , where are the ideal vertices of the gray triangle. Call the (hyperbolic) angle of the gray triangle at its vertex, so . Moreover, it turns out that where is the distance between and . We will compute implicitly as a function of , and show that it is a multiple of the Rogers dilogarithm function, thus verifying Bridgeman’s identity.
Every vector in exponentiates to a (bi-infinite) geodesic , and we want to compute the volume of the set of vectors for which the corresponding geodesic intersects both and . The point of the decomposition is that for in (say), the geodesic intersects whenever it intersects , so we only need to compute the volume of the in for which intersects . Similarly, we only need to compute the volume of the in for which intersects . For in , we compute the volume of the which do not intersect (since these are exactly the ones that intersect both and ), and similarly for .
These volumes can be expressed in terms of integrals of harmonic functions. Let denote the harmonic function on the disk which is on the arc of the circle bounded by , and on the rest of the circle. This function at each point is equal to times the visual angle (i.e. the length in the unit tangent circle) subtended by the given arc of the circle, as seen from the given point in the hyperbolic plane. Define similarly. Then the total volume we need to compute is equal to
(here we have identified by symmetry, and similarly for the other pair of terms). Let us approach this a bit more systematically. If denotes the angle at the nonideal vertex of triangle , we denote , and . The integral we want to evaluate can be expressed easily in terms of explicit rational multiples of , and the function . These functions satisfy obvious identities:
where the last identity comes by observing that we are integrating a certain function over an ideal triangle, and observing that the average of this function under the symmetries of the ideal triangle is equal to the constant function . In particular, we see that we can express everything in terms of . After some elementary reorganization, we see that the contribution to the volume of the unit tangent bundle of the surface associated to this particular orthogeodesic is
To compute , it makes sense to move to the upper half-space model, and move the endpoints of the interval to and . The harmonic function is equal to on the negative real axis, and on the positive real axis. It takes the value on the line . The area form in the hyperbolic metric is proportional to the Euclidean area form, with constant . In other words, we want to integrate over the region indicated in the figure, where the nonideal angle is , and the base point is :
If we normalize so that the circular arc is part of the semicircle from to , then the real projection of the vertical lines in the figure are and . There is no elementary way to evaluate this integral, so instead we evaluate its derivative as a function of where as before, . This is the definite integral
Integrating by parts gives . This evaluates to
Thinking of as a function of , we get
Comparing values at we see that and the identity is proved.
Well, OK, this is not terribly simple, but a posteriori it gives a way to express the Rogers dilogarithm as a sum of integrals of very simple harmonic functions over hyperbolic triangles, which is a nice geometric way to think of it.
(Update 10/30): This paper by Dupont and Sah relates Rogers dilogarithm to volumes of -simplices, and discusses some interesting connections to conformal field theory and lattice model calculations. I feel like a bit of a dope, since I read this paper while I was in graduate school more than a dozen years ago, but forgot all about it until I was cleaning out my filing cabinet this morning. They cite an older paper of Dupont for the explicit calculations; these are somewhat tedious and unenlightening; however, he does manage to show that the Rogers dilogarithm is characterized by the Abel identity. In other words,
Lemma A.1 (Dupont): Let be a three times differentiable function satisfying
for all . Then there is a real constant such that where is the Rogers dilogarithm (up to an additive constant).
Nevertheless, they don’t seem to have noticed the formula in terms of integrals of harmonic functions over hyperbolic triangles. Perhaps this is also well-known. Do any readers know?
Hermann Amandus Schwarz (1843-1921) was a student of Kummer and Weierstrass, and made many significant contributions to geometry, especially to the fields of minimal surfaces and complex analysis. His mathematical creations are both highly abstract and flexible, and at the same time intimately tied to explicit and practical calculation.
I learned about Schwarz-Christoffel transformations, Schwarzian derivatives, and Schwarz’s minimal surface as three quite separate mathematical objects, and I was very surprised to discover firstly that they had all been discovered by the same person, and secondly that they form parts of a consistent mathematical narrative, which I will try to explain in this post to the best of my ability. There is an instructive lesson in this example (for me), that we tend to mine the past for nuggets, examples, tricks, formulae etc. while forgetting the points of view and organizing principles that made their discovery possible. Another teachable example is that of Dehn’s “invention” of combinatorial (infinite) group theory, as a natural branch of geometry; several generations of followers went about the task of reformulating Dehn’s insights and ideas in the language of algebra, “generalizing” them and stripping them of their context, before geometric and topological methods were reintroduced by Milnor, Schwarz (a different one this time), Stallings, Thurston, Gromov and others to spectacular effect (note: I have the second-hand impression that the geometric point of view in group theory (and every other subject) was never abandoned in the Soviet Union).
Schwarz’s minimal surface (also called “Schwarz’s D surface”, and sometimes “Schwarz’s H surface”) is an extraordinarily beautiful triply-periodic minimal surface of infinite genus that is properly embedded in . According to Nitsche’s excellent book (p.240), this minimal surface closely resembles the separating wall between inorganic and organic materials in the skeleton of a starfish. The basic building block of the surface can be described as follows. If the vertices of a cube are -colored, the black vertices are the vertices of a regular tetrahedron. Let denote the quadrilateral formed by four edges of this tetrahedron; then a fundamental piece of Schwarz’s surface is a minimal disk spanning :
The surface may be “analytically continued” by rotating through an angle around each boundary edge. Six copies of fit smoothly around each vertex, and the resulting surface extends (triply) periodically throughout space.
The symmetries of enable us to give it several descriptions as a Riemann surface. Firstly, we could think of as a polygon in the hyperbolic plane with four edges of equal length, and angles . Twelve copies of can be assembled to make a hyperbolic surface of genus . Thinking of a surface of genus as the boundary of a genus handlebody defines a homomorphism from to , thought of as ; the cover associated to the kernel is (conformally) the triply periodic Schwarz surface, and the deck group acts on as a lattice (of index in the face-centered cubic lattice).
Another description is as follows. Since the deck group acts by translation, the Gauss map from to factors through a map . The map is injective at each point in the interior or on an edge of a copy of , but has an order branch point at each vertex. Thus, the map is a double-branched cover, with one branch point of order at each vertex of a regular inscribed cube. This leads one to think (like a late 19th century mathematician) of as the Riemann surface on which a certain multi-valued function on is single-valued. Under stereographic projection, the vertices of the cube map to the eight points where . These eight points are the roots of the polynomial , so we may think of as the hyperelliptic Riemann surface defined by the equation ; equivalently, as the surface on which the multi-valued (on ) function is single-valued.
The function is known as the Weierstrass function associated to , and an explicit formula for the co-ordinates of the embedding were found by Enneper and Weierstrass. After picking a basepoint (say ) on the sphere, the coordinates are given by integration:
The integral in each case depends on the path, and lifts to a single-valued function precisely on .
Geometrically, the three coordinate functions are harmonic functions on . This corresponds to the fact that minimal surfaces are precisely those with vanishing mean curvature, and the fact that the Laplacian of the coordinate functions (in terms of isothermal parameters on the underlying Riemann surface) can be expressed as a nonzero multiple of the mean curvature vector. A harmonic function on a Riemann surface is the real part of a holomorphic function, unique up to a constant; the holomorphic derivative of the (complexified) coordinate functions are therefore well-defined, and give holomorphic -forms which descend to (since the deck group acts by translations). These -forms satisfy the identity (this identity expresses the fact that the embedding of into via these functions is conformal). The (composition of the) Gauss map (with stereographic projection) can be read off from the , and as a meromorphic function on , it is given by the formula . Define a function on by the formula . Then are the coordinates of a rational map from into which extends to a map into , by sending each zero of to in the at infinity. Symmetry allows us to identify the image with the hyperelliptic embedding from before, and we deduce that . Solving for we obtain the integrands in the formulae above.
In fact, any holomorphic function on a domain in defines a (typically immersed with branch points) minimal surface, by the integral formulae of Enneper-Weierstrass above. Suppose we want to use this fact to produce an explicit description of a minimal surface bounded by some explicit polygonal loop in . Any minimal surface so obtained can be continued across the boundary edges by rotation; if the angles at the vertices are all of the form the resulting surface closes up smoothly around the vertices, and one obtains a compact abstract Riemann surface tiled by copies of the fundamental region, together with a holonomy representation of into . Sometimes the image of this representation in the rotational part of is finite, and one obtains an infinitely periodic minimal surface as in the case of Schwarz’s surface. A fundamental tile in can be uniformized as a hyperbolic polygon; equivalently, as a region in the upper half-plane bounded by arcs of semicircles perpendicular to the real axis. Since the edges of the loop are straight lines, the image of this hyperbolic polygon under the Gauss map is a region in also bounded by arcs of round circles; thus Schwarz’s study of minimal surfaces naturally led him to the problem of how to explicitly describe conformal maps between regions in the plane bounded by circular arcs. This problem is solved by the Schwarz-Christoffel transformation, and its generalizations, with help from the Schwarzian derivative.
Note that if and are two such regions, then a conformal map from to can be factored as the product of a map uniformizing as the upper half-plane, followed by the inverse of a map uniformizing as the upper half-plane. So it suffices to find a conformal map when the domain is the upper half plane, decomposed into intervals and rays that are mapped to the edges of a circular polygon . Near each vertex, can be moved by a fractional linear transformation to (part of) a wedge, consisting of complex numbers with argument between and , where is the angle at . The function uniformizes the upper half-plane as such a wedge; however it is not clear how to combine the contributions from each vertex, because of the complicated interaction with the fractional linear transformation. The fundamental observation is that there are certain natural holomorphic differential operators which are insensitive to the composition of a holomorphic function with groups of fractional linear transformations, and the uniformizing map can be expressed much more simply in terms of such operators.
For example, two functions that differ by addition of a constant have the same derivative: . Functions that differ by multiplication by a constant have the same logarithmic derivative: . Putting these two observations together suggest defining the nonlinearity of a function as the composition . This has the property that for any constants . Under inversion the nonlinearity transforms by . From this, and a simple calculation, one deduces that the operator is invariant under inversion, and since it is also invariant under addition and multiplication by constants, it is invariant under the full group of fractional linear transformations. This combination is called the Schwarzian derivative; explicitly, it is given by the formula . Given the Schwarzian derivative , one may recover the nonlinearity by solving the Ricatti equation . As explained in this post, solutions of the Ricatti equation preserve the projective structure on the line; in this case, it is a complex projective structure on the complex line. Equivalently, different solutions differ by an element of , acting by fractional linear transformations, as we have just deduced. Once we know the nonlinearity, we can solve for by , the usual solution to a first order linear inhomogeneous ODE. The Schwarzian of the function is . The advantage of expressing things in these terms is that the Schwarzian of a uniformizing map for a circular polygon with angles at the vertices has the form of a rational function, with principal parts , where the and the and depend (unfortunately in a very complicated way) on the edges of (for the ugly truth, see Nehari, chapter 5). To see this, observe that the map has an order two pole near finitely many points (the preimages of the vertices of under the uniformizing map) but is otherwise holomorphic. Moreover, it can be analytically continued into the lower half plane across the interval between successive , by reflecting the image across each circular edge. After reflecting twice, the image of is transformed by a fractional linear transformation, so has an analytic continuation which is single valued on the entire Riemann sphere, with finitely many isolated poles, and is therefore a rational function! When the edges of the polygon are straight, a simpler formula involving the nonlinearity specializes to the “familiar” Schwarz-Christoffel formula.
(Update 10/22): In fact, I went to the library to refresh myself on the contents of Nehari, chapter 5. The first thing I noticed — which I had forgotten — was that if is the uniformizing map from the upper half-plane to a polygon with spherical arcs, then is real-valued on the real axis. Since it is a rational function, this implies that its nonsingular part is actually a constant; i.e.
where is as above, and are real constants (which satisfy some further conditions — really see Nehari this time for more details).
The other thing that struck me was the first paragraph of the preface, which touches on some of the issues I alluded to above:
In the preface to the first edition of Courant-Hilbert’s “Methoden der mathematischen Physik”, R. Courant warned against a trend discernible in modern mathematics in which he saw a menace to the future development of mathematical analysis. He was referring to the tendency of many workers in this field to lose sight of the roots of mathematical analysis in physical and geometric intuition and to concentrate their efforts on the refinement and the extreme generalization of existing concepts.
Instead of using a word like “menace”, I would rather take this as a lesson about the value of returning to the points of view that led to the creation of the mathematical objects we study every day; which was (to some approximation) the point I was trying to illustrate in this post.
An amenable group acting by homeomorphisms on a compact topological space preserves a probability measure on ; in fact, one can given a definition of amenability in such terms. For example, if is finite, it preserves an atomic measure supported on any orbit. If , one can take a sequence of almost invariant probability measures, supported on the subset (where is arbitrary), and any weak limit will be invariant. For a general amenable group, in place of the subsets , one works with a sequence of Folner sets; i.e. subsets with the property that the ratio of their size to the size of their boundary goes to zero (so to speak).
But if is not amenable, it is generally not true that there is any probability measure on invariant under the action of . The best one can expect is a probability measure which is invariant on average. Such a measure is called a harmonic measure (or a stationary measure) for the -action on . To be concrete, suppose is finitely generated by a symmetric generating set (symmetric here means that if , then ). Let denote the space of probability measures on . One can form an operator defined by the formula
and then look for a probability measure stationary under , which exists for quite general reasons. This measure is the harmonic measure: the expectation of the -measure of under a randomly chosen is equal to the -measure of . Note for any probability measure that is absolutely continuous with respect to ; in fact, the Radon-Nikodym derivative satisfies . Substituting for in this formula, one sees that the measure class of is preserved by , and that for every , we have , where denotes word length with respect to the given generating set.
The existence of harmonic measure is especially useful when is one-dimensional, e.g. in the case that . In one dimension, a measure (at least one of full support without atoms) can be “integrated” to a path metric. Consequently, any finitely generated group of homeomorphisms of the circle is conjugate to a group of bilipschitz homeomorphisms (if the harmonic measure associated to the original action does not have full support, or has atoms, one can “throw in” another random generator to the group; the resulting action can be assumed to have a harmonic measure of full support without atoms, which can be integrated to give a structure with respect to which the group action is bilipschitz). In fact, Deroin-Kleptsyn-Navas showed that any countable group of homeomorphisms of the circle (or interval) is conjugate to a group of bilipschitz homeomorphisms (the hypothesis that be countable is essential; for example, the group acts in a non-bilipschitz way on the interval — see here).
Suppose now that for some manifold . The action of on determines a foliated circle bundle ; i.e. a circle bundle, together with a codimension one foliation transverse to the circle fibers. To see this, first form the product with its product foliation by leaves , where denotes the universal cover of . The group acts on as the deck group of the covering, and on by the given action; the quotient of this diagonal action on the product is the desired circle bundle . The foliation makes into a “flat” circle bundle with structure group . The foliation allows us to associate to each path in a homeomorphism from the fiber over to the fiber over ; integrability (or flatness) implies that this homeomorphism only depends on the relative homotopy class of in . This identification of fibers is called the holonomy of the foliation along the path . If is a Riemannian manifold, there is another kind of harmonic measure on the circle bundle; in other words, a probability measure on each circle with the property that the holonomy associated to an infinitesimal random walk on preserves the expected value of the measure. This is (very closely related to) a special case of a construction due to Lucy Garnett which associates a harmonic transverse measure to any foliation of a manifold , by finding a fixed point of the leafwise heat flow on the space of probability measures on , and disintegrating this measure into the product of the leafwise area measure, and a “harmonic” transverse measure.
In any case, we normalize our foliated circle bundle so that each circle has length in its harmonic measure. Let be the vector field on the circle bundle that rotates each circle at unit speed, and let be the -form on whose kernel is tangent to the leaves of the foliation. We scale so that everywhere. The integrability condition for a foliation is expressed in terms of the -form as the identity , and we can write where . More intrinsically, descends to a -form on the leaves of the foliation which measures the logarithm of the rate at which the transverse measure expands under holonomy in a given direction (the leafwise form is sometimes called the Godbillon class, since it is “half” of the Godbillon-Vey class associated to a codimension one foliation; see e.g. Candel-Conlon volume 2, Chapter 7). Identifying the universal cover of each leaf with by projection, the fact that our measure is harmonic means that “is” the gradient of the logarithm of a positive harmonic function on . As observed by Thurston, the geometry of then puts constraints on the size of . The following discussion is taken largely from Thurston’s paper “Three-manifolds, foliations and circles II” (unfortunately this mostly unwritten paper is not publicly available; some details can be found in my foliations book, example 4.6).
An orthogonal connection on can be obtained by averaging under the flow of ; i.e. if is the diffeomorphism of which rotates each circle through angle , then
is an -invariant -form on , which therefore descends to a -form on , which can be thought of as a connection form for an -structure on the bundle . The curvature of the connection (in the usual sense) is the -form , and we have a formula
The action of the -parameter group trivializes the cotangent bundle to over each fiber. After choosing such a trivialization, we can think of the values of at each point on a fiber as sweeping out a circle in a fixed vector space . The tangent to this circle is found by taking the Lie derivative
In other words, is identified with under the identification of with , and ; i.e. the absolute value of the curvature of the connection is equal to times the area enclosed by .
Now suppose is a hyperbolic -manifold, i.e. a manifold of dimension with constant curvature everywhere. Equivalently, think of as a quotient of hyperbolic space by a discrete group of isometries. A positive harmonic function on has a logarithmic derivative which is bounded pointwise by ; identifying positive harmonic functions on hyperbolic space with distributions on the sphere at infinity, one sees that the “worst case” is the harmonic extension of an atomic measure concentrated at a single point at infinity, since every other positive harmonic function is the weighted average of such examples. As one moves towards or away from a blob at infinity concentrated near this point, the radius of the blob expands like ; since the sphere at infinity has dimension , the conclusion follows. But this means that the speed of (i.e. the size of ) is pointwise bounded by , and the length of the circle is at most . A circle of length can enclose a disk of area at most , so the curvature of the connection has absolute value pointwise bounded by .
One corollary is a new proof of the Milnor-Wood inequality, which says that a foliated circle bundle over a closed oriented surface of genus at least satisfies , where is the Euler number of the bundle (a topological invariant). For, the surface can be given a hyperbolic metric, and the bundle a harmonic connection whose average is an orthogonal connection with pointwise curvature of absolute value at most . The Euler class of the bundle evaluated on the fundamental class of is the Euler number ; we have
where the first equality is the Chern-Weil formula for the Euler class of a bundle in terms of the curvature of a connection, and the last equality is the Gauss-Bonnet theorem for a hyperbolic surface. Another corollary gives lower bounds on the area of an incompressible surface in a hyperbolic manifold. Suppose is an immersion which is injective on . There is a cover of for which the immersion lifts to a homotopy equivalence, and we get an action of on the circle at infinity of , and hence a foliated circle bundle as above with . Integrating as above over the image of in , and using the fact that the curvature of is pointwise bounded by , we deduce that the area of is at least . If is a -manifold, we obtain .
(A somewhat more subtle argument allows one to get better bounds, e.g. replacing by for , and better estimates for higher .)
I was in Stony Brook last week, visiting Moira Chas and Dennis Sullivan, and have been away from blogging for a while; this week I plan to write a few posts about some of the things I discussed with Moira and Dennis. This is an introductory post about the Goldman bracket, an extraordinary mathematical object made out of the combinatorics of immersed curves on surfaces. I don’t have anything original to say about this object, but for my own benefit I thought I would try to explain what it is, and why Goldman was interested in it.
In his study of symplectic structures on character varieties , where is the fundamental group of a closed oriented surface and is a Lie group satisfying certain (quite general) conditions, Bill Goldman discovered a remarkable Lie algebra structure on the free abelian group generated by conjugacy classes in . Let denote the set of homotopy classes of closed oriented curves on , where is itself a compact oriented surface, and let denote the free abelian group with generating set . If are immersed oriented closed curves which intersect transversely (i.e. in double points), define the formal sum
In this formula, are thought of as based loops at the point , represents their product in , and represents the resulting conjugacy class in . Moreover, is the oriented intersection number of and at .
This operation turns out to depend only on the free homotopy classes of and , and extends by linearity to a bilinear map . Goldman shows that this bracket makes into a Lie algebra over , and that there are natural Lie algebra homomorphisms from to the Lie algebra of functions on with its Poisson bracket.
The connection with character varieties can be summarized as follows. Let be a (smooth) class function (i.e. a function which is constant on conjugacy classes) on a Lie group . Define the variation function by the formula
where is some (fixed) -invariant orthogonal structure on the Lie algebra (for example, if is reductive (eg if is semisimple), one can take ). The tangent space to the character variety at is the first cohomology group of with coefficients in , thought of as a module with the action, and then as a module by the representation . Cup product and the pairing determine a pairing
where the last equality uses the fact that is a closed surface group; this pairing defines the symplectic structure on .
Every element determines a function by sending a (conjugacy class of) representation to . Note that only depends on the conjugacy class of in . It is natural to ask: what is the Hamiltonian flow on generated by the function ? It turns out that when is a simple closed curve, it is very easy to describe this Hamiltonian flow. If is nonseparating, then define a flow by when is represented by a curve disjoint from , and if intersects exactly once with a positive orientation (there is a similar formula when is separating). In other words, the representation is constant on the fundamental group of the surface “cut open” along the curve , and only deforms in the way the two conjugacy classes of in the cut open surface are identified in .
In the important motivating case that , so that one component of is the Teichmüller space of hyperbolic structures on the surface , one can take , and then is just the length of the geodesic in the free homotopy class of , in the hyperbolic structure on associated to a representation. In this case, the symplectic structure on the character variety restricts to the Weil-Petersson symplectic structure on Teichmüller space, and the Hamiltonian flow associated to the length function is a family of Fenchel-Nielsen twists, i.e. the deformations of the hyperbolic structure obtained by cutting along the geodesic , rotating through some angle, and regluing. This latter observation recovers a famous theorem of Wolpert, connected in an obvious way to his formula for the symplectic form where is angle and is length, and the sum is taken over a maximal system of disjoint essential simple curves for the surface .
The combinatorial nature of the Goldman bracket suggests that it might have applications in combinatorial group theory. Turaev discovered a Lie cobracket on , and showed that together with the Goldman bracket, one obtains a Lie bialgebra. Motivated by Stallings’ reformulation of the Poincaré conjecture in terms of group theory, Turaev asked whether a free homotopy class contains a power of a simple curve if and only if the cobracket of the class is zero. The answer to this question is negative, as shown by Chas; on the other hand, Chas and Krongold showed that a class is simple if and only if is zero. Nevertheless, the full geometric meaning of the Goldman bracket remains mysterious, and a topic worthy of investigation.
If is a smooth function on a manifold , and is a critical point of , recall that the Hessian is the quadratic form on (in local co-ordinates, the coefficients of the Hessian are the second partial derivatives of at ). Since is symmetric, it has a well-defined index, which is the dimension of the subspace of maximal dimension on which is negative definite. The Hessian does not depend on a choice of metric. One way to see this is to give an alternate definition where and are any two vector fields with given values and in . To see that this does not depend on the choice of , observe
because of the hypothesis that vanishes at . This calculation shows that the formula is symmetric in and . Furthermore, since only depends on the value of at , the symmetry shows that the result only depends on and as claimed. A critical point is nondegenerate if is nondegenerate as a quadratic form.
In Morse theory, one uses a nondegenerate smooth function (i.e. one with isolated nondegenerate critical points), also called a Morse function, to understand the topology of : the manifold has a (smooth) handle decomposition with one -handle for each critical point of of index . In particular, nontrivial homology of forces any such function to have critical points (and one can estimate their number of each index from the homology of ). Morse in fact applied his construction not to finite dimensional manifolds, but to the infinite dimensional manifold of smooth loops in some finite dimensional manifold, with arc length as a “Morse” function. Critical “points” of this function are closed geodesics. Any closed manifold has a nontrivial homotopy group in some dimension; this gives rise to nontrivial homology in the loop space. Consequently one obtains the theorem of Lyusternik and Fet:
Theorem: Let be a closed Riemannian manifold. Then admits at least one closed geodesic.
In higher dimensions, one can study the space of smooth maps from a fixed manifold to a Riemannian manifold equipped with various functionals (which might depend on extra data, such as a metric or conformal structure on ). One context with many known applications is when is a Riemannian -manifold, is a surface, and one studies the area function on the space of smooth maps from to (usually in a fixed homotopy class). Critical points of the area function are called minimal surfaces; the name is in some ways misleading: they are not necessarily even local minima of the area function. That depends on the index of the Hessian of the area function at such a point.
Let be a (compactly supported) -parameter family of surfaces in a Riemannian -manifold , for which is smoothly immersed. For small the surfaces are transverse to the exponentiated normal bundle of ; hence locally we can assume that takes the form where are local co-ordinates on , and is contained in the normal geodesic to through the point ; we call such a family of surfaces a normal variation of surfaces. For such a variation, one has the following:
Theorem (first variation formula): Let be a normal variation of surfaces, so that where is the unit normal vector field to . Then there is a formula:
where is the mean curvature vector field along .
Proof: let denote the image under of the vector fields . Choose co-ordinates so that are conformal parameters on ; this means that and at .
The infinitesimal area form on is which we abbreviate by , and write
Since are the pushforward of coordinate vector fields, they commute; hence , so and therefore
and similarly for . At we have , and so the calculation reduces to
Now, , and so the conclusion follows. qed.
As a corollary, one deduces that a surface is a critical point for area under all smooth compactly supported variations if and only if the mean curvature vanishes identically; such a surface is called minimal.
The second variation formula follows by a similar (though more involved) calculation. The statement is:
Theorem (second variation formula): Let be a normal variation of surfaces, so that . Suppose is minimal. Then there is a formula:
where is the Jacobi operator (also called the stability operator), given by the formula
where is the second fundamental form, and is the metric Laplacian on .
This formula is frankly a bit fiddly to derive (one derivation, with only a few typos, can be found in my Foliations book; a better derivation can be found in the book of Colding-Minicozzi) but it is easy to deduce some significant consequences directly from this formula. The metric Laplacian on a compact surface is negative self-adjoint (being of the form for some operator ), and is obtained from it by adding a th order perturbation, the scalar field . Consequently the biggest eigenspace for is -dimensional, and the eigenvector of largest eigenvalue cannot change sign. Moreover, the spectrum of is discrete (counted with multiplicity), and therefore the index of (thought of as the “Hessian” of the area functional at the critical point ) is finite.
A surface is said to be stable if the index vanishes. Integrating by parts, one obtains the so-called stability inequality for a stable minimal surface :
for any reasonable compactly supported function . If is closed, we can take . Consequently if the Ricci curvature of is positive, admits no stable minimal surfaces at all. In fact, in the case of a surface in a -manifold, the expression is equal to where is the intrinsic curvature of , and is the scalar curvature on . If has positive genus, the integral of is non-negative, by Gauss-Bonnet. Consequently, one obtains the following theorem of Schoen-Yau:
Corollary (Schoen-Yau): Let be a Riemannian -manifold with positive scalar curvature. Then admits no immersed stable minimal surfaces at all.
On the other hand, one knows that every -injective map to a -manifold is homotopic to a stable minimal surface. Consequently one deduces that when is a -manifold with positive scalar curvature, then does not contain a surface subgroup. In fact, the hypothesis that be -injective is excessive: if is merely incompressible, meaning that no essential simple loop in has a null-homotopic image in , then the map is homotopic to a stable minimal surface. The simple loop conjecture says that a map from a -sided surface to a -manifold is incompressible in this sense if and only if it is -injective; but this conjecture is not yet known.
Update 8/26: It is probably worth making a few more remarks about the stability operator.
The first remark is that the three terms , and in have natural geometric interpretations, which give a “heuristic” justification for the second variation formula, which if nothing else, gives a handy way to remember the terms. We describe the meaning of these terms, one by one.
- Suppose , i.e. consider a variation by flowing points at unit speed in the direction of the normals. In directions in which the surface curves “up”, the normal flow is focussing; in directions in which it curves “down”, the normal flow is expanding. The net first order effect is given by , the mean curvature in the direction of the flow. For a minimal surface, , and only the second order effect remains, which is (remember that is the second fundamental form, which measures the infinitesimal deviation of from flatness in ; the mean curvature is the trace of , which is first order. The norm is second order).
- There is also an effect coming from the ambient geometry of . The second order rate at which a parallel family of normals along a geodesic diverge is where is the curvature operator. Taking the average over all geodesics tangent to at a point gives the Ricci curvature in the direction of , i.e. . This is the infinitesimal expansion of area of a geodesic plane under the normal flow, and has second order. The interactions between these terms have higher order, so the net contribution when is .
- Finally, there is the contribution coming from itself. Imagine that is a flat plane in Euclidean space, and let be the graph of . The infinitesimal area element on is . If has compact support, then differentiating twice by , and integrating by parts, one sees that the (leading) second order term is . When is not totally geodesic, and the ambient manifold is not Euclidean space, there is an interaction which has higher order; the leading terms add, and one is left with .
The second remark to make is that if the support of a variation is sufficiently small, then necessarily will be large compared to , and therefore will be positive definite. In other words all variations of a (fixed) minimal surface with sufficiently small support are area increasing — i.e. a minimal surface is locally area minimizing (this is local in the surface itself, not in the “space of all surfaces”). This is a generalization of the important fact that a geodesic in a Riemannian manifold is locally length minimizing (though typically not globally length minimizing).
One final remark is that when is big enough at some point , and when the injectivity radius of at is big enough (depending on bounds on in some neighborhood of ), one can find a variation with support concentrated near that violates the stability inequality. Contrapositively, as observed by Schoen, knowing that a minimal surface in a -manifold is stable gives one a priori control on the size of , depending only on the Ricci curvature of , and the injectivity radius of the surface at the point. Since stability is preserved under passing to covers (for -sided surfaces, by the fact that the largest eigenvalue of can’t change sign!) one only needs a lower bound on the distance from to . In particular, if is a closed stable minimal surface, there is an a priori pointwise bound on . This fact has many important topological applications in -manifold topology. On the other hand, when has boundary, the curvature can be arbitrarily large. The following example is due to Thurston (also see here for a discussion):
Example (Thurston): Let be an ideal simplex in with ideal simplex parameter imaginary and very large. The four vertices of come in two pairs which are very close together (as seen from the center of gravity of the simplex); let be an ideal quadrilateral whose edges join a point in one pair to a point in the other. The simplex is bisected by a “square” of arbitrarily small area; together with four “cusps” (again, of arbitrarily small area) one makes a (topological) disk spanning with area as small as desired. Isotoping this disk rel. boundary to a least area (and therefore stable) representative can only decrease the area further. By the Gauss-Bonnet formula, the curvature of such a disk must get arbitrarily large (and negative) at some point in the interior.
Jeremy Kahn kindly sent me a more detailed overview of his argument with Vlad Markovic, that I blogged earlier about here (also see Jesse Johnson’s blog for other commentary). With his permission, this is reproduced below in its entirety.
Editorial note: I have latexified Jeremy’s email; hence “dhat-mu” becomes , “boundary-hat” becomes , and “boundary-tilde” becomes . I also linkified the link to Caroline Series’ paper.
I was busy with the conference on Thursday and Friday, and taking a break on Saturday, and now I’ve finally had a chance to read your blog, and reply to your message. I decided (especially as Jesse had requested it) to write out a complete outline of the theorem. I’m sending a copy of this message to you, Jesse Johnson, Ian Agol, and Francois Labourie: you are all welcome to reproduce it, as long as it is reproduced in its entirety, and states clearly that this is joint work with Vladimir Markovic. Of course, time and energy permitting, I’ll be happy to answer any questions.
Here is an outline of the argument, working backwards to make it clearer:
1. We want to construct a surface made out of skew pants, each of which has complex half-length close to , and which are joined together so that the complex twist-bends are within of . Using a paper of Caroline
Series (published in the Pacific J. of Mathematics) we show that these surfaces are quasi-isometrically embedded in the universal cover of the three-manifold.
2. Consider the following two conditions on two Borel measures and on a metric space with the same (finite) total measure:
A. For every Borel subset of , is less than or equal to the -measure of an neighborhood of .
B. There is a measure space and functions and such that and are the push-forwards by and respectively of the measure , and the distance in between and is less than for almost every .
It is easy to show that B implies A (also that A is symmetric in and !). In the case where and are discrete and integral measures (the measure of every point is a non-negative integer), we can show that A implies B (and will be a finite set with the counting measure) using Hall’s marriage theorem. In fact, the statement that A implies B for discrete and integral measures is easily shown to be equivalent to Hall’s marriage theorem. I don’t know if A implies B in general because I don’t know how to replace the inductive algorithm for Hall’s marriage theorem with a method that works for a relation between two general measure spaces.
We call and -equivalent if they satisfy condition A, and note that the condition is additively transitive: if is -equivalent to , and is -equivalent to , then and are -equivalent.
3. Suppose that is one boundary component of a pair of skew pants . We can form the common orthogonals in from to each of other other two cuffs. For each common orthogonal, at the point where it meets , we can find a unit normal vector to that points along this common orthogonal. The two resulting normal vectors are related by a translation along the half-length of (the suitable square root of the loxodromic element for ), so we will call them a pair of opposite unit normal vectors (or pounv for short) and they live in the live in the bundle of pounv’s which is conformally equivalent to the complex plane mod the lattice generated by the half-length of and . We give the bundle of pounv’s the Euclidean metric inherited from the complex plane, and also the Lebesgue measure.
4. Given a measure on pants we can produce a measure on the union pounv bundles of the boundary geodesics as follows: if the measure is a unit atom on one pair of skew pants, the resulting measure on pounv bundles is a unit atom on the pounv bundle of each the cuffs, at the pounv described in step 3. We extend to a general measure by linearity. This produces a linear operator we will call the operator.
If we are given a positive integral formal sum of pants (or a multi-set of pants) we can think of it as an integral measure on the space of pants.
5. On the pounv bundle for each closed geodesic we can apply a translation of ; we will call this translation . We can think of as a map from the union of the pounv bundles to itself.
6. Let be an integral measure on pants with cuff half-lengths close to . We can apply the operator described in step 4 to obtain a measure on the union of pounv bundles of all the boundary geodesics; we will call the measure . If and the translation of by are equivalent, then we can take two oriented pants for each pair of pants in our multi-set (taking each of the two possible orientations) and then fit all of these oriented pants into an oriented surface of the type described in step 1. We use Hall’s marriage theorem as described in step 2, and a very small amount of combinatorics.
If the measure , restricted to a given pounv bundle, is equivalent to a rescaling of Lebesgue measure on that torus, then and of are -equivalent, which is what we wanted.
This is as far as I got in the first talk at Utah, so it would be best to stop and take a breath for a moment. We haven’t really done anything, but we’ve reformulated the problem: the type of surface we want has been well-defined, and the problem of finding this surface has been reformulated as finding a measure on pairs of pants that satisfies a given criterion.
7. A two-frame for will comprise a tangent vector and a normal vector both at the same point, unit length and orthogonal. Given a two-frame we can rotate the tangent vector 120 degrees around the normal vector, using the right-hand rule; the orbit of this action is an ordered triple of two-frames, which will call a tripod. We can also rotate 120 degrees in the opposite direction, and obtain an anti-tripod.
8. A connected pair of two-frames is a pair of two frames along with a geodesic segment connecting them. Given and , with large in terms of , we can find a weighting function on connected two-frames such that the following properties hold whenever the weight is non-zero:
A. The length of the connecting segment is within of .
B. If the normal vector of one two-frame is parallel translated along the connecting segment, then it forms an angle of less then with the normal vector of the other two-frame.
C. The angle between the the tangent vector of the two frame and (the tangent vector to) the connecting geodesic segment is exponentially small in .
D. Given a pair of two-frames, the sum of the weights of the connecting geodesic segments is exponentially close (in ) to 1.
E. The weighting is geometrically natural, in that it depends only the length of the connecting segment, the angle between the parallel translated normal vectors, and the angles between the connecting segment and the tangent vectors.
We will describe the (relatively simple) weighting function in the end; we will use the exponential mixing of geodesic flow to obtain property D.
9. Given a tripod and an anti-tripod, we can form three pairs of two-frames by pairing the frames in order, and then we can measures (or weightings) on the connected pairs of two-frames, and then form the product measure (or weighting) by multiplying the weights of the three connections. This gives us a weighting on “connected pairs of tripods” (really a tripod and an anti-tripod) that is supported on connections that satisfy properties A, B, and C.
10. We call a perfect connection between two two-frames a geodesic segment that has a length of , and angle of zero between the segment and the tangent vectors, and translates one normal vector to the other. If a tripod and an anti-tripod were connected by three perfect connection, then they would be a 1-dimensional retract of a flat pair of pants with three cuffs of equal length , where is approximately when is large. If the tripod and anti-tripod are connected by arcs that satisfy properties A and B, then the connected pair of tripods is still a retract of a skew pair of pants, whose cuffs have half-length within (or ) of . Thus there is a map from good connected pairs of tripods to good pairs of pants, which we will denote by .
11. We can let be the measure on connected pairs of tripods, given by integrating the weighting of steps 8 and 9 with respect to the Liouville measure on pairs of tripods (or pairs of two-frames). We then push this measure forward by to obtain a measure on pairs of pants; after finding a rational approximation and clearing denominators, it will be the that was asked for in step 6. We will show that (taking the original irrational ) is -equivalent to a rescaling of Lebesgue measure on each pounv bundle and thereby complete the proof.
12. A partially connected pair of tripods is a pair of tripods where we have connected two out of the three pairs of two-frames. To a partially connected pair of tripods we can assign a single closed geodesic that is homotopic to the concatenation (at both ends) of the two connecting segments. If we connect the third pair of two-frames and apply we obtain a pair of pants , and we can then find a pair of opposite unit normal vectors for gamma pointing to the two cuffs of (as described in step 3). We will describe a method for predicting the pounv for and knowing only the partially connected tripod : First, lift to the solid torus cover of determined by , and then follow geodesic segments from the tangent vectors of the two unconnected two frames of (the lift of) to the ideal boundary of this -cover. We can connect these two points in the boundary by two geodesics, each of which goes about half-way around this solid torus cover. We can then find the common orthogonals from each of these geodesics to (the lift of) , and then obtain two normal vectors to pointing along these common orthogonals; it is easy to verify that these are half-way along from each other (in the complex sense) and hence form a pounv. Property C of the connections between two-frames (and hence tripods) implies that this predicted pounv will be exponentially close (in ) to the actually pounv of any pair of pants .
To summarize: given a good connected pair of tripods, we get a good pair of pants , and taking one cuff gamma of , we get a pounv for as described in step 3. But we only need two out of the three connecting segments to get , and using the third pair of two frames, without even knowing the third connecting segment, we can predict the pounv for and to very high accuracy.
13. We can then define the operator from measures on partially connected pairs of tripods to measures on the pounv bundles for the associated geodesics; this operator is just the linear extension of the operation in step 12. Given a connected pair of tripods, we can get three partially connected pairs of tripods in the obvious way; we can thereby extend to map measures on connected pairs of tripods to measures on the bundles of pounv’s; because the predicted pounv described in step 12 is exponentially close to the actual pounv described in step 3, the two measures and are -equivalent, by the B => A of step 2.
14. For each closed geodesic , we can lift all the partially connected tripods that give to the cover of described in step 12. There is a natural torus action on the normal bundle of , and this extends to an action on all of the solid torus cover associated to . Moreover, it acts on the (lifts of) partially connected tripods, and it does not change the weightings of the two established connecting segments, because of property E of the weighting function.
This is the crucial point: the effective weighting on a partially connected pair of tripods is not just the product of the weights of the two established connections, but that product times the sum of the weights of all possible third connections. By property D of the weighting function, this sum, while not constant, is exponentially close to being constant, so the effective weighting is exponentially close to being invariant under the torus action. Because the predicted pounv for a partially connected pair of tripods is equivariant for the torus action, the measure is exponentially close to a torus invariant measure on the pounv bundle (which is necessary a rescaling of Lebesgue measure), in the sense that the Radon-Nikodym derivative is exponentially close to 1. It is then an easy lemma that the two measures are exponentially close in the sense of step 2. And then we’re finished: is exponentially close to , which is exponentially close to a rescaling of Lebesgue measure, which is what we wanted (with
overkill) in step 6.
15. It remains only to define the weighting function described in step 8, which is surprisingly simple: We take some left-invariant metric on , and hence on the two-frame bundle for and its universal cover. Given a connected pair of two-frames in , we lift to the universal cover, to obtain two two-frames and . We then flow and forward by the frame flow for time to obtain and . We let be the neighborhood of , and be the neighborhood of , with the tangent vector of replaced by its negation. Then the weighting of the connection is the volume of the intersection of with the image of under the frame flow for time .
Properties A, B, and C are not difficult to verify. Property D follows immediately from exponential mixing: If we have and downstairs without any connection, and similarly define , , and , then the sum of the weights of the possible connections will just be the volume of the intersection of the downstairs with the frame flow of . By exponential mixing, this converges at the rate to the square of the volume of an neighborhood, divided by the volume of .
We can normalize the weights by dividing by this constant.
I will try to add comments as they occur to me.
One obvious comment to make is that the argument is remarkably short, and does not depend on any very delicate or complicated analytic estimates (maybe the argument that the glued up surfaces are quasi-geodesic is the most delicate part). It is fair to say that it defies the conventional wisdom in that respect — I was personally very surprised that the general method could be made to work, especially in light of the failure of Bowen’s program. Kudos to Jeremy and Vlad for their boldness and ingenuity.
Another comment to make is that the matching argument is surprisingly robust and general, and I expect it to have many broader applications. One thing I was confused about in my last post seems to be resolved by Jeremy’s sketch above — if I understand it correctly, one first (almost) pairs continuous measures, and only then approximates them by discrete integral measures (with a little bit of combinatorics at the end). And one really does need exponential mixing rather than just mixing.
Incidentally, apropos the matching argument, there are some interesting and well-known variations where things go haywire. For example, papers by Burago-Kleiner and (Curt) McMullen show that there are examples of separated nets in Euclidean space which are not bilipschitz to a lattice (though, interestingly, Curt shows that they are Holder equivalent). No such examples exist in hyperbolic space, because of — nonamenability and Hall’s marriage theorem! Roughly, when trying to match up points in two nets in hyperbolic space, one doesn’t need to look very far because the number of options grows exponentially. This is one reason why Kahn-Markovic need to control the matchings of their measures carefully, because it must be done on a very small scale (where the exponential growth does not kick in).
I thought I would also mention that in case my previous comments lead one to believe otherwise, exponential mixing of the geodesic flow on a hyperbolic manifold is somewhat delicate. Exponential mixing under a flow on a space preserving a probability measure means that for all (sufficiently nice) functions and on , the correlations are bounded in absolute value by an expression of the form for suitable constants (which might depend on the analytic quality of and ). For example, one takes to be the unit tangent bundle of a hyperbolic manifold, and the geodesic flow (i.e. the flow which pushes vectors along the geodesics they are tangent to, at constant speed). Exponential mixing should be contrasted with the much slower mixing of the horocycle flow on a hyperbolic surface, for which the correlation is bounded by an expression like . The geodesic flow on a hyperbolic manifold is an example of what is called an Anosov flow; i.e. the tangent bundle splits equivariantly under the flow into three subbundles where is -dimensional and tangent to the flow, is contracted uniformly exponentially by the flow, and is expanded uniformly exponentially by the flow. The best one knows for (certain) Anosov flows (by Chernov) is that the flow is stretched exponentially mixing, i.e. with an estimate of the form . One knows exponential mixing for the geodesic flow on variable negative curvature surfaces by Dolgopyat, and on certain locally symmetric spaces, using representation theory. See Pollicott’s lecture notes here for more details. I don’t know if exponential mixing for geodesic flows is known on manifolds of variable negative curvature in high dimensions. Also I’d appreciate it if any reader who knows some ergodic theory can confirm/deny/clarify this paragraph . . .
(Update 8/12): Jeremy tells me that he and Vladimir only need “sufficiently high degree polynomial” mixing, so perhaps there is a decent chance the methods can be extended to variable negative curvature.
(Update 10/29): The paper is now available from the arXiv.
I just learned from Jesse Johnson’s blog that Vlad Markovic and Jeremy Kahn have announced a proof of the surface subgroup conjecture, that every complete hyperbolic -manifold contains a closed -injective surface. Equivalently, contains a closed surface subgroup. Apparently, Jeremy made the announcement at an FRG conference in Utah. This answers a long-standing question in -manifold topology, which is a variation on some problems originally posed by Waldhausen. If one further knew that hyperbolic -manifold groups were LERF, one would be able to deduce that all hyperbolic -manifolds are virtually Haken, and (by a recent theorem of Agol), virtually fibered. Dani Wise (and others) have programs to show that hyperbolic -manifold groups are LERF; if successful, this would therefore resolve some of the most important outstanding problems in -manifold topology (in fact, I would say: the most important outstanding problems, by a substantial margin).
In fact, the argument appears to work for hyperbolic manifolds of every dimension , and possibly more generally still. Details on the argument of Markovic-Kahn are scarce (Vlad informs me that they expect to have a preprint in a few weeks) but the sketch of the argument presented by Kahn is compelling. Roughly speaking, the argument (as summarized by Ian Agol in a comment at Jesse’s blog) takes the following form:
- Given , for a sufficiently big constant , one can find “many” immersed, almost totally-geodesic pairs of pants (i.e. thrice-punctured spheres) with geodesic boundary components (i.e. “cuffs”) of length very close to . In fact, one can further insist that the complex length of the boundary geodesic is very close to (i.e. holonomy transport around this geodesic does not rotate the normal bundle very much).
- Conversely, given any geodesic of complex length very close to , one can find many such pairs of pants that it bounds, and moreover one can find them so that the normal to the geodesic pointing in to the surface is prescribed.
- If one takes a sufficiently big collection of such geodesic pairs of pants, one has enough of them in oppositely-aligned pairs along each boundary component, that they can be matched up (by some version of Hall’s marriage theorem), and furthermore, matched up with a definite prescribed “twist” along the boundary components
- One checks that the resulting (closed) surface is sufficiently close to totally geodesic that the ambient negative curvature certifies it is -injective
Many aspects of this argument have a lot in common with some previous attempts on the surface subgroup conjecture, including one recent approach by Bowen (note: Bowen’s approach is known to have some fatal difficulties; the “twist” in 3. above specifically addresses some of them). All of these points deserve some comments.
First, where do the pairs of pants come from? If is a totally geodesic pair of pants with boundary components of length close to , the pants retract onto a geodesic spine, i.e. an immersed totally geodesic theta graph, whose edges all have length close to , and which meet at angles very close to degrees. One can cut this spine up into two pieces, which are obtained by exponentiating the edges of an infinitesimal (almost)-planar tripod for length .
Given a tripod in some plane in the tangent space at some point of , one can exponentiate the edges for length to construct such a half-spine; if and are a pair of tripods for which the exponentiated endpoints nearly match up, with almost opposite tangent vectors, then the resulting half-spines can be glued up to make a spine, and thickened to make a pair of pants. One key idea is to use the exponential mixing property of the geodesic flow on a hyperbolic manifold, e.g. as proved by Pollicott. Given some tolerance , once is sufficiently large, the mixing result shows that the set of such pairs of tripods for which such a matching occurs have a definite density in the space of all pairs (and in fact, are more and more equidistributed in this space, in probability). In fact, one may even insist that two of the pairs of prongs join up to make some specific closed geodesic of length almost , and vary the pair of third prongs a very small amount so that they glue up. This takes care of the first two points; this seems quite uncontroversial (exponential mixing comes in, I suspect, to know that one doesn’t need to wiggle the pair of third prongs much, having paired the first two pairs).
The matching (i.e. the gluing up of opposite pant cuffs) apparently is done by some variant of Hall’s marriage theorem. One needs to know (I think) that for any finite set of cuffs to be glued, the set of other cuffs that they could potentially be glued to is at least as big in cardinality. This probably needs some thought, but it is plausibly true: given a cuff, it can be glued to any cuff which is almost oppositely aligned to it, and since there is some tolerance in the angle of gluing — this is where dimension at least is necessary — and moreover, since oriented cuffs are almost equidistributed, one can always find “more” cuffs that are opposite, up to a bit of tolerance, to any given subset of cuffs (of course, more details are necessary here). There is an extra wrinkle to the argument, which is that the gluing must be done with a “twist” of a definite amount, so that cuffs are not glued up in such a way that the perpendicular geodesic arcs joining pairs of cuffs match up.
(Update 8/8: I think there must necessarily be more details to the matching argument, as very loosely described above. There are at least two additional issues that must be dealt with in order to perform a matching: a parity issue (since each pants has an odd number of cuffs) and a homology issue (if the argument relativizes, so that one fixes some collection of cuffs in advance and glues up everything else, one concludes a posteriori that the union of the unglued cuffs is homologically inessential). Probably the parity issue (and more subtle divisibility issues) can be solved by gluing with real-valued weights, then approximating a real solution by a rational solution, and multiplying through to clear denominators. Maybe the homology issue does not arise, if in fact the argument doesn’t relativize.) Both these issues suggest that one does not specify in advance a collection of pants to be glued up, but rather wants to glue up a definite number of pants from some subset.)
This issue of a twist is important for the 4th point, which is perhaps the most delicate. In order to know that the resulting surface is -injective, one must use geometry. A closed (immersed) surface in a hyperbolic manifold which is (locally) very close to being totally geodesic is -injective. One way to see this is to observe that a geodesic loop in the surface is almost geodesic in the manifold; the ambient negative curvature means that the geodesic can be shrunk (by the negative of the gradient of length in the space of loops) to become geodesic in the ambient manifold; if it is close to being geodesic at the start, it very quickly becomes totally geodesic, without getting much shorter. Any closed geodesic in a hyperbolic manifold is essential.
If one builds a surface by gluing up almost totally geodesic pieces in such a way that there is almost no angle along the gluing, the resulting surface is almost geodesic, and therefore injective. However, one must be very careful to control the geometry of the pieces that are glued, and this is hard to do if the injectivity radius is very small. A geodesic pair of pants has area no matter how long its boundary components are. So if the boundary components have length , then at the points where they are thinnest, they are only across. If cuffs are glued where the pants are thinnest, even if the gluing angle is very small, the surfaces themselves might twist through a big angle in a very short time. So one needs to make sure that the thinnest part of one pants are glued up to a thicker part of the next, which is glued to a thicker part of the next . . . and so on. This is the point of introducing the twist before gluing: the twists accumulate, and before one has glued pieces together, one has entered the thick part of some pants, where the injectivity radius is bounded below by some universal constant.
Anyway, this seems like a really spectacular development, with an excellent chance of working out. Some of the ingredients — e.g. the exponential mixing of the geodesic flow — work just as well in variable negative curvature. In fact, some version of it should work for arbitrary hyperbolic groups (using Mineyev’s flow space). Without knowing more details of the argument, one can’t say how delicate the last part of the argument is, and how far it generalizes (but readers are invited to speculate . . .)