You are currently browsing the tag archive for the ‘minimal surface’ tag.
Hermann Amandus Schwarz (1843-1921) was a student of Kummer and Weierstrass, and made many significant contributions to geometry, especially to the fields of minimal surfaces and complex analysis. His mathematical creations are both highly abstract and flexible, and at the same time intimately tied to explicit and practical calculation.
I learned about Schwarz-Christoffel transformations, Schwarzian derivatives, and Schwarz’s minimal surface as three quite separate mathematical objects, and I was very surprised to discover firstly that they had all been discovered by the same person, and secondly that they form parts of a consistent mathematical narrative, which I will try to explain in this post to the best of my ability. There is an instructive lesson in this example (for me), that we tend to mine the past for nuggets, examples, tricks, formulae etc. while forgetting the points of view and organizing principles that made their discovery possible. Another teachable example is that of Dehn’s “invention” of combinatorial (infinite) group theory, as a natural branch of geometry; several generations of followers went about the task of reformulating Dehn’s insights and ideas in the language of algebra, “generalizing” them and stripping them of their context, before geometric and topological methods were reintroduced by Milnor, Schwarz (a different one this time), Stallings, Thurston, Gromov and others to spectacular effect (note: I have the second-hand impression that the geometric point of view in group theory (and every other subject) was never abandoned in the Soviet Union).
Schwarz’s minimal surface (also called “Schwarz’s D surface”, and sometimes “Schwarz’s H surface”) is an extraordinarily beautiful triply-periodic minimal surface of infinite genus that is properly embedded in . According to Nitsche’s excellent book (p.240), this minimal surface closely resembles the separating wall between inorganic and organic materials in the skeleton of a starfish. The basic building block of the surface can be described as follows. If the vertices of a cube are
-colored, the black vertices are the vertices of a regular tetrahedron. Let
denote the quadrilateral formed by four edges of this tetrahedron; then a fundamental piece
of Schwarz’s surface is a minimal disk spanning
:

The surface may be “analytically continued” by rotating through an angle
around each boundary edge. Six copies of
fit smoothly around each vertex, and the resulting surface extends (triply) periodically throughout space.
The symmetries of enable us to give it several descriptions as a Riemann surface. Firstly, we could think of
as a polygon in the hyperbolic plane with four edges of equal length, and angles
. Twelve copies of
can be assembled to make a hyperbolic surface
of genus
. Thinking of a surface of genus
as the boundary of a genus
handlebody defines a homomorphism from
to
, thought of as
; the cover
associated to the kernel is (conformally) the triply periodic Schwarz surface, and the deck group acts on
as a lattice (of index
in the face-centered cubic lattice).
Another description is as follows. Since the deck group acts by translation, the Gauss map from to
factors through a map
. The map is injective at each point in the interior or on an edge of a copy of
, but has an order
branch point at each vertex. Thus, the map
is a double-branched cover, with one branch point of order
at each vertex of a regular inscribed cube. This leads one to think (like a late 19th century mathematician) of
as the Riemann surface on which a certain multi-valued function on
is single-valued. Under stereographic projection, the vertices of the cube map to the eight points
where
. These eight points are the roots of the polynomial
, so we may think of
as the hyperelliptic Riemann surface defined by the equation
; equivalently, as the surface on which the multi-valued (on
) function
is single-valued.
The function is known as the Weierstrass function associated to
, and an explicit formula for the co-ordinates of the embedding
were found by Enneper and Weierstrass. After picking a basepoint (say
) on the sphere, the coordinates are given by integration:
The integral in each case depends on the path, and lifts to a single-valued function precisely on .
Geometrically, the three coordinate functions are harmonic functions on
. This corresponds to the fact that minimal surfaces are precisely those with vanishing mean curvature, and the fact that the Laplacian of the coordinate functions (in terms of isothermal parameters on the underlying Riemann surface) can be expressed as a nonzero multiple of the mean curvature vector. A harmonic function on a Riemann surface is the real part of a holomorphic function, unique up to a constant; the holomorphic derivative of the (complexified) coordinate functions are therefore well-defined, and give holomorphic
-forms
which descend to
(since the deck group acts by translations). These
-forms satisfy the identity
(this identity expresses the fact that the embedding of
into
via these functions is conformal). The (composition of the) Gauss map (with stereographic projection) can be read off from the
, and as a meromorphic function on
, it is given by the formula
. Define a function
on
by the formula
. Then
are the coordinates of a rational map from
into
which extends to a map into
, by sending each zero of
to
in the
at infinity. Symmetry allows us to identify the image with the hyperelliptic embedding from before, and we deduce that
. Solving for
we obtain the integrands in the formulae above.
In fact, any holomorphic function on a domain in
defines a (typically immersed with branch points) minimal surface, by the integral formulae of Enneper-Weierstrass above. Suppose we want to use this fact to produce an explicit description of a minimal surface bounded by some explicit polygonal loop in
. Any minimal surface so obtained can be continued across the boundary edges by rotation; if the angles at the vertices are all of the form
the resulting surface closes up smoothly around the vertices, and one obtains a compact abstract Riemann surface
tiled by copies of the fundamental region, together with a holonomy representation of
into
. Sometimes the image of this representation in the rotational part of
is finite, and one obtains an infinitely periodic minimal surface as in the case of Schwarz’s surface. A fundamental tile in
can be uniformized as a hyperbolic polygon; equivalently, as a region in the upper half-plane bounded by arcs of semicircles perpendicular to the real axis. Since the edges of the loop are straight lines, the image of this hyperbolic polygon under the Gauss map is a region in
also bounded by arcs of round circles; thus Schwarz’s study of minimal surfaces naturally led him to the problem of how to explicitly describe conformal maps between regions in the plane bounded by circular arcs. This problem is solved by the Schwarz-Christoffel transformation, and its generalizations, with help from the Schwarzian derivative.
Note that if and
are two such regions, then a conformal map from
to
can be factored as the product of a map uniformizing
as the upper half-plane, followed by the inverse of a map uniformizing
as the upper half-plane. So it suffices to find a conformal map when the domain is the upper half plane, decomposed into intervals and rays that are mapped to the edges of a circular polygon
. Near each vertex,
can be moved by a fractional linear transformation
to (part of) a wedge, consisting of complex numbers with argument between
and
, where
is the angle at
. The function
uniformizes the upper half-plane as such a wedge; however it is not clear how to combine the contributions from each vertex, because of the complicated interaction with the fractional linear transformation. The fundamental observation is that there are certain natural holomorphic differential operators which are insensitive to the composition of a holomorphic function with groups of fractional linear transformations, and the uniformizing map can be expressed much more simply in terms of such operators.
For example, two functions that differ by addition of a constant have the same derivative: . Functions that differ by multiplication by a constant have the same logarithmic derivative:
. Putting these two observations together suggest defining the nonlinearity of a function as the composition
. This has the property that
for any constants
. Under inversion
the nonlinearity transforms by
. From this, and a simple calculation, one deduces that the operator
is invariant under inversion, and since it is also invariant under addition and multiplication by constants, it is invariant under the full group of fractional linear transformations. This combination is called the Schwarzian derivative; explicitly, it is given by the formula
. Given the Schwarzian derivative
, one may recover the nonlinearity
by solving the Ricatti equation
. As explained in this post, solutions of the Ricatti equation preserve the projective structure on the line; in this case, it is a complex projective structure on the complex line. Equivalently, different solutions differ by an element of
, acting by fractional linear transformations, as we have just deduced. Once we know the nonlinearity, we can solve for
by
, the usual solution to a first order linear inhomogeneous ODE. The Schwarzian of the function
is
. The advantage of expressing things in these terms is that the Schwarzian of a uniformizing map for a circular polygon
with angles
at the vertices has the form of a rational function, with principal parts
, where the
and the
and
depend (unfortunately in a very complicated way) on the edges of
(for the ugly truth, see Nehari, chapter 5). To see this, observe that the map has an order two pole near finitely many points
(the preimages of the vertices of
under the uniformizing map) but is otherwise holomorphic. Moreover, it can be analytically continued into the lower half plane across the interval between successive
, by reflecting the image across each circular edge. After reflecting twice, the image of
is transformed by a fractional linear transformation, so
has an analytic continuation which is single valued on the entire Riemann sphere, with finitely many isolated poles, and is therefore a rational function! When the edges of the polygon are straight, a simpler formula involving the nonlinearity specializes to the “familiar” Schwarz-Christoffel formula.
(Update 10/22): In fact, I went to the library to refresh myself on the contents of Nehari, chapter 5. The first thing I noticed — which I had forgotten — was that if is the uniformizing map from the upper half-plane to a polygon
with spherical arcs, then
is real-valued on the real axis. Since it is a rational function, this implies that its nonsingular part is actually a constant; i.e.
where is as above, and
are real constants (which satisfy some further conditions — really see Nehari this time for more details).
The other thing that struck me was the first paragraph of the preface, which touches on some of the issues I alluded to above:
In the preface to the first edition of Courant-Hilbert’s “Methoden der mathematischen Physik”, R. Courant warned against a trend discernible in modern mathematics in which he saw a menace to the future development of mathematical analysis. He was referring to the tendency of many workers in this field to lose sight of the roots of mathematical analysis in physical and geometric intuition and to concentrate their efforts on the refinement and the extreme generalization of existing concepts.
Instead of using a word like “menace”, I would rather take this as a lesson about the value of returning to the points of view that led to the creation of the mathematical objects we study every day; which was (to some approximation) the point I was trying to illustrate in this post.
If is a smooth function on a manifold
, and
is a critical point of
, recall that the Hessian
is the quadratic form
on
(in local co-ordinates, the coefficients of the Hessian are the second partial derivatives of
at
). Since
is symmetric, it has a well-defined index, which is the dimension of the subspace of maximal dimension on which
is negative definite. The Hessian does not depend on a choice of metric. One way to see this is to give an alternate definition
where
and
are any two vector fields with given values
and
in
. To see that this does not depend on the choice of
, observe
because of the hypothesis that vanishes at
. This calculation shows that the formula is symmetric in
and
. Furthermore, since
only depends on the value of
at
, the symmetry shows that the result only depends on
and
as claimed. A critical point is nondegenerate if
is nondegenerate as a quadratic form.
In Morse theory, one uses a nondegenerate smooth function (i.e. one with isolated nondegenerate critical points), also called a Morse function, to understand the topology of
: the manifold
has a (smooth) handle decomposition with one
-handle for each critical point of
of index
. In particular, nontrivial homology of
forces any such function
to have critical points (and one can estimate their number of each index from the homology of
). Morse in fact applied his construction not to finite dimensional manifolds, but to the infinite dimensional manifold of smooth loops in some finite dimensional manifold, with arc length as a “Morse” function. Critical “points” of this function are closed geodesics. Any closed manifold has a nontrivial homotopy group in some dimension; this gives rise to nontrivial homology in the loop space. Consequently one obtains the theorem of Lyusternik and Fet:
Theorem: Let be a closed Riemannian manifold. Then
admits at least one closed geodesic.
In higher dimensions, one can study the space of smooth maps from a fixed manifold to a Riemannian manifold
equipped with various functionals (which might depend on extra data, such as a metric or conformal structure on
). One context with many known applications is when
is a Riemannian
-manifold,
is a surface, and one studies the area function on the space of smooth maps from
to
(usually in a fixed homotopy class). Critical points of the area function are called minimal surfaces; the name is in some ways misleading: they are not necessarily even local minima of the area function. That depends on the index of the Hessian of the area function at such a point.
Let be a (compactly supported)
-parameter family of surfaces in a Riemannian
-manifold
, for which
is smoothly immersed. For small
the surfaces
are transverse to the exponentiated normal bundle of
; hence locally we can assume that
takes the form
where
are local co-ordinates on
, and
is contained in the normal geodesic to
through the point
; we call such a family of surfaces a normal variation of surfaces. For such a variation, one has the following:
Theorem (first variation formula): Let be a normal variation of surfaces, so that
where
is the unit normal vector field to
. Then there is a formula:
where is the mean curvature vector field along
.
Proof: let denote the image under
of the vector fields
. Choose co-ordinates so that
are conformal parameters on
; this means that
and
at
.
The infinitesimal area form on is
which we abbreviate by
, and write
Since are the pushforward of coordinate vector fields, they commute; hence
, so
and therefore
and similarly for . At
we have
,
and
so the calculation reduces to
Now, , and
so the conclusion follows. qed.
As a corollary, one deduces that a surface is a critical point for area under all smooth compactly supported variations if and only if the mean curvature vanishes identically; such a surface is called minimal.
The second variation formula follows by a similar (though more involved) calculation. The statement is:
Theorem (second variation formula): Let be a normal variation of surfaces, so that
. Suppose
is minimal. Then there is a formula:
where is the Jacobi operator (also called the stability operator), given by the formula
where is the second fundamental form, and
is the metric Laplacian on
.
This formula is frankly a bit fiddly to derive (one derivation, with only a few typos, can be found in my Foliations book; a better derivation can be found in the book of Colding-Minicozzi) but it is easy to deduce some significant consequences directly from this formula. The metric Laplacian on a compact surface is negative self-adjoint (being of the form for some operator
), and
is obtained from it by adding a
th order perturbation, the scalar field
. Consequently the biggest eigenspace for
is
-dimensional, and the eigenvector of largest eigenvalue cannot change sign. Moreover, the spectrum of
is discrete (counted with multiplicity), and therefore the index of
(thought of as the “Hessian” of the area functional at the critical point
) is finite.
A surface is said to be stable if the index vanishes. Integrating by parts, one obtains the so-called stability inequality for a stable minimal surface :
for any reasonable compactly supported function . If
is closed, we can take
. Consequently if the Ricci curvature of
is positive,
admits no stable minimal surfaces at all. In fact, in the case of a surface in a
-manifold, the expression
is equal to
where
is the intrinsic curvature of
, and
is the scalar curvature on
. If
has positive genus, the integral of
is non-negative, by Gauss-Bonnet. Consequently, one obtains the following theorem of Schoen-Yau:
Corollary (Schoen-Yau): Let be a Riemannian
-manifold with positive scalar curvature. Then
admits no immersed stable minimal surfaces at all.
On the other hand, one knows that every -injective map
to a
-manifold is homotopic to a stable minimal surface. Consequently one deduces that when
is a
-manifold with positive scalar curvature, then
does not contain a surface subgroup. In fact, the hypothesis that
be
-injective is excessive: if
is merely incompressible, meaning that no essential simple loop in
has a null-homotopic image in
, then the map is homotopic to a stable minimal surface. The simple loop conjecture says that a map
from a
-sided surface to a
-manifold is incompressible in this sense if and only if it is
-injective; but this conjecture is not yet known.
Update 8/26: It is probably worth making a few more remarks about the stability operator.
The first remark is that the three terms ,
and
in
have natural geometric interpretations, which give a “heuristic” justification for the second variation formula, which if nothing else, gives a handy way to remember the terms. We describe the meaning of these terms, one by one.
- Suppose
, i.e. consider a variation by flowing points at unit speed in the direction of the normals. In directions in which the surface curves “up”, the normal flow is focussing; in directions in which it curves “down”, the normal flow is expanding. The net first order effect is given by
, the mean curvature in the direction of the flow. For a minimal surface,
, and only the second order effect remains, which is
(remember that
is the second fundamental form, which measures the infinitesimal deviation of
from flatness in
; the mean curvature is the trace of
, which is first order. The norm
is second order).
- There is also an effect coming from the ambient geometry of
. The second order rate at which a parallel family of normals
along a geodesic
diverge is
where
is the curvature operator. Taking the average over all geodesics
tangent to
at a point gives the Ricci curvature in the direction of
, i.e.
. This is the infinitesimal expansion of area of a geodesic plane under the normal flow, and has second order. The interactions between these terms have higher order, so the net contribution when
is
.
- Finally, there is the contribution coming from
itself. Imagine that
is a flat plane in Euclidean space, and let
be the graph of
. The infinitesimal area element on
is
. If
has compact support, then differentiating twice by
, and integrating by parts, one sees that the (leading) second order term is
. When
is not totally geodesic, and the ambient manifold is not Euclidean space, there is an interaction which has higher order; the leading terms add, and one is left with
.
The second remark to make is that if the support of a variation is sufficiently small, then necessarily
will be large compared to
, and therefore
will be positive definite. In other words all variations of a (fixed) minimal surface with sufficiently small support are area increasing — i.e. a minimal surface is locally area minimizing (this is local in the surface itself, not in the “space of all surfaces”). This is a generalization of the important fact that a geodesic in a Riemannian manifold is locally length minimizing (though typically not globally length minimizing).
One final remark is that when is big enough at some point
, and when the injectivity radius of
at
is big enough (depending on bounds on
in some neighborhood of
), one can find a variation with support concentrated near
that violates the stability inequality. Contrapositively, as observed by Schoen, knowing that a minimal surface in a
-manifold
is stable gives one a priori control on the size of
, depending only on the Ricci curvature of
, and the injectivity radius of the surface at the point. Since stability is preserved under passing to covers (for
-sided surfaces, by the fact that the largest eigenvalue of
can’t change sign!) one only needs a lower bound on the distance from
to
. In particular, if
is a closed stable minimal surface, there is an a priori pointwise bound on
. This fact has many important topological applications in
-manifold topology. On the other hand, when
has boundary, the curvature can be arbitrarily large. The following example is due to Thurston (also see here for a discussion):
Example (Thurston): Let be an ideal simplex in
with ideal simplex parameter imaginary and very large. The four vertices of
come in two pairs which are very close together (as seen from the center of gravity of the simplex); let
be an ideal quadrilateral whose edges join a point in one pair to a point in the other. The simplex
is bisected by a “square” of arbitrarily small area; together with four “cusps” (again, of arbitrarily small area) one makes a (topological) disk spanning
with area as small as desired. Isotoping this disk rel. boundary to a least area (and therefore stable) representative can only decrease the area further. By the Gauss-Bonnet formula, the curvature of such a disk must get arbitrarily large (and negative) at some point in the interior.
I recently made the final edits to my paper “Positivity of the universal pairing in 3 dimensions”, written jointly with Mike Freedman and Kevin Walker, to appear in Jour. AMS. This paper is inspired by questions that arise in the theory of unitary TQFT’s. An -dimensional TQFT (“topological quantum field theory”) is a functor
from the category of smooth oriented
-manifolds and smooth cobordisms between them, to the category of (usually complex) vector spaces and linear maps, that obeys the (so-called) monoidal axiom
. The monoidal axiom implies that
. Roughly speaking, the functor associates to a “spacelike slice” — i.e. to each
-manifold
— the vector space of “quantum states” on
(whatever they are), denoted
. A cobordism stands in for the physical idea of the universe and its quantum state evolving in time. An
-manifold
bounding
can be thought of as a cobordism from the empty manifold to
, so
is a linear map from
to
, or equivalently, a vector in
(the image of
).
Note that as defined above, a TQFT is sensitive not just to the underlying topology of a manifold, but to its smooth structure. One can define variants of TQFTs by requiring more or less structure on the underlying manifolds and cobordisms. One can also consider “decorated” cobordism categories, such as those whose objects are pairs where
is a manifold and
is a submanifold of some fixed codimension (usually
) and whose morphisms are pairs of cobordisms
(e.g. Wilson loops in a
-dimensional TQFT).
In realistic physical theories, the space of quantum states is a Hilbert space — i.e. it is equipped with a nondegenerate inner product. In particular, the result of pairing a vector with itself should be positive. One says that a TQFT with this property is unitary. In the TQFT, reversing the orientation of a manifold interchanges a vector space with its dual, and pairing is accomplished by gluing diffeomorphic manifolds with opposite orientations. It is interesting to note that many -dimensional TQFTs of interest to mathematicians are not unitary; e.g. Donaldson theory, Heegaard Floer homology, etc. These theories depend on a grading, which prevents attempts to unitarize them. It turns out that there is a good reason why this is true, discussed below.
Definition: For any -manifold
, let
denote the complex vector space spanned by the set of
-manifolds bounding
, up to a diffeomorphism fixed on
. There is a pairing on this vector space — the universal pairing — taking values in the complex vector space
spanned by the set of closed
-manifolds up to diffeomorphism. If
and
are two vectors in
, the pairing of these two vectors is equal to the formal sum
where overline is complex conjugation on numbers, and orientation-reversal on manifolds, and
denotes the closed manifold obtained by gluing
to
along
.
The point of making this definition is the following. If is a vector with the property that
(i.e. the result of pairing
with itself is zero), then
for any unitary TQFT
. One says that the universal pairing is positive in
dimensions if every nonzero vector
pairs nontrivially with itself.
Example: The Mazur manifold is a smooth
-manifold with boundary
. There is an involution
of
that does not extend over
, so
denote distinct elements of
. Let
, their formal difference. Then the result of pairing
with itself has four terms:
. It turns out that all four terms are diffeomorphic to
, and therefore this formal sum is zero even though
is not zero, and the universal pairing is not positive in dimension
.
More generally, it turns out that unitary TQFTs cannot distinguish -cobordant
-manifolds, and therefore they are insensitive to essentially all “interesting” smooth
-manifold topology! This “explains” why interesting
-dimensional TQFTs, such as Donaldson theory and Heegaard Floer homology (mentioned above) are necessarily not unitary.
One sees that cancellation arises, and a pairing may fail to be positive, if there are some unusual “coincidences” in the set of terms arising in the pairing. One way to ensure that cancellation does not occur is to control the coefficients for the terms appearing in some fixed diffeomorphism type. Observe that the “diagonal” coefficients
are all positive real numbers, and therefore cancellation can only occur if every manifold appearing as a diagonal term is diffeomorphic to some manifold appearing as an off-diagonal term. The way to ensure that this does not occur is to define some sort of ordering or complexity on terms in such a way that the term of greatest complexity can occur only on the diagonal. This property — diagonal dominance — can be expressed in the following way:
Definition: A pairing as above satisfies the topological Cauchy-Schwarz inequality if there is a complexity function
defined on all closed
-manifolds, so that if
are any two
-manifolds with boundary
, there is an inequality
with equality if and only if
.
The existence of such a complexity function ensures diagonal dominance, and therefore the positivity of the pairing .
Example: Define a complexity function on closed
-manifolds, by defining
to be equal to the number of components of
. This complexity function satisfies the topological Cauchy-Schwarz inequality, and proves positivity for the universal pairing in
dimension.
Example: A suitable complexity function can also be found in dimensions. The first term in the complexity is number of components. The second is a lexicographic list of the Euler characteristics of the resulting pieces (i.e. the complexity favors more components of bigger Euler characteristic). The first term is maximized if and only if the pieces of
and
are all glued up in pairs with the same number of boundary components in
; the second term is then maximized if and only if each piece of
is glued to a piece of
with the same Euler characteristic and number of boundary components — i.e. if and only if
.
Positivity holds in dimensions below , and fails in dimensions above
. The main theorem we prove in our paper is that positivity holds in dimension
, and we do this by constructing an explicit complexity function which satisfies the topological Cauchy-Schwarz inequality.
Unfortunately, the function itself is extremely complicated. At a first pass, it is a tuple where
treats number of components,
treats the kernel of
under inclusion,
treats the essential
-spheres, and
treats prime factors arising in the decomposition.
The term is itself very interesting: for each finite group
Witten and Dijkgraaf constructed a real unitary TQFT
(i.e. one for which the resulting vector spaces are real), so that roughly speaking
is the vector space spanned by representations of
into
up to conjugacy, and
is the vector that counts (in a suitable sense) the number of ways each such representation extends over
. The value of
on a closed manifold is roughly just the number of representations of the fundamental group in
, up to conjugacy. The complexity
is obtained by first enumerating all isomorphism classes of finite groups
and then listing the values of
in order. If the kernel of
is different from the kernel of
, this difference can be detected by some finite group (this fact depends on the fact that
-manifold groups are residually finite, proved in this context by Hempel); so
is diagonal dominant unless these two kernels are equal; equivalently, if the maximal compression bodies of
in
and
are diffeomorphic rel.
. It is essential to control these compression bodies before counting essential
-spheres, so this term must come before
in the complexity.
The term has a contribution
from each prime summand. The complexity
itself is a tuple
where
treats Seifert-fibered pieces,
treats hyperbolic pieces, and
treats the way in which these are assembled in the JSJ decomposition. The term
is quite interesting; evaluated on a finite volume hyperbolic
-manifold
it gives as output the tuple
where
denotes hyperbolic volume, and
is the geodesic length spectrum, or at least those terms in the spectrum with zero imaginary part. The choice of the first term depends on the following theorem:
Theorem: Let be an orientable surface of finite type so that each component has negative Euler characteristic, and let
be irreducible, atoroidal and acylindrical, with boundary
. Then
admit unique complete hyperbolic structures, and either
or else
and
is totally geodesic in
.
This theorem is probably the most technically difficult part of the paper. Notice that even though in the end we are only interested in closed manifolds, we must prove this theorem for hyperbolic manifolds with cusps, since these are the pieces that arise in the JSJ decomposition. This theorem was proved for closed manifolds by Agol-Storm-Thurston, and our proof follows their argument in general terms, although there are more technical difficulties in the cusped case. One starts with the hyperbolic manifold , and finds a least area representative of the surface
. Cut along this surface, and double (metrically) to get two singular metrics on the topological manifolds
and
. The theorem will be proved if we can show the volume of this singular metric is bigger than the volume of the hyperbolic metric. Such comparison theorems for volume are widely studied in geometry; in many circumstances one defines a geometric invariant of a Riemannian metric, and then shows that it is minimized/maximized on a locally symmetric metric (which is usually unique in dimensions
). For example, Besson-Courtois-Gallot famously proved that a negatively curved locally symmetric metric on a manifold uniquely minimizes the volume entropy over all metrics with fixed volume (roughly, the entropy of the geodesic flow, at least when the curvature is negative).
Hamilton proved that if one rescales Ricci flow to have constant volume, then scalar curvature satisfies
where
denotes the traceless Ricci tensor, and
denotes the spatial average of the scalar curvature
. If the spatial minimum of
is negative, then at a point achieving the minimum,
is non-negative, as are the other two terms; in other words, if one does Ricci flow rescaled to have constant volume, the minimum of scalar curvature increases (this fact remains true for noncompact manifolds, if one substitutes infimum for maximum). Conversely, if one rescales to keep the infimum of scalar curvature constant, volume decreases under flow. In
dimensions, Perelman shows that Ricci flow with surgery converges to the hyperbolic metric. Surgery at finite times occurs when scalar curvature blows up to positive infinity, so surgery does not affect the infimum of scalar curvature, and only makes volume smaller (since things are being cut out). Consequently, Perelman’s work implies that of all metrics on a hyperbolic
-manifold with the infimum of scalar curvature equal to
, the constant curvature metric is the unique metric minimizing volume.
Now, the metric on obtained by doubling along a minimal surface is not smooth, so one cannot even define the curvature tensor. However, if one interprets scalar curvature as an “average” of Ricci curvature, and observes that a minimal surface is flat “on average”, then one should expect that the distributional scalar curvature of the metric is equal to what it would be if one doubled along a totally geodesic surface, i.e. identically equal to
. So Perelman’s inequality should apply, and prove the desired volume estimate.
To make this argument rigorous, one must show that the singular metric evolves under Ricci flow, and instantaneously becomes smooth, with . A theorem of Miles Simon says that this follows if one can find a smooth background metric with uniform bounds on the curvature and its first derivatives, and which is
-bilipschitz to the singular metric. The existence of such a background metric is essentially trivial in the closed case, but becomes much more delicate in the cusped case. Basically, one needs to establish the following comparison lemma, stated somewhat informally:
Lemma: Least area surfaces in cusps of hyperbolic -manifolds become asymptotically flat faster than the thickness of the cusp goes to zero.
In other words, if one lifts a least area surface to a surface
in the universal cover, there is a (unique) totally geodesic surface
(the “osculating plane”) asymptotic to
at the fixed point of the parabolic element corresponding to the cusp, and satisfying the following geometric estimate. If
is the horoball centered at the parabolic fixed point at height
(for some horofunction), then the Hausdorff distance between
and
is
. One must further prove that if a surface
has multiple ends in a single cusp, these ends osculate distinct geodesic planes. Given this, it is not too hard to construct a suitable background metric. Between ends of
, the geometry looks more and more like a slab wedged between two totally geodesic planes. The double of this is a nonsingular hyperbolic manifold, so it certainly enjoys uniform control on the curvature and its first derivatives; this gives the background metric in the thin part. In the thick part, one can convolve the singular metric with a bump function to find a bilipschitz background metric; compactness of the thick part implies trivially that any smooth metric enjoys uniform bounds on the curvature and its first derivatives. Hence one may apply Simon, and then Perelman, and the volume estimate is proved.
The Seifert fibered case is very fiddly, but ultimately does not require many new ideas. The assembly complexity turns out to be surprisingly involved. Essentially, one thinks of the JSJ decomposition as defining a decorated graph, whose vertices correspond to the pieces in the decomposition, and whose edges control the gluing along tori. One must prove an analogue of the topological Cauchy-Schwarz inequality in the context of (decorated) graphs. This ends up looking much more like the familiar TQFT picture of tensor networks, but a more detailed discussion will have to wait for another post.
In a previous post, I discussed some methods for showing that a given group contains a (nonabelian) free subgroup. The methods were analytic and/or dynamical, and phrased in terms of the existence (or nonexistence) of certain functions on or on spaces derived from
, or in terms of actions of
on certain spaces. Dually, one can try to find a free group in
by finding a homomorphism
and looking for circumstances under which
is injective.
For concreteness, let for some (given) space
. If
is a free group, a representation
up to conjugation determines a homotopy class of map
where
is a
. The most natural
‘s to consider are graphs and surfaces (with boundary). It is generally not easy to tell whether a map of a graph or a surface to a topological space is
-injective at the topological level, but might be easier if one can use some geometry.
Example: Let be a complete Riemannian manifold with sectional curvature bounded above by some negative constant
. Convexity of the distance function in a negatively curved space means that given any map of a graph
one can flow
by the negative gradient of total length until it undergoes some topology change (e.g. some edge shrinks to zero length) or it (asymptotically) achieves a local minimum (the adjective “asymptotically” here just means that the flow takes infinite time to reach the minimum, because the size of the gradient is small when the map is almost minimum; there are no analytic difficulties to overcome when taking the limit). A typical topological change might be some loop shrinking to a point, thereby certifying that a free summand of
mapped trivially to
and should have been discarded. Technically, one probably wants to choose
to be a trivalent graph, and when some interior edge collapses (so that four points come together) to let the
-valent vertex resolve itself into a pair of
-valent vertices in whichever of the three combinatorial possibilities is locally most efficient. The limiting graph, if nonempty, will be trivalent, with geodesic edges, and vertices at which the three edges are all (tangentially) coplanar and meet at angles of
. Such a graph can be certified as
-injective provided the edges are sufficiently long (depending on the curvature
). After rescaling the metric on
so that the supremum of the curvatures is
, a trivalent geodesic graph with angles
at the vertices and edges at least
is
-injective. To see this, lift to maps between universal covers, i.e. consider an equivariant map from a tree
to
. Let
be an embedded arc in
, and consider the image in
. Using Toponogov’s theorem, one can compare with a piecewise isometric map from
to
. The worst case is when all the edges are contained in a single
, and all corners “bend” the same way. Providing the image does not bend as much as a horocircle, the endpoints of the image of
stay far away in
. An infinite sided convex polygon in
with all edges of length
and all angles
osculates a horocycle, so we are done.
Remark: The fundamental group of a negatively curved manifold is word-hyperbolic, and therefore contains many nonabelian free groups, which may be certified by pingpong applied to the action of the group on its Gromov boundary. The point of the previous example is therefore to certify that a certain subgroup is free in terms of local geometric data, rather than global dynamical data (so to speak). Incidentally, I would not swear to the correctness of the constants above.
Example: A given free group is the fundamental group of a surface with boundary in many different ways (this difference is one of the reasons that a group like is so much more complicated than the mapping class group of a surface). Pick a realization
. Then a homomorphism
up to conjugacy determines a homotopy class of map from
to
as above. If
is negatively curved as before, each boundary loop is homotopic to a unique geodesic, and we may try to find a “good” map
with boundary on these geodesics. There are many possible classes of good maps to consider:
- Fix a conformal structure on
and pick a harmonic map in the homotopy class of
. Such a map exists since the target is nonpositively curved, by the famous theorem of Eells-Sampson. The image is real analytic if
is, and is at least as negatively curved as the target, and therefore there is an a priori upper bound on the intrinsic curvature of the image; if the supremum of the curvature on
is normalized to be
, then the image surface is
, which just means that pointwise it is at least as negatively curved as hyperbolic space. By Gauss-Bonnet, one obtains an a priori bound on the area of the image of
in terms of the Euler characteristic (which just depends on the rank of
). On the other hand, this map depends on a choice of marked conformal structure on
, and the space of such structures is noncompact.
- Vary over all conformal structures on
and choose a harmonic map of least energy (if one exists) or find a sequence of maps that undergo a “neck pinch” as a sequence of conformal structures on
degenerates. Such a neck pinch exhibits a simple curve in
that is essential in
but whose image is inessential in
; such a curve can be compressed, and the topology of
simplified. Since each compression increases
, after finitely many steps the process terminates, and one obtains the desired map. This is Schoen-Yau‘s method to construct a stable minimal surface representative of
. When the target is
-dimensional, the surface may be assumed to be unbranched, by a trick due to Osserman.
- Following Thurston, pick an ideal triangulation of
(i.e. a geodesic lamination of
whose complementary regions are all ideal triangles); since
has boundary, we may choose such a lamination by first picking a triangulation (in the ordinary sense) with all vertices on
and then “spinning” the vertices to infinity. Unless
factors through a cyclic group, there is some choice of lamination so that the image of
can be straightened along the lamination, and then the image spanned with
ideal triangles to produce a pleated surface in
representing
(note: if
has constant negative curvature, these ideal triangles can be taken to be totally geodesic). The space of pleated surfaces in fixed (closed)
of given genus is compact, so this is a reasonable class of maps to work with.
- If
is merely a hyperbolic group, one can still construct pleated surfaces, not quite in
, but equivariantly in Mineyev’s flow space associated to
. Here we are not really thinking of the triangles themselves, but the geodesic laminations they bound (which carry the same information).
- If
is complete and
-dimensional but noncompact, the space of pleated surfaces of given genus is generally not compact, and it is not always easy to find a pleated surface where you want it. This can sometimes be remedied by shrinkwrapping; one looks for a minimal/pleated/harmonic surface subject to the constraint that it cannot pass through some prescribed set of geodesics in
(which act as “barriers” or “obstacles”, and force the resulting surface to end up roughly where one wants it to).
Anyway, one way or another, one can usually find a map of a surface, or a space of maps of surfaces, representing a given homomorphism, with some kind of a priori control of the geometry. Usually, this control is not enough to certify that a given map is -injective, but sometimes it might be. For instance, a totally geodesic (immersed) surface in a complete manifold of constant negative curvature is always
-injective, and any surface whose extrinsic curvature is small enough will also be
-injective.
Geometric methods to certify injectivity of free or surface groups are very useful and flexible, as far as they go. Unfortunately, I know of very few topological methods to certify injectivity. By far the most important exception is the following:
Example: In -dimensions, one should look for properly embedded surfaces. If
is a
-manifold (possibly with boundary), and
is a two-sided properly embedded surface, the famous Dehn’s Lemma (proved by Papakyriakopoulos) implies that either
is
-injective, or there is an embedded essential loop in
that bounds an embedded disk in
on one side of
. Such a loop may be compressed (i.e.
may be cut open along the loop, and two copies of the compressing disk sewn in) preserving the property of embeddedness, but increasing
. After finitely many steps, either
compresses away entirely, or one obtains a
-injective surface. One way to ensure that
does not compress away entirely is to start with a surface that is essential in (relative) homology; another way is to look for a surface dual to an action (of
) on a tree. In the latter case, one can often construct quite different free subgroups in
by pingpong on the ends of the tree. Note by the way that this method produces closed surface subgroups as well as free subgroups. Note too that two-sidedness is essential to apply Dehn’s Lemma.
Remark: Modern -manifold topologists are sometimes unreasonably indifferent to the power of Dehn’s Lemma (probably because this tool has been incorporated so fully into their subconscious?); it is worth reading Ralph Fox’s review of Papakyriakopoulos’s paper (linked above). Of this paper, he writes:
. . . it has already led to renewed attack on the problem of classifying the 3-dimensional manifolds; significant results have been and are being obtained. A complete solution has suddenly become a definite possibility.
Remember this was written more than 50 years ago — before the geometrization conjecture, before the JSJ decomposition, before the Scott core theorem, before Haken manifolds. The only reasonable reaction to this is: !!!
Example: The construction of injective surfaces by Dehn’s Lemma may be abstracted in the following way. Given a target space , and a class of maps
of surfaces into
(in some category; e.g. homotopy classes of maps, pleated surfaces,
surfaces, etc.) suppose one can find a complexity
with values in some ordered set, such that if
is not injective, one can find
of smaller complexity. Then if
is well-ordered, an injective surface may be found. If
is not well-ordered, one may ask at least that
is upper semi-continuous on
, and hope to extend it upper semi-continuously to some suitable compactification of
. Even if
is not well-ordered, one can at least certify that a map is injective, by showing that it minimizes
. Here are some potential examples (none of them entirely satisfactory).
- Given a (homologically trivial) homotopy class of loop
in
, one can look at all maps of orientable surfaces
to
with boundary factoring through
. For such a surface, let
denote the degree with which the (possibly multiple) boundary (components) of
wrap homologically around
, and let
denote the sum of Euler characteristics of non-disk and non-sphere components of
. For each surface
, one considers the quantity
(the factor of
can be ignored if desired). The important feature of this quantity is that it does not change if
is replaced by a finite cover. If
is not injective, let
be an essential loop on
whose image in
is inessential. Peter Scott showed that any essential loop on a surface lifts to an embedded loop in some finite cover. Hence, after passing to such a cover,
may be compressed, and the resulting surface
satisfies
. In other words, a global minimizer of this quantity is injective. Such a surface is called extremal. The problem is that extremal surfaces do not always exist; but this construction motivates one to look for them.
- Given a
surface
with geodesic boundary in
, one can retract
to a geodesic spine, and encode the surface by the resulting fatgraph, with edges labelled by homotopy classes in
. Since Euler characteristic is local, one does not really care precisely how the pieces of the fatgraph are assembled, but only how many pieces of what kinds are needed for a given boundary. So if only finitely many such pieces appear in some infinite family of surfaces, one can in fact construct an extremal surface as above, which is necessarily injective (more technically, one reduces the computation of Euler characteristic to a linear programming problem, finds a rational extremal solution (which corresponds to a weighted sum of pieces of fatgraph), and glues together the pieces to construct the extremal surface; one situation in which this scheme can be made to work is explained in this paper of mine). Edges can be subdivided into a finite number of possibilities, so one just needs to ensure finiteness of the number of vertex types. One condition that ensures finiteness of vertex types is the existence of a uniform constant
so that for each surface
in the given family, and for each point
, there is an estimate
. If this condition is violated, one finds pairs
which converge in the geometric topology to a point in a complete (i.e. without boundary, but probably noncompact) surface.
- Given
, either compress an embedded essential loop, or realize
by a least area surface. If
is not injective, pass to a cover, compress a loop, and realize the result by a least area surface. Repeat this process. One obtains in this way a sequence of least area surfaces in
(typically of bigger and bigger genus) and there is no reason to expect the process to terminate. If
is a
-manifold, the curvature of a least area surface admits two-sided curvature bounds away from the boundary, by a theorem of Schoen (near the boundary, the negative curvature might blow up, but only in controlled ways — e.g. after rescaling about a sequence of points with the most negative curvature, one may obtain in the limit a helicoid). Away from the boundary, the family of surfaces one obtains vary precompactly in the
topology, and one may obtain a complete locally least area lamination
in the limit. If
is not injective, one can continue to pass to covers (applying a version of Scott’s theorem for infinite surfaces) and compress, and by transfinite induction, eventually arrive at a locally least area lamination with injective
. Of course, such a limit might well be a lamination by planes. However, the lamination one obtains is not completely arbitrary: since it is a limit of limits of . . . compact surfaces, one can choose a limit that admits a nontrivial invariant transverse measure (one must be careful here, since the lamination will typically have boundary). Or, as in bullet 2. above, one may insist that this limit lamination is complete (i.e. without boundary).
It is more tricky to find a limit lamination as in 3. without boundary and admitting an invariant transverse measure; in any case, this motivates the following:
Question: Is there a closed hyperbolic -manifold
which admits a locally least area transversely measured complete immersed lamination
, all of whose leaves are disks? (note that the answer is negative if one asks for the lamination to be embedded (there are several easy proofs of this fact)).
Secretly, the function that assigns to a homologically trivial loop
is the stable commutator length of the conjugacy class in
represented by
. Extremal surfaces can sometimes be certified by constructing certain functions on
called homogeneous quasimorphisms, but a discussion of such functions will have to wait for another post.

Recent Comments