I was at UC Riverside this past weekend, attending the regional meeting, and giving a talk in a special session on knot theory in memory of the late Xiao-Song Lin. After lunch, I joined in a conversation between Rob Kirby and Mike Freedman on the recent flurry of activity this summer, in which Selman Akbulut and Bob Gompf independently showed that the Cappell-Shaneson manifolds — smooth 4-manifolds known since Freedman’s work to be homeomorphic to S^4 — are in fact diffeomorphic to the standard smooth S^4 (actually, Cappell-Shaneson’s manifolds have the additional feature that they admit a free \mathbb{Z}/2\mathbb{Z} action, giving rise to fake \mathbb{RP}^4’s, which was actually their original interest).

Apparently these constructions had somewhat altered the experts’s (i.e. Freedman and Kirby) feelings about whether the smooth 4-dimensional Poincaré conjecture is likely to be true. The Cappell-Shaneson manifolds are constructed by doing surgery on certain torus bundles over a circle — those with monodromy chosen so that the resulting torus bundles have the homology of a S^1 \times S^3. A suitable surgery, killing the S^1 factor makes the manifold into homology S^4’s, and also kills the subgroup of the fundamental group normally generated by the S^1 factor. On the other hand, everything else in the fundamental group “comes from” the torus, whose fundamental group is abelian, and therefore the resulting manifold is simply-connected. Since it is a homology 4-sphere, is it therefore a homotopy 4-sphere, and consequently (by Freedman), a topological 4-sphere.

Gompf shows these 4-spheres are standard by showing that a certain move — which simplifies the monodromy of the T^3 fiber — can be realized by a diffeomorphism. The move is an example of what is known to 4-manifold topologists as a “log transform” (and to 3-manifold topologists as “Dehn surgery times S^1”). A log transform takes as input a smooth embedded torus. A tubular neighborhood of this torus is a product T^2 \times D^2 whose boundary is a 3-torus T^3. This tubular neighborhood is removed, and reglued by an automorphism of the T^3 factor. Usually a log transform will change the topology of the manifold, or at least the smooth structure. But in this case, the surgered torus is contained in a T^3 fiber, and the log transform can be shown to be isotopic to the identity, by using the monodromy of the fibration (technically, the monodromy of the fibration produces a once-punctured torus in the T^3 bundle with boundary on the curve along which the log transform “twists”, but after doing surgery to produce the homology S^4’s, this once-punctured torus is “capped” to become a smooth disk).

The point of this blog post is to show how to construct many, many other smooth 4-manifolds which are topological 4-spheres, and for which Gompf’s method of showing they are standard does not work. Are these manifolds counterexamples to the smooth 4-dimensional Poincaré conjecture? I am really not the person to ask.

The construction takes as input a fibered knot — i.e. a knot K in the 3-sphere S^3 whose complement fibers over a circle. In other words, there is a fibration S \to S^3 - K \to S^1, where S is a (minimal genus) Seifert surface for the knot K. The fibration of spaces gives rise to a short exact sequence of fundamental groups (in general, one gets a long exact sequence of homotopy groups, but the spaces S, S^3-K,S^1 are all K(\pi,1)’s — i.e. their homotopy groups in dimension other than 1 all vanish). Since S has boundary, the fundamental group of S is free and finitely generated of rank 2g, where g is the genus. The fundamental group of S^1 is \mathbb{Z}. So one exhibits \pi_1(S^3 - K) as an HNN extension of a free group, where the meridian m acts by conjugation on the free group \pi_1(S) by some automorphism \phi:\pi_1(S) \to \pi_1(S).

Since K is a knot in S^3, the homology of S^3-K is equal to \mathbb{Z} in dimension 1. Moreover, since putting K back in recovers S^3, it follows that the fundamental group \pi_1(S^3-K) is normally generated by the meridian (which also generates the \mathbb{Z} in H_1). For the moment everything is 3-dimensional, but there is a trick to promote this to 4 dimensions. In place of the surface S, consider the 3-manifold M_{2g} = \#_{i=1}^{2g} S^2 \times S^1. In other words, M_{2g} is obtained by doubling a handlebody of genus 2g. The fundamental group \pi_1(M_{2g}) is free of rank 2g. Now one builds a M bundle over S^1 with monodromy \phi; call this 4-manifold W_\phi. The existence of such a manifold depends on being able to realize any automorphism of a free group by a homeomorphism of a doubled handlebody; one way to see this is to observe that \text{Aut}(F) is generated by Nielsen moves — interchanging generators, replacing generators by their inverses, and replacing generators x,y by xy, y. These moves are all realizable by homeomorphisms of doubled handlebodies, the last by a “handle slide”.

Now, observe that \pi_1(W_\phi) = \pi_1(S^3 - K), and is normally generated by a loop \gamma \in W_\phi representing the circle direction. Moreover, H_1(W_\phi) = H_1(S^3-K) = \mathbb{Z}. To compute H_2, observe that H_2(M) = (H_1(M))^* by Poincaré duality. If the action of \phi on H_1(M) (a free abelian group of rank 2g) is represented by a matrix A, then the action on H_2(M) is represented by the transpose A^T. The fact that H_1(W_\phi)=\mathbb{Z} is equivalent to the fact that the first homology of M dies in the bundle; i.e. \det(A - \text{Id})=\pm 1; hence \det(A^T - \text{Id}) = \pm 1, and for the same reason, the second homology of M dies in the bundle, and H_2(W_\phi)=0. By (4-dimensional) Poincaré duality, H_3(W_\phi) = \mathbb{Z}, and we see that W_\phi is a homology S^1 \times S^3.

A tubular neighborhood of the loop \gamma is a product S^1 \times D^3, since W_\phi is orientable. The boundary of this is S^1 \times S^2. So we drill out \gamma and glue in a product D^2 \times S^2 to produce W_\phi'. Drilling out \gamma does not affect the fundamental group, by Seifert van-Kampen, and the fact that the inclusion S^1 \times S^2 \to S^1 \times D^3 is an isomorphism on \pi_1. On the other hand, filling in a D^2 \times S^2 has the effect of killing the meridian m, and therefore (by the discussion above), killing \pi_1 completely; i.e. W_\phi' is simply-connected. Hence H_1(W_\phi') = H_3(W_\phi')=0. Drilling out a circle and gluing back a sphere increases Euler characteristic by 2; since the rank of H_1 and H_3 have both gone down by 1, it follows that the rank of H_2(W_\phi') is still 0, and since the fundamental group is trivial, H_2(W_\phi')=0. So W_\phi' is a smooth, simply-connected homology sphere, which is to say, a smooth 4-manifold which is topologically S^4.

Back in June, Freedman-Gompf-Morrison-Walker described a way to use Rasmussen’s s-invariant to detect exotic S^4’s, and proposed trying this invariant out on the Cappell-Shaneson examples (see Scott Morrison’s post about that here). Is it feasible to compute the invariants on these new examples?

(Update:) Mike Freedman pointed out in an email that these spheres are all standard, and give rise to interesting open-book decompositions of S^4 with binding a knot \times S^1 and fiber S \times S^1 where S is the fiber of the fibration of S^3 - K. This is related to the familiar fact that 1-twisted spun knots are unknotted (see e.g. Rolfsen, p. 339) — an interesting exercise in visualization. (Updated update:) there are some interesting variations on this construction which give other S^4’s that are not obviously standard (at least not in the same obvious way). If I can work out some good examples, I’ll try to describe them.

The “header image” for this blog is an example of an interesting construction in 2-dimensional conformal geometry, due to Richard Kenyon, that I learned of some time ago; I thought it might be fun to try to explain where it comes from.

The example comes from the idea of a Riemann surface lamination. This is an object that geometrizes some ideas in 1-dimensional complex analysis. The basic idea is simple: given a noncompact infinite Riemannian 2-manifold \Sigma, one gives it a new topology by declaring that two points on the surface are “close” in the new topology if there are balls of big radius in the surface centered at the two points which are “almost isometric”. Points that were close in the old topology are close in the new topology, but points that might have been far away in the old topology can become close in the new. For example, if \Sigma is a covering space of some other Riemannian surface S, then points in the orbit of the deck group are “infinitely close” in the new topology. This means that the resulting topological space is not Hausdorff; one “Hausdorffifies” by identifying pairs of points that are not contained in disjoint open sets, and the quotient recovers the surface S (assuming that the metric on S is sufficiently generic; otherwise, it recovers S modulo its group of isometries). Morally what one is doing is mapping \Sigma into the space \mathcal{M} of pointed locally compact metric spaces (which is itself a locally compact topological space), and giving it the subspace topology. In more detail, a point in \mathcal{M} is a pair (X,p) where X is a locally compact metric space, and p \in X is a point. A sequence X_i,p_i converges to X,p if there are metric balls B_i around p_i of diameter going to infinity, metric balls D_i around p also of diameter going to infinity, and isometric inclusions of B_i,D_i into metric spaces Z_i in such a way that the Hausdorff distance between the images of B_i and D_i in Z_i goes to zero as i \to \infty. Any locally compact metric space Y has a tautological map to \mathcal{M}, where each point y \in Y is sent to the point (Y,y) \in \mathcal{M}. Gromov showed (see section 6 of this paper) that the space \mathcal{M} itself is locally compact; in fact, this follows in an obvious way from the Arzela-Ascoli theorem.

If \Sigma has bounded geometry — i.e. if the injectivity radius is uniformly bounded below, and the curvature is bounded above and below — then the image of \Sigma in \mathcal{M} is precompact, and its closure is a compact metric space \mathcal{L}. The path components of \mathcal{L} are exactly the Riemann surfaces which are arbitrarily well approximated (in the metric sense) on every compact subset by compact subsets of \Sigma. If you were wandering around on such a component \Sigma', and you wandered over a compact region, and were only able to measure the geometry up to some (arbitrarily fine) definite precision, you could never rule out the possibility that you were actually wandering around on \Sigma. Topologically, \mathcal{L} is a Riemann surface lamination; i.e. a locally compact topological space covered by open charts of the form U \times X where U is an open two-dimensional disk, where X is totally disconnected, and where the transition between charts preserves the decomposition into pieces U \times \text{point}, and is smooth (in fact, preserves the Riemann surface structure) on the U \times \text{point} slices, in the overlaps. The unions of “surface” slices — i.e. the path components of \mathcal{L} — piece together to make the leaves of the lamination, which are (complete) Riemann surfaces. In our case, the leaves have Riemannian metrics, which vary continuously in the direction transverse to the leaves. (Surface) laminations occur in other areas of mathematics, for example as inverse limits of sequences of finite covers of a fixed compact surface, or as objects obtained by inductively splitting open sheets in a branched surface (the latter can easily occur as attractors of certain kinds of partially hyperbolic dynamical systems). One well-known example is sometimes called the (punctured) solenoid; its Teichmüller theory is studied by Penner and Šarić  (question: does anyone know how to do a “\acute c” in wordpress? update 11/6: thanks Ian for the unicode hint).

A lamination is said to be minimal if every leaf is dense. In our context this means that for every compact region K in \Sigma and every \epsilon>0 there is a T so that every ball in \Sigma of radius T contains a subset K' which is \epsilon-close to K in the Gromov-Hausdorff metric. In other words, every “local feature” of \Sigma that appears somewhere, appears with definite density to within any desired degree of accuracy. Consequently, such features will “almost” appear, with the same definite density, in every other leaf \Sigma' of \mathcal{L}, and therefore \Sigma is in the closure of each \Sigma'. Since \mathcal{L} is (in) the closure of \Sigma, this implies that every leaf is dense, as claimed.

In a Riemann surface lamination, the conformal type of every leaf is well-defined. If some leaf is elliptic, then necessarily that leaf is a sphere. So if the lamination is minimal, it is equal to a single closed surface. If every leaf is hyperbolic, then each leaf admits a unique hyperbolic metric in its conformal class (i.e. each leaf can be uniformized), and Candel showed that this family of hyperbolic metrics varies continuously in \mathcal{L}. Étienne Ghys asked whether there is an example of a minimal Riemann surface lamination in which some leaves are conformally parabolic, and others are conformally hyperbolic. It turns out that the answer to this question is yes; Richard Kenyon found an example, which I will now describe.

The lamination in question has exactly one hyperbolic leaf, which is topologically a 4-times punctured sphere. Every other leaf is an infinite cylinder — i.e. it is conformally the punctured plane \mathbb{C}^*. Since the lamination is minimal, to describe the lamination, one just needs to describe one leaf. This leaf will be obtained as the boundary of a thickened neighborhood of an infinite planar graph, which is defined inductively, as follows.

Let T_1 be the planar “Greek cross” as in the following figure:

Kenyon_1

Inductively, if we have defined T_n, define T_{n+1} by attaching four copies of T_n to the extremities of T_1. The first few examples T_1,\cdots,T_4 are illustrated in the following figure:

Kenyon_2

The limit T_\infty is a planar tree with exactly four ends; the boundary of a thickened tubular neighborhood is conformally equivalent to a sphere with four points removed, which is hyperbolic. Every unbounded sequence of points p_i in T_\infty has a subsequence which escapes out one of the ends. Hence every other leaf in the lamination \mathcal{L} this defines has exactly two ends, and is conformally equivalent to a punctured plane, which is parabolic.

The header image is a very similar construction in 3-dimensional space, where the initial seed has six legs along the coordinate axes instead of four; some (quite large) approximation was then rendered in povray.

When I was in graduate school, I was very interested in the (complex) geometry of Riemann surface laminations, and wanted to understand their deformation theory, perhaps with the aim of using structures like taut foliations and essential laminations to hyperbolize 3-manifolds, as an intermediate step in an approach to the geometrization conjecture (now a theorem of Perelman). I know that at one point Sullivan was quite interested in such objects, as a tool in the study of Julia sets of rational functions. I have the impression that they are not studied so much these days, but I would be happy to be corrected.

Martin Bridgeman gave a nice talk at Caltech recently on his discovery of a beautiful identity concerning orthospectra of hyperbolic surfaces (and manifolds of higher dimension) with totally geodesic boundary. The 2-dimensional case is (in my opinion) the most beautiful, and I would like to take a post to explain the identity, and give a derivation which is slightly different from the one Martin gives in his paper. There are many other things one could say about this identity, and its relation to other identities that turn up in the theory of hyperbolic manifolds (and elsewhere); I hope to get to this in a later post.

Let \Sigma be a hyperbolic surface with totally geodesic boundary. An orthogeodesic is a geodesic segment properly immersed in \Sigma, which is perpendicular to \partial \Sigma at its endpoints. The set of orthogeodesics is countable, and their lengths are proper. Denote these lengths by l_i (with multiplicity). The identity is:

\sum_i \mathcal{L}(1/\cosh^2{l_i/2}) = -\pi^2\chi(\Sigma)/2

where \mathcal{L} is the Rogers’ dilogarithm function (to be defined in a minute). Treating this function as a black box for the moment, the identity has the form \sum_i L(l_i) = a term depending only on the topology of \Sigma. The proof is very, very short and elegant. By the Gauss-Bonnet theorem, the term on the right is equal to 1/8 of the volume of the unit tangent bundle of \Sigma. Almost every tangent vector on \Sigma can be exponentiated to a geodesic on \Sigma which intersects the boundary in finite forward and backward time (eg. by ergodicity of the geodesic flow on a closed hyperbolic surface obtained by doubling). If v is such a tangent vector, and \gamma_v is the associated geodesic arc, then \gamma_v is homotopic keeping endpoints on \partial \Sigma to a unique orthogeodesic (which is the unique length minimizer in this relative homotopy class). The volume of the set of v associated to a given orthogeodesic \alpha can be computed as follows. Lift \alpha to the universal cover, where it is the crossbar of a letter “H” whose vertical lines are lifts of the geodesics it ends on. Any \gamma_v lifts to a unique geodesic segment in the universal cover with endpoints on the edges of the H. So the volume of the set of such v depends only on \text{length}(\alpha), giving rise to the explicit formula for L. qed.

That’s it — that’s the whole proof! . . . modulo some calculations, which we now discuss.

The “ordinary” polylogarithms \text{Li}_k are defined by Taylor series

\text{Li}_k(z) = \sum_{n=1}^\infty \frac {z^n} {n^k}

which converges for |z|<1, and extends by analytic continuation. Taking derivatives, one sees that they satisfy \text{Li}_k'(z) = \text{Li}_{k-1}(z)/z, thereby giving rising to integral formulae. \text{Li}_0(z) is the familiar geometric series z/(1-z), so \text{Li}_1(z) = -\log(1-z) and

\text{Li}_2(z) = -\int \frac {\log(1-z)} {z} dz

The Rogers dilogarithm is then given by the formula \mathcal{L}(z) = \text{Li}_2(z) + \frac 1 2 \log(|z|)\log(1-z) for real z<1. One sees that the Rogers dilogarithm is obtained by symmetrizing the integrand for the integral expression for \text{Li}_2 under the involution z \to 1-z:

\mathcal{L}'(z) = -\frac {1}{2} \left(\frac {\log(1-z)}{z} + \frac {\log(z)}{1-z} \right)

Martin derives his identity by direct calculation, but in fact this calculation can be simplified a bit by some hyperbolic geometry. Consider an ideal quadrilateral Q (whose unit tangent bundle has area 4\pi^2) with one pair of opposite sides that are distance l apart. Join opposite vertices in pairs to decompose the quadrilateral into four triangles, each with one non-ideal point:

circles_figure_2

In the (schematic) picture, suppose the two edges of the H are the left and right side (call them L and R) and the other two edges are U and D. Similarly, call the four triangles T_L, T_R, T_U, T_D depending on which edge of the quadrilateral they bound. The triangle T_R is colored gray in the figure. We secretly identify this figure with the upper half-plane, in such a way that the ideal vertices are (in circular order) 0,x,1,\infty, where \infty,0 are the ideal vertices of the gray triangle. Call \alpha the (hyperbolic) angle of the gray triangle at its vertex, so x = (1+\cos(\alpha))/2. Moreover, it turns out that x = 1/\cosh^2(l/2) where l is the distance between L and R. We will compute L implicitly as a function of x, and show that it is a multiple of the Rogers dilogarithm function, thus verifying Bridgeman’s identity.

Every vector v in Q exponentiates to a (bi-infinite) geodesic \gamma_v, and we want to compute the volume of the set of vectors v for which the corresponding geodesic intersects both L and R. The point of the decomposition is that for v in T_L (say), the geodesic \gamma_v intersects L whenever it intersects R, so we only need to compute the volume of the v in T_L for which \gamma_v intersects R. Similarly, we only need to compute the volume of the v in T_R for which \gamma_v intersects L. For v in T_U, we compute the volume of the v which do not intersect U (since these are exactly the ones that intersect both L and R), and similarly for T_D.

These volumes can be expressed in terms of integrals of harmonic functions. Let \chi_L denote the harmonic function on the disk which is 1 on the arc of the circle bounded by L, and 0 on the rest of the circle. This function at each point is equal to 1/2\pi times the visual angle (i.e. the length in the unit tangent circle) subtended by the given arc of the circle, as seen from the given point in the hyperbolic plane. Define \chi_R,\chi_U,\chi_D similarly. Then the total volume we need to compute is equal to

4\pi \left( (\int_{T_L} 2\chi_R) + (\int_{T_U} 1 - 2\chi_U) \right)

(here we have identified \int_{T_L} \chi_R = \int_{T_R} \chi_L by symmetry, and similarly for the other pair of terms). Let us approach this a bit more systematically. If \alpha denotes the angle at the nonideal vertex of triangle T_R, we denote \int_{T_R} \chi_R = A(\alpha), \int_{T_R} \chi_U = B(\alpha) and \int_{T_R} \chi_L = C(\alpha). The integral we want to evaluate can be expressed easily in terms of explicit rational multiples of \pi, and the function A,B,C. These functions satisfy obvious identities:

C(\alpha) = \int_{T_R} 1 - A(\alpha) - 2B(\alpha) = \pi-\alpha - A(\alpha) - 2B(\alpha)

and

A(\alpha) + B(\pi - \alpha) = \pi/3

where the last identity comes by observing that we are integrating a certain function over an ideal triangle, and observing that the average of this function under the symmetries of the ideal triangle is equal to the constant function 1/3. In particular, we see that we can express everything in terms of A. After some elementary reorganization, we see that the contribution V(\alpha) to the volume of the unit tangent bundle of the surface associated to this particular orthogeodesic is

V(\alpha) = \pi^2(8 - 16/3) - 4\pi\alpha - 8\pi(A(\alpha) - A(\pi - \alpha))

To compute A(\alpha), it makes sense to move to the upper half-space model, and move the endpoints of the interval to 0 and \infty. The harmonic function is equal to 1 on the negative real axis, and 0 on the positive real axis. It takes the value \theta/\pi on the line \text{arg}(z) = \theta. The area form in the hyperbolic metric is proportional to the Euclidean area form, with constant 1/\text{Im}(z)^2. In other words, we want to integrate \text{arg}(z)/\pi\text{Im}(z)^2 over the region indicated in the figure, where the nonideal angle is \alpha, and the base point is 0:

circles_figure

If we normalize so that the circular arc is part of the semicircle from 0 to 1, then the real projection of the vertical lines in the figure are 0 and x. There is no elementary way to evaluate this integral, so instead we evaluate its derivative as a function of x where as before, x = (1+\cos(\alpha))/2. This is the definite integral

A'(x) = \int_{y = \sqrt{x-x^2}}^\infty (\tan^{-1}(y/x)/\pi y^2) dy

Integrating by parts gives (\alpha/\pi\sin{\alpha}) + 1/\pi \int_{y = \sqrt{x-x^2}}^\infty xdy/y(y^2+x^2). This evaluates to

A'(x) = (\alpha/\pi\sin{\alpha}) - 1/\pi ( \log(1-x)/2x)

Thinking of V(\alpha) as a function of x, we get

V'(x) = -4\pi d\alpha/dx - 8\pi(A'(x) + A'(1-x)) = 8\mathcal{L}'(x)

Comparing values at x=0 we see that V=8\mathcal{L} and the identity is proved.

Well, OK, this is not terribly simple, but a posteriori it gives a way to express the Rogers dilogarithm as a sum of integrals of very simple harmonic functions over hyperbolic triangles, which is a nice geometric way to think of it.

(Update 10/30): This paper by Dupont and Sah relates Rogers dilogarithm to volumes of \text{SL}(2,\mathbb{R})-simplices, and discusses some interesting connections to conformal field theory and lattice model calculations. I feel like a bit of a dope, since I read this paper while I was in graduate school more than a dozen years ago, but forgot all about it until I was cleaning out my filing cabinet this morning. They cite an older paper of Dupont for the explicit calculations; these are somewhat tedious and unenlightening; however, he does manage to show that the Rogers dilogarithm is characterized by the Abel identity. In other words,

Lemma A.1 (Dupont): Let f:(0,1) \to \mathbb{R} be a three times differentiable function satisfying

f(s_1) - f(s_2) + f(\frac{s_2}{s_1}) - f(\frac{1-s_1^{-1}}{1-s_2^{-1}}) + f(\frac{1-s_1}{1-s_2})=0

for all 0 < s_2 < s_1 < 1. Then there is a real constant \kappa such that f(x) = \kappa L(x) where L(x) is the Rogers dilogarithm (up to an additive constant).

Nevertheless, they don’t seem to have noticed the formula in terms of integrals of harmonic functions over hyperbolic triangles. Perhaps this is also well-known. Do any readers know?

Hermann Amandus Schwarz (1843-1921) was a student of Kummer and Weierstrass, and made many significant contributions to geometry, especially to the fields of minimal surfaces and complex analysis. His mathematical creations are both highly abstract and flexible, and at the same time intimately tied to explicit and practical calculation.

I learned about Schwarz-Christoffel transformations, Schwarzian derivatives, and Schwarz’s minimal surface as three quite separate mathematical objects, and I was very surprised to discover firstly that they had all been discovered by the same person, and secondly that they form parts of a consistent mathematical narrative, which I will try to explain in this post to the best of my ability. There is an instructive lesson in this example (for me), that we tend to mine the past for nuggets, examples, tricks, formulae etc. while forgetting the points of view and organizing principles that made their discovery possible. Another teachable example is that of Dehn’s “invention” of combinatorial (infinite) group theory, as a natural branch of geometry; several generations of followers went about the task of reformulating Dehn’s insights and ideas in the language of algebra, “generalizing” them and stripping them of their context, before geometric and topological methods were reintroduced by Milnor, Schwarz (a different one this time), Stallings, Thurston, Gromov and others to spectacular effect (note: I have the second-hand impression that the geometric point of view in group theory (and every other subject) was never abandoned in the Soviet Union).

Schwarz’s minimal surface (also called “Schwarz’s D surface”, and sometimes “Schwarz’s H surface”) is an extraordinarily beautiful triply-periodic minimal surface of infinite genus that is properly embedded in \mathbb{R}^3. According to Nitsche’s excellent book (p.240), this minimal surface closely resembles the separating wall between inorganic and organic materials in the skeleton of a starfish. The basic building block of the surface can be described as follows. If the vertices of a cube are 2-colored, the black vertices are the vertices of a regular tetrahedron. Let Q denote the quadrilateral formed by four edges of this tetrahedron; then a fundamental piece S of Schwarz’s surface is a minimal disk spanning Q:

schwarz_piece

The surface may be “analytically continued” by rotating Q through an angle \pi around each boundary edge. Six copies of Q fit smoothly around each vertex, and the resulting surface extends (triply) periodically throughout space.

The symmetries of Q enable us to give it several descriptions as a Riemann surface. Firstly, we could think of Q as a polygon in the hyperbolic plane with four edges of equal length, and angles \pi/3. Twelve copies of Q can be assembled to make a hyperbolic surface \Sigma of genus 3. Thinking of a surface of genus 3 as the boundary of a genus 3 handlebody defines a homomorphism from \pi_1(\Sigma) to \mathbb{Z}^3, thought of as H_1(\text{handlebody}); the cover \widetilde{\Sigma} associated to the kernel is (conformally) the triply periodic Schwarz surface, and the deck group acts on \mathbb{R}^3 as a lattice (of index 2 in the face-centered cubic lattice).

Another description is as follows. Since the deck group acts by translation, the Gauss map from \widetilde{\Sigma} to S^2 factors through a map \Sigma \to S^2. The map is injective at each point in the interior or on an edge of a copy of Q, but has an order 2 branch point at each vertex. Thus, the map \Sigma \to S^2 is a double-branched cover, with one branch point of order 2 at each vertex of a regular inscribed cube. This leads one to think (like a late 19th century mathematician) of \Sigma as the Riemann surface on which a certain multi-valued function on S^2 = \mathbb{C} \cup \infty is single-valued. Under stereographic projection, the vertices of the cube map to the eight points \lbrace \alpha,i\alpha,-\alpha,-i\alpha,1/\alpha,i/\alpha,-1/\alpha,-i/\alpha \rbrace where \alpha = (\sqrt{3}-1)/\sqrt{2}. These eight points are the roots of the polynomial w^8 - 14w^4 + 1, so we may think of \Sigma as the hyperelliptic Riemann surface defined by the equation v^2 = w^8 - 14w^4 + 1; equivalently, as the surface on which the multi-valued (on \mathbb{C} \cup \infty) function R(w):= 1/v=1/\sqrt{w^8 - 14w^4 + 1} is single-valued.

The function R(w) is known as the Weierstrass function associated to \Sigma, and an explicit formula for the co-ordinates of the embedding \widetilde{\Sigma} \to \mathbb{R}^3 were found by Enneper and Weierstrass. After picking a basepoint (say 0) on the sphere, the coordinates are given by integration:

x = \text{Re} \int_0^{w_0} \frac{1}{2}(1-w^2)R(w)dw

y = \text{Re} \int_0^{w_0} \frac{i}{2}(1+w^2)R(w)dw

z = \text{Re} \int_0^{w_0} wR(w)dw

The integral in each case depends on the path, and lifts to a single-valued function precisely on \widetilde{\Sigma}.

Geometrically, the three coordinate functions x,y,z are harmonic functions on \widetilde{\Sigma}. This corresponds to the fact that minimal surfaces are precisely those with vanishing mean curvature, and the fact that the Laplacian of the coordinate functions (in terms of isothermal parameters on the underlying Riemann surface) can be expressed as a nonzero multiple of the mean curvature vector. A harmonic function on a Riemann surface is the real part of a holomorphic function, unique up to a constant; the holomorphic derivative of the (complexified) coordinate functions are therefore well-defined, and give holomorphic 1-forms \phi_1,\phi_2,\phi_3 which descend to \Sigma (since the deck group acts by translations). These 1-forms satisfy the identity \sum_i \phi_i^2 = 0 (this identity expresses the fact that the embedding of \widetilde{\Sigma} into \mathbb{R}^3 via these functions is conformal). The (composition of the) Gauss map (with stereographic projection) can be read off from the \phi_i, and as a meromorphic function on \Sigma, it is given by the formula w = \phi_3/(\phi_1 - i\phi_2). Define a function f on \Sigma by the formula fdw = \phi_1 - i\phi_2. Then 1/f,w are the coordinates of a rational map from \Sigma into \mathbb{C}^2 which extends to a map into \mathbb{CP}^2, by sending each zero of f to wf = \phi_3/dw in the \mathbb{CP}^1 at infinity. Symmetry allows us to identify the image with the hyperelliptic embedding from before, and we deduce that f=R(w). Solving for \phi_1,\phi_2 we obtain the integrands in the formulae above.

In fact, any holomorphic function R(w) on a domain in \mathbb{C} defines a (typically immersed with branch points) minimal surface, by the integral formulae of Enneper-Weierstrass above. Suppose we want to use this fact to produce an explicit description of a minimal surface bounded by some explicit polygonal loop in \mathbb{R}^3. Any minimal surface so obtained can be continued across the boundary edges by rotation; if the angles at the vertices are all of the form \pi/n the resulting surface closes up smoothly around the vertices, and one obtains a compact abstract Riemann surface \Sigma tiled by copies of the fundamental region, together with a holonomy representation of \pi_1(\Sigma) into \text{Isom}^+(\mathbb{R}^3). Sometimes the image of this representation in the rotational part of \text{Isom}^+(\mathbb{R}^3) is finite, and one obtains an infinitely periodic minimal surface as in the case of Schwarz’s surface. A fundamental tile in \Sigma can be uniformized as a hyperbolic polygon; equivalently, as a region in the upper half-plane bounded by arcs of semicircles perpendicular to the real axis. Since the edges of the loop are straight lines, the image of this hyperbolic polygon under the Gauss map is a region in \mathbb{R}^3 also bounded by arcs of round circles; thus Schwarz’s study of minimal surfaces naturally led him to the problem of how to explicitly describe conformal maps between regions in the plane bounded by circular arcs. This problem is solved by the Schwarz-Christoffel transformation, and its generalizations, with help from the Schwarzian derivative.

Note that if P and Q are two such regions, then a conformal map from P to Q can be factored as the product of a map uniformizing P as the upper half-plane, followed by the inverse of a map uniformizing Q as the upper half-plane. So it suffices to find a conformal map when the domain is the upper half plane, decomposed into intervals and rays that are mapped to the edges of a circular polygon Q. Near each vertex, Q can be moved by a fractional linear transformation z \to (az+b)/(cz+d) to (part of) a wedge, consisting of complex numbers with argument between 0 and \alpha, where \alpha is the angle at Q. The function f(z) = z^{\alpha/\pi} uniformizes the upper half-plane as such a wedge; however it is not clear how to combine the contributions from each vertex, because of the complicated interaction with the fractional linear transformation. The fundamental observation is that there are certain natural holomorphic differential operators which are insensitive to the composition of a holomorphic function with groups of fractional linear transformations, and the uniformizing map can be expressed much more simply in terms of such operators.

For example, two functions that differ by addition of a constant have the same derivative: f' = (f+c)'. Functions that differ by multiplication by a constant have the same logarithmic derivative: (\log(f))' = (\log(cf))'. Putting these two observations together suggest defining the nonlinearity of a function as the composition N(f):= (\log(f'))' = f''/f'. This has the property that N(af+b) = N(f) for any constants a,b. Under inversion z \to 1/z the nonlinearity transforms by N(1/f) = N(f) - 2f'/f. From this, and a simple calculation, one deduces that the operator N' - N^2/2 is invariant under inversion, and since it is also invariant under addition and multiplication by constants, it is invariant under the full group of fractional linear transformations. This combination is called the Schwarzian derivative; explicitly, it is given by the formula S(f) = f'''/f' - 3/2(f''/f')^2. Given the Schwarzian derivative S(f), one may recover the nonlinearity N(f) by solving the Ricatti equation N' - N^2/2 - S = 0. As explained in this post, solutions of the Ricatti equation preserve the projective structure on the line; in this case, it is a complex projective structure on the complex line. Equivalently, different solutions differ by an element of \text{PSL}(2,\mathbb{C}), acting by fractional linear transformations, as we have just deduced. Once we know the nonlinearity, we can solve for f by f = \int e^{\int N}, the usual solution to a first order linear inhomogeneous ODE. The Schwarzian of the function z^{\alpha/\pi} is (1-\alpha^2/\pi^2)/2z^2. The advantage of expressing things in these terms is that the Schwarzian of a uniformizing map for a circular polygon Q with angles \alpha_i at the vertices has the form of a rational function, with principal parts a_i/(z-z_i)^2 + b_i/(z-z_i), where the a_i = (1-\alpha_i^2/\pi^2)/2 and the b_i and z_i depend (unfortunately in a very complicated way) on the edges of Q (for the ugly truth, see Nehari, chapter 5). To see this, observe that the map has an order two pole near finitely many points z_i (the preimages of the vertices of Q under the uniformizing map) but is otherwise holomorphic. Moreover, it can be analytically continued into the lower half plane across the interval between successive z_i, by reflecting the image across each circular edge. After reflecting twice, the image of Q is transformed by a fractional linear transformation, so S(f) has an analytic continuation which is single valued on the entire Riemann sphere, with finitely many isolated poles, and is therefore a rational function! When the edges of the polygon are straight, a simpler formula involving the nonlinearity specializes to the “familiar” Schwarz-Christoffel formula.

(Update 10/22): In fact, I went to the library to refresh myself on the contents of Nehari, chapter 5. The first thing I noticed — which I had forgotten — was that if f is the uniformizing map from the upper half-plane to a polygon Q with spherical arcs, then S(f) is real-valued on the real axis. Since it is a rational function, this implies that its nonsingular part is actually a constant; i.e.

S(f) = \sum _i a_i/(z-z_i)^2 + b_i/(z-z_i) + c

where a_i is as above, and z_i,b_i,c are real constants (which satisfy some further conditions — really see Nehari this time for more details).

The other thing that struck me was the first paragraph of the preface, which touches on some of the issues I alluded to above:

In the preface to the first edition of Courant-Hilbert’s “Methoden der mathematischen Physik”, R. Courant warned against a trend discernible in modern mathematics in which he saw a menace to the future development of mathematical analysis. He was referring to the tendency of many workers in this field to lose sight of the roots of mathematical analysis in physical and geometric intuition and to concentrate their efforts on the refinement and the extreme generalization of existing concepts.

Instead of using a word like “menace”, I would rather take this as a lesson about the value of returning to the points of view that led to the creation of the mathematical objects we study every day; which was (to some approximation) the point I was trying to illustrate in this post.

Last week, Michael Brandenbursky from the Technion gave a talk at Caltech on an interesting connection between knot theory and quasimorphisms. Michael’s paper on this subject may be obtained from the arXiv. Recall that given a group G, a quasimorphism is a function \phi:G \to \mathbb{R} for which there is some least real number D(\phi) \ge 0 (called the defect) such that for all pairs of elements g,h \in G there is an inequality |\phi(gh) - \phi(g) - \phi(h)| \le D(\phi). Bounded functions are quasimorphisms, although in an uninteresting way, so one is usually only interested in quasimorphisms up to the equivalence relation that \phi \sim \psi if the difference |\phi - \psi| is bounded. It turns out that each equivalence class of quasimorphism contains a unique representative which has the extra property that \phi(g^n) = n\phi(g) for all g\in G and n \in \mathbb{Z}. Such quasimorphisms are said to be homogeneous. Any quasimorphism may be homogenized by defining \overline{\phi}(g) = \lim_{n \to \infty} \phi(g^n)/n (see e.g. this post for more about quasimorphisms, and their relation to stable commutator length).

Many groups that do not admit many homomorphisms to \mathbb{R} nevertheless admit rich families of homogeneous quasimorphisms. For example, groups that act weakly properly discontinuously on word-hyperbolic spaces admit infinite dimensional families of homogeneous quasimorphisms; see e.g. Bestvina-Fujiwara. This includes hyperbolic groups, but also mapping class groups and braid groups, which act on the complex of curves.

Michael discussed another source of quasimorphisms on braid groups, those coming from knot theory. Let I be a knot invariant. Then one can extend I to an invariant of pure braids on n strands by I(\alpha) = I(\widehat{\alpha \Delta}) where \Delta = \sigma_1 \cdots \sigma_{n-1}, and the “hat” denotes plat closure. It is an interesting question to ask: under what conditions on I is the resulting function on braid groups a quasimorphism?

In the abstract, such a question is probably very hard to answer, so one should narrow the question by concentrating on knot invariants of a certain kind. Since one wants the resulting invariants to have some relation to the algebraic structure of braid groups, it is natural to look for functions which factor through certain algebraic structures on knots; Michael was interested in certain homomorphisms from the knot concordance group to \mathbb{R}. We briefly describe this group, and a natural class of homomorphisms.

Two oriented knots K_1,K_2 in the 3-sphere are said to be concordant if there is a (locally flat) properly embedded annulus A in S^3 \times [0,1] with A \cap S^3 \times 0 = K_1 and A \cap S^3 \times 1 = K_2. Concordance is an equivalence relation, and the equivalence classes form a group, with connect sum as the group operation, and orientation-reversed mirror image as inverse. The only subtle aspect of this is the existence of inverses, which we briefly explain. Let K be an arbitrary knot, and let K^! denote the mirror image of K with the opposite orientation. Arrange K \cup K^! in space so that they are symmetric with respect to reflection in a dividing plane. There is an immersed annulus A in S^3 which connects each point on K to its mirror image on K^!, and the self-intersections of this annulus are all disjoint embedded arcs, corresponding to the crossings of K in the projection to the mirror. This annulus is an example of what is called a ribbon surface. Connect summing K to K^! by pushing out a finger of each into an arc in the mirror connects the ribbon annulus to a ribbon disk spanning K \# K^!. A ribbon surface (in particular, a ribbon disk) can be pushed into a (smoothly) embedded surface in a 4-ball bounding S^3. Puncturing the 4-ball at some point on this smooth surface, one obtains a concordance from K\#K^! to the unknot, as claimed.

The resulting group is known as the concordance group \mathcal{C} of knots. Since connect sum is commutative, this group is abelian. Notice as above that a slice knot — i.e. a knot bounding a locally flat disk in the 4-ball — is concordant to the unknot. Ribbon knots (those bounding ribbon disks) are smoothly slice, and therefore slice, and therefore concordant to the trivial knot. Concordance makes sense for codimension two knots in any dimension. In higher even dimensions, knots are always slice, and in higher odd dimensions, Levine found an algebraic description of the concordance groups in terms of (Witt) equivalence classes of linking pairings on a Seifert surface; (some of) this information is contained in the signature of a knot.

Let K be a knot (in S^3 for simplicity) with Seifert surface \Sigma of genus g. If \alpha,\beta are loops in \Sigma, define f(\alpha,\beta) to be the linking number of \alpha with \beta^+, which is obtained from \beta by pushing it to the positive side of \Sigma. The function f is a bilinear form on H_1(\Sigma), and after choosing generators, it can be expressed in terms of a matrix V (called the Seifert matrix of K). The signature of K, denoted \sigma(K), is the signature (in the usual sense) of the symmetric matrix V + V^T. Changing the orientation of a knot does not affect the signature, whereas taking mirror image multiplies it by -1. Moreover, if \Sigma_1,\Sigma_2 are Seifert surfaces for K_1,K_2, one can form a Seifert surface \Sigma for K_1 \# K_2 for which there is some sphere S^2 \in S^3 that intersects \Sigma in a separating arc, so that the pieces on either side of the sphere are isotopic to the \Sigma_i, and therefore the Seifert matrix of K_1 \# K_2 can be chosen to be block diagonal, with one block for each of the Seifert matrices of the K_i; it follows that \sigma(K_1 \# K_2) = \sigma(K_1) + \sigma(K_2). In fact it turns out that \sigma is a homomorphism from \mathcal{C} to \mathbb{Z}; equivalently (by the arguments above), it is zero on knots which are topologically slice. To see this, suppose K bounds a locally flat disk \Delta in the 4-ball. The union \Sigma':=\Sigma \cup \Delta is an embedded bicollared surface in the 4-ball, which bounds a 3-dimensional Seifert “surface” W whose interior may be taken to be disjoint from S^3. Now, it is a well-known fact that for any oriented 3-manifold W, the inclusion \partial W \to W induces a map H_1(\partial W) \to H_1(W) whose kernel is Lagrangian (with respect to the usual symplectic pairing on H_1 of an oriented surface). Geometrically, this means we can find a basis for the homology of \Sigma' (which is equal to the homology of \Sigma) for which half of the basis elements bound 2-chains in W. Let W^+ be obtained by pushing off W in the positive direction. Then chains in W and chains in W^+ are disjoint (since W and W^+ are disjoint) and therefore the Seifert matrix V of K has a block form for which the lower right g \times g block is identically zero. It follows that V+V^T also has a zero g\times g lower right block, and therefore its signature is zero.

The Seifert matrix (and therefore the signature), like the Alexander polynomial, is sensitive to the structure of the first homology of the universal abelian cover of S^3 - K; equivalently, to the structure of the maximal metabelian quotient of \pi_1(S^3 - K). More sophisticated “twisted” and L^2 signatures can be obtained by studying further derived subgroups of \pi_1(S^3 - K) as modules over group rings of certain solvable groups with torsion-free abelian factors (the so-called poly-torsion-free-abelian groups). This was accomplished by Cochran-Orr-Teichner, who used these methods to construct infinitely many new concordance invariants.

The end result of this discussion is the existence of many, many interesting homomorphisms from the knot concordance group to the reals, and by plat closure, many interesting invariants of braids. The connection with quasimorphisms is the following:

Theorem(Brandenbursky): A homomorphism I:\mathcal{C} \to \mathbb{R} gives rise to a quasimorphism on braid groups if there is a constant C so that |I([K])| \le C\cdot\|K\|_g, where \|\cdot\|_g denotes 4-ball genus.

The proof is roughly the following: given pure braids \alpha,\beta one forms the knots \widehat{\alpha\Delta}, \widehat{\beta\Delta} and \widehat{\alpha\beta\Delta}. It is shown that the connect sum L:= \widehat{\alpha \Delta} \# \widehat{\beta\Delta} \# \widehat{\alpha\beta\Delta}^! bounds a Seifert surface whose genus may be universally bounded in terms of the number of strands in the braid group. Pushing this Seifert surface into the 4-ball, the hypothesis of the theorem says that I is uniformly bounded on L. Properties of I then give an estimate for the defect; qed.

It would be interesting to connect these observations up to other “natural” chiral, homogeneous invariants on mapping class groups. For example, associated to a braid or mapping class \phi \in \text{MCG}(S) one can (usually) form a hyperbolic 3-manifold M_\phi which fibers over the circle, with fiber S and monodromy \phi. The \eta-invariant of M_\phi is the signature defect \eta(M_\phi) = \int_Y p_1/3 - \text{sign}(Y) where Y is a 4-manifold with \partial Y = M_\phi with a product metric near the boundary, and p_1 is the first Pontriagin form on Y (expressed in terms of the curvature of the metric). Is \eta a quasimorphism on some subgroup of \text{MCG}(S) (eg on a subgroup consisting entirely of pseudo-Anosov elements)?