Hermann Amandus Schwarz (1843-1921) was a student of Kummer and Weierstrass, and made many significant contributions to geometry, especially to the fields of minimal surfaces and complex analysis. His mathematical creations are both highly abstract and flexible, and at the same time intimately tied to explicit and practical calculation.
I learned about Schwarz-Christoffel transformations, Schwarzian derivatives, and Schwarz’s minimal surface as three quite separate mathematical objects, and I was very surprised to discover firstly that they had all been discovered by the same person, and secondly that they form parts of a consistent mathematical narrative, which I will try to explain in this post to the best of my ability. There is an instructive lesson in this example (for me), that we tend to mine the past for nuggets, examples, tricks, formulae etc. while forgetting the points of view and organizing principles that made their discovery possible. Another teachable example is that of Dehn’s “invention” of combinatorial (infinite) group theory, as a natural branch of geometry; several generations of followers went about the task of reformulating Dehn’s insights and ideas in the language of algebra, “generalizing” them and stripping them of their context, before geometric and topological methods were reintroduced by Milnor, Schwarz (a different one this time), Stallings, Thurston, Gromov and others to spectacular effect (note: I have the second-hand impression that the geometric point of view in group theory (and every other subject) was never abandoned in the Soviet Union).
Schwarz’s minimal surface (also called “Schwarz’s D surface”, and sometimes “Schwarz’s H surface”) is an extraordinarily beautiful triply-periodic minimal surface of infinite genus that is properly embedded in $\mathbb{R}^3$. According to Nitsche’s excellent book (p.240), this minimal surface closely resembles the separating wall between inorganic and organic materials in the skeleton of a starfish. The basic building block of the surface can be described as follows. If the vertices of a cube are $2$-colored, the black vertices are the vertices of a regular tetrahedron. Let $Q$ denote the quadrilateral formed by four edges of this tetrahedron; then a fundamental piece of Schwarz’s surface is a minimal disk $D$ spanning $Q$.
The surface may be “analytically continued” by rotating $D$ through an angle $\pi$ around each boundary edge. Six copies of $D$ fit smoothly around each vertex, and the resulting surface extends (triply) periodically throughout space.
The symmetries of $D$ enable us to give it several descriptions as a Riemann surface. Firstly, we could think of $D$ as a polygon in the hyperbolic plane with four edges of equal length, and angles $\pi/3$. Twelve copies of $D$ can be assembled to make a hyperbolic surface $\Sigma$ of genus $3$. Thinking of a surface of genus $3$ as the boundary of a genus $3$ handlebody defines a homomorphism from $\pi_1(\Sigma)$ to $\mathbb{Z}^3$, thought of as the first homology of the handlebody; the cover $\widetilde{\Sigma}$ associated to the kernel is (conformally) the triply periodic Schwarz surface, and the deck group acts on $\mathbb{R}^3$ as a lattice (of finite index in the face-centered cubic lattice).
Another description is as follows. Since the deck group acts by translation, the Gauss map from $\widetilde{\Sigma}$ to $S^2$ factors through a map $\Sigma \to S^2$. The map is injective at each point in the interior or on an edge of a copy of $D$, but has an order $2$ branch point at each vertex. Thus, the map $\Sigma \to S^2$ is a double-branched cover, with one branch point of order $2$ at each vertex of a regular inscribed cube. This leads one to think (like a late 19th century mathematician) of $\Sigma$ as the Riemann surface on which a certain multi-valued function on $S^2 = \mathbb{C} \cup \infty$ is single-valued. Under stereographic projection, the vertices of the cube map to the eight points $\pm\alpha, \pm i\alpha, \pm 1/\alpha, \pm i/\alpha$ where $\alpha = (\sqrt{3}-1)/\sqrt{2}$. These eight points are the roots of the polynomial $w^8 - 14w^4 + 1$, so we may think of $\Sigma$ as the hyperelliptic Riemann surface defined by the equation $v^2 = w^8 - 14w^4 + 1$; equivalently, as the surface on which the multi-valued (on $\mathbb{C} \cup \infty$) function $R(w) = 1/\sqrt{w^8 - 14w^4 + 1}$ is single-valued.
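As a quick numerical sanity check on the branch points, here is a minimal Python sketch. It assumes the eight points are $\pm\alpha, \pm i\alpha, \pm 1/\alpha, \pm i/\alpha$ with $\alpha = (\sqrt{3}-1)/\sqrt{2}$ and that the polynomial is $w^8 - 14w^4 + 1$ (the standard data for Schwarz's D surface); the names `poly`, `cube_points` and `residuals` are purely illustrative.

```python
import math

# alpha = (sqrt(3) - 1) / sqrt(2): assumed modulus of the four innermost branch points
alpha = (math.sqrt(3) - 1) / math.sqrt(2)

def poly(w):
    """Evaluate w^8 - 14 w^4 + 1 at a (complex) number w."""
    return w**8 - 14 * w**4 + 1

# alpha and 1/alpha multiplied by the four fourth roots of unity
units = [1, 1j, -1, -1j]
cube_points = [u * r for r in (alpha, 1 / alpha) for u in units]

# Each residual should vanish up to floating point error
residuals = [abs(poly(w)) for w in cube_points]
```

The check works because $\alpha^2 = 2 - \sqrt{3}$, so $\alpha^4 = 7 - 4\sqrt{3}$ and $\alpha^{-4} = 7 + 4\sqrt{3}$ are the two roots of $t^2 - 14t + 1$.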
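The function $R(w)$ is known as the Weierstrass function associated to $\Sigma$, and explicit formulae for the co-ordinates of the embedding were found by Enneper and Weierstrass. After picking a basepoint (say $0$) on the sphere, the coordinates are given by integration:
$x_1 = \mathrm{Re} \int_0^p \frac{1}{2}(1-w^2) R(w) \, dw, \quad x_2 = \mathrm{Re} \int_0^p \frac{i}{2}(1+w^2) R(w) \, dw, \quad x_3 = \mathrm{Re} \int_0^p w R(w) \, dw$
The integral in each case depends on the path, and lifts to a single-valued function precisely on $\widetilde{\Sigma}$.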
Geometrically, the three coordinate functions $x_1, x_2, x_3$ are harmonic functions on $\widetilde{\Sigma}$. This corresponds to the fact that minimal surfaces are precisely those with vanishing mean curvature, and the fact that the Laplacian of the coordinate functions (in terms of isothermal parameters on the underlying Riemann surface) can be expressed as a nonzero multiple of the mean curvature vector. A harmonic function on a Riemann surface is (locally) the real part of a holomorphic function, unique up to a constant; the holomorphic derivatives of the (complexified) coordinate functions are therefore well-defined, and give holomorphic $1$-forms $\phi_1, \phi_2, \phi_3$ which descend to $\Sigma$ (since the deck group acts by translations). These $1$-forms satisfy the identity $\sum_i \phi_i^2 = 0$ (this identity expresses the fact that the embedding of $\widetilde{\Sigma}$ into $\mathbb{R}^3$ via these functions is conformal). The (composition of the) Gauss map (with stereographic projection) can be read off from the $\phi_i$, and as a meromorphic function on $\Sigma$, it is given by the formula $g = \phi_3/(\phi_1 - i\phi_2)$. Define a function $f$ on $\Sigma$ by the formula $f \, dg = \phi_1 - i\phi_2$. Then $(g,f)$ are the coordinates of a rational map from $\Sigma$ into $\mathbb{C}^2$ which extends to a map into $\mathbb{CP}^2$, by sending the poles of $g$ and $f$ to points in the $\mathbb{CP}^1$ at infinity. Symmetry allows us to identify the image with the hyperelliptic embedding from before, and we deduce that $g = w$ and $f = R(w)$. Solving for the $\phi_i$, namely $\phi_1 = \frac{1}{2}(1-g^2) f \, dg$, $\phi_2 = \frac{i}{2}(1+g^2) f \, dg$ and $\phi_3 = g f \, dg$, we obtain the integrands in the formulae above.
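The algebra relating the three $1$-forms to the Gauss map can be checked numerically. The sketch below assumes the standard Enneper-Weierstrass normalization $\phi_1 = \frac{1}{2}(1-w^2)R\,dw$, $\phi_2 = \frac{i}{2}(1+w^2)R\,dw$, $\phi_3 = wR\,dw$ with $R(w) = 1/\sqrt{w^8 - 14w^4 + 1}$; the function name `weierstrass_forms` and the sample point are illustrative choices, not anything taken from Nitsche or Schwarz.

```python
import cmath

def weierstrass_forms(w):
    """Coefficients of dw in the three Enneper-Weierstrass 1-forms at w.

    Assumes Weierstrass data g(w) = w and f(w) = R(w) = 1/sqrt(w^8 - 14 w^4 + 1),
    using the principal branch of the square root.
    """
    R = 1 / cmath.sqrt(w**8 - 14 * w**4 + 1)
    phi1 = 0.5 * (1 - w**2) * R
    phi2 = 0.5j * (1 + w**2) * R
    phi3 = w * R
    return phi1, phi2, phi3

w = 0.3 + 0.2j  # arbitrary sample point away from the branch points
phi1, phi2, phi3 = weierstrass_forms(w)

conformality = phi1**2 + phi2**2 + phi3**2  # should vanish identically
gauss = phi3 / (phi1 - 1j * phi2)           # should recover g(w) = w

# With w = u + iv, the real tangent vectors are x_u = Re(phi) and x_v = -Im(phi)
xu = [p.real for p in (phi1, phi2, phi3)]
xv = [-p.imag for p in (phi1, phi2, phi3)]
dot = sum(a * b for a, b in zip(xu, xv))                         # orthogonality
len_gap = sum(a * a for a in xu) - sum(b * b for b in xv)        # equal lengths
```

The vanishing of `dot` and `len_gap` is just the real and imaginary part of $\sum_i \phi_i^2 = 0$, which is the sense in which that identity expresses conformality.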
In fact, any holomorphic function on a domain in $\mathbb{C}$ defines a (typically immersed with branch points) minimal surface, by the integral formulae of Enneper-Weierstrass above. Suppose we want to use this fact to produce an explicit description of a minimal surface bounded by some explicit polygonal loop in $\mathbb{R}^3$. Any minimal surface so obtained can be continued across the boundary edges by rotation; if the angles at the vertices are all of the form $\pi/n$ the resulting surface closes up smoothly around the vertices, and one obtains a compact abstract Riemann surface $\Sigma$ tiled by copies of the fundamental region, together with a holonomy representation of $\pi_1(\Sigma)$ into the group of isometries of $\mathbb{R}^3$. Sometimes the image of this representation in the rotational part of the isometry group is finite, and one obtains an infinitely periodic minimal surface as in the case of Schwarz’s surface. A fundamental tile in $\Sigma$ can be uniformized as a hyperbolic polygon; equivalently, as a region in the upper half-plane bounded by arcs of semicircles perpendicular to the real axis. Since the edges of the loop are straight lines, the image of this hyperbolic polygon under the Gauss map is a region in $S^2$ also bounded by arcs of round circles; thus Schwarz’s study of minimal surfaces naturally led him to the problem of how to explicitly describe conformal maps between regions in the plane bounded by circular arcs. This problem is solved by the Schwarz-Christoffel transformation, and its generalizations, with help from the Schwarzian derivative.
Note that if $P$ and $P'$ are two such regions, then a conformal map from $P$ to $P'$ can be factored as the product of a map uniformizing $P$ as the upper half-plane, followed by the inverse of a map uniformizing $P'$ as the upper half-plane. So it suffices to find a conformal map when the domain is the upper half plane, decomposed into intervals and rays that are mapped to the edges of a circular polygon $P$. Near each vertex $v_i$, $P$ can be moved by a fractional linear transformation to (part of) a wedge, consisting of complex numbers with argument between $0$ and $\pi\alpha_i$, where $\pi\alpha_i$ is the angle at $v_i$. The function $z \mapsto z^{\alpha_i}$ uniformizes the upper half-plane as such a wedge; however it is not clear how to combine the contributions from each vertex, because of the complicated interaction with the fractional linear transformation. The fundamental observation is that there are certain natural holomorphic differential operators which are insensitive to the composition of a holomorphic function with groups of fractional linear transformations, and the uniformizing map can be expressed much more simply in terms of such operators.
For example, two functions that differ by addition of a constant have the same derivative: $(f+c)' = f'$. Functions that differ by multiplication by a constant have the same logarithmic derivative: $(\log cf)' = (\log f)'$. Putting these two observations together suggests defining the nonlinearity of a function as the composition $N(f) := (\log f')' = f''/f'$. This has the property that $N(af+b) = N(f)$ for any constants $a \ne 0$ and $b$. Under inversion $f \to 1/f$ the nonlinearity transforms by $N(1/f) = N(f) - 2f'/f$. From this, and a simple calculation, one deduces that the operator $N' - N^2/2$ is invariant under inversion, and since it is also invariant under addition and multiplication by constants, it is invariant under the full group of fractional linear transformations. This combination is called the Schwarzian derivative; explicitly, it is given by the formula $S(f) = f'''/f' - \frac{3}{2}(f''/f')^2$. Given the Schwarzian derivative $S(f)$, one may recover the nonlinearity $N(f)$ by solving the Riccati equation $N' - N^2/2 - S = 0$. As explained in another post, solutions of the Riccati equation preserve the projective structure on the line; in this case, it is a complex projective structure on the complex line. Equivalently, different solutions differ by an element of $\mathrm{PSL}(2,\mathbb{C})$, acting by fractional linear transformations, as we have just deduced. Once we know the nonlinearity, we can solve for $f$ by $f = \int e^{\int N}$, the usual solution to a first order linear ODE (here for $f'$).

The Schwarzian of the function $z^\alpha$ is $(1-\alpha^2)/2z^2$. The advantage of expressing things in these terms is that the Schwarzian of a uniformizing map $f$ for a circular polygon $P$ with angles $\pi\alpha_i$ at the vertices has the form of a rational function, with principal parts $\frac{1-\alpha_i^2}{2(z-a_i)^2} + \frac{\beta_i}{z-a_i}$, where the $a_i$ and the $\beta_i$ depend (unfortunately in a very complicated way) on the edges of $P$ (for the ugly truth, see Nehari, chapter 5). To see this, observe that the Schwarzian of the map has a pole of order two at each of finitely many points $a_i$ (the preimages of the vertices of $P$ under the uniformizing map) but is otherwise holomorphic. Moreover, it can be analytically continued into the lower half plane across the interval between successive $a_i$, by reflecting the image across each circular edge. After reflecting twice, the image of $f$ is transformed by a fractional linear transformation, so $S(f)$ has an analytic continuation which is single valued on the entire Riemann sphere, with finitely many isolated poles, and is therefore a rational function! When the edges of the polygon are straight, a simpler formula involving the nonlinearity specializes to the “familiar” Schwarz-Christoffel formula.
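Both claims about the Schwarzian (invariance under fractional linear transformations, and the value $(1-\alpha^2)/2z^2$ for the power map) are easy to test numerically. The following is a minimal sketch using Taylor series truncated at order three, assuming the formula $S(f) = f'''/f' - \frac{3}{2}(f''/f')^2$; the helper names (`mul`, `recip`, `mobius`, `schwarzian`, `power_series`) and the sample values are hypothetical choices made for illustration.

```python
def mul(p, q):
    """Multiply two Taylor series truncated at order 3 (lists of 4 coefficients)."""
    return [sum(p[i] * q[k - i] for i in range(k + 1)) for k in range(4)]

def recip(p):
    """Reciprocal of a truncated Taylor series whose constant term is nonzero."""
    r = [1 / p[0]]
    for k in range(1, 4):
        r.append(-sum(p[i] * r[k - i] for i in range(1, k + 1)) / p[0])
    return r

def mobius(a, b, c, d, p):
    """Taylor series of (a f + b) / (c f + d), where p is the series of f."""
    num = [a * x for x in p]
    den = [c * x for x in p]
    num[0] += b
    den[0] += d
    return mul(num, recip(den))

def schwarzian(p):
    """S(f) = f'''/f' - (3/2) (f''/f')^2 from Taylor coefficients [c0, c1, c2, c3]."""
    d1, d2, d3 = p[1], 2 * p[2], 6 * p[3]
    return d3 / d1 - 1.5 * (d2 / d1) ** 2

def power_series(alpha, z0):
    """Taylor coefficients of z -> z^alpha at z0 (principal branch)."""
    return [
        z0 ** alpha,
        alpha * z0 ** (alpha - 1),
        alpha * (alpha - 1) / 2 * z0 ** (alpha - 2),
        alpha * (alpha - 1) * (alpha - 2) / 6 * z0 ** (alpha - 3),
    ]

alpha, z0 = 1 / 3, 0.7 + 1.3j          # a wedge of angle pi/3, sampled in the upper half-plane
f = power_series(alpha, z0)
s_f = schwarzian(f)
s_expected = (1 - alpha**2) / (2 * z0**2)
s_moved = schwarzian(mobius(2, 1j, 1, 3, f))   # post-compose with an arbitrary Mobius map
```

Working with truncated series rather than finite differences keeps the comparison exact up to rounding, since the Schwarzian only involves derivatives up to order three.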
(Update 10/22): In fact, I went to the library to refresh myself on the contents of Nehari, chapter 5. The first thing I noticed, which I had forgotten, was that if $f$ is the uniformizing map from the upper half-plane to a polygon with spherical arcs, then $S(f)$ is real-valued on the real axis. Since it is a rational function, this implies that its nonsingular part is actually a constant; i.e.
$S(f) = \sum_i \frac{1-\alpha_i^2}{2(z-a_i)^2} + \sum_i \frac{\beta_i}{z-a_i} + \gamma$
where $a_i$ and $\alpha_i$ are as above, and $\beta_i$ and $\gamma$ are real constants (which satisfy some further conditions; really see Nehari this time for more details).
The other thing that struck me was the first paragraph of the preface, which touches on some of the issues I alluded to above:
In the preface to the first edition of Courant-Hilbert’s “Methoden der mathematischen Physik”, R. Courant warned against a trend discernible in modern mathematics in which he saw a menace to the future development of mathematical analysis. He was referring to the tendency of many workers in this field to lose sight of the roots of mathematical analysis in physical and geometric intuition and to concentrate their efforts on the refinement and the extreme generalization of existing concepts.
Instead of using a word like “menace”, I would rather take this as a lesson about the value of returning to the points of view that led to the creation of the mathematical objects we study every day; which was (to some approximation) the point I was trying to illustrate in this post.
Quadratic forms (i.e. homogeneous polynomials of degree two) are fundamental mathematical objects. For the ancient Greeks, quadratic forms manifested in the geometry of conic sections, and in Pythagoras’ theorem. Riemann recognized the importance of studying abstract smooth manifolds equipped with a field of infinitesimal quadratic forms (i.e. a Riemannian metric), giving rise to the theory of Riemannian manifolds. In contrast to more general norms, an inner product on a vector space enjoys a big group of symmetries; thus infinitesimal Riemannian geometry inherits all the richness of the representation theory of orthogonal groups, which organizes the various curvature tensors and Weitzenböck formulae. It is natural that quadratic forms should come up in so many distinct ways in differential geometry: one uses calculus to approximate a smooth object near some point by a linear object, and the “difference” is a second-order term, which can often be interpreted as a quadratic form. For example:
- If $M$ is a Riemannian manifold, at any point $p$ one can choose an orthonormal frame for $T_pM$, and exponentiate to obtain geodesic normal co-ordinates. In such local co-ordinates, the metric tensor $g_{ij}$ satisfies $g_{ij}(p) = \delta_{ij}$ and $\partial_k g_{ij}(p) = 0$. The second order derivatives can be expressed in terms of the Riemann curvature tensor at $p$.
- If $M$ is an immersed submanifold of Euclidean space, at every point $p \in M$ there is a unique linear subspace $L_p$ that is tangent to $M$ at $p$. The second order difference between these two spaces is measured by the second fundamental form $II$ of $M$, a quadratic form on $T_pM$ (with coefficients in the normal bundle) whose eigenvectors are the directions of (extrinsic) principal curvature. If $M$ has codimension one, the second fundamental form is easily described in terms of the Gauss map $g$ taking each point on $M$ to the unique unit normal to $M$ at that point, and using the flatness of the ambient Euclidean space to identify the normal spheres at different points with “the” standard sphere. The second fundamental form is then defined (up to a sign convention) by the formula $II(v,w) = \langle dg(v), w \rangle$. For higher codimension, one considers Gauss maps with values in an appropriate Grassmannian.
- If $f$ is a smooth function on a manifold $M$, a critical point of $f$ is a point $p$ at which $df = 0$ (i.e. at which all the partial derivatives of $f$ in some local coordinates vanish). At such a point, one defines the Hessian $Hf$, which is a quadratic form on $T_pM$, determined by the second partial derivatives of $f$ at such a point. If $\nabla$ is a Levi-Civita connection on $T^*M$ (determined by a Riemannian metric on $M$ compatible with the smooth structure) then $Hf = \nabla df$. The condition that the Levi-Civita connection is torsion-free translates into the fact that the antisymmetric part of $\nabla\theta$ is equal to $d\theta$ for any $1$-form $\theta$; in this context, since $d(df) = 0$, this means that the antisymmetric part of the Hessian vanishes, i.e. that it is symmetric (and therefore a quadratic form). If $\nabla'$ is a different connection, then $\nabla' df = \nabla df + \alpha(df)$ for some $1$-form $\alpha$ with values in the endomorphisms of $T^*M$; since $df$ vanishes at $p$, their values at $p$ agree, and $Hf$ is well-defined, independent of a choice of metric.
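To make the last point concrete, here is the Hessian written out in local coordinates; this is a standard computation added as a sketch, with $\Gamma^k_{ij}$ denoting the Christoffel symbols of whichever connection one has chosen (notation not used in the text above).

```latex
% Hessian in local coordinates x^1, ..., x^n, with Christoffel symbols \Gamma^k_{ij}
(\nabla df)_{ij} \;=\; \partial_i \partial_j f \;-\; \Gamma^k_{ij}\, \partial_k f .
% At a critical point p every \partial_k f(p) vanishes, so
(\nabla df)_{ij}(p) \;=\; \partial_i \partial_j f(p) .
```

The connection enters only through the term multiplied by the first derivatives of $f$, which is why the Hessian at a critical point is just the matrix of second partial derivatives, whatever connection is used.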
By contrast, cubic forms are less often encountered, either in geometry or in other parts of mathematics; their appearance is often indicative of unusual richness. For example: Lie groups arise as the subgroups of automorphisms of vector spaces preserving certain structure. Orthogonal and symplectic groups are those that preserve certain (symmetric or alternating) quadratic forms. The exceptional Lie group $G_2$ is the group of automorphisms of $\mathbb{R}^7$ that preserves a generic (i.e. nondegenerate) alternating $3$-form. One expects to encounter cubic forms most often in flavors of geometry in which the local transformation pseudogroups are bigger than the orthogonal group.
One example is that of $1$-dimensional complex projective geometry. If $U$ is a domain in the Riemann sphere, one can think of $U$ as a geometric space in at least two natural ways: by considering the local pseudogroup of all holomorphic self-maps between open subsets of the Riemann sphere, restricted to $U$ (i.e. all holomorphic functions), or by considering only those holomorphic maps that extend to the entire Riemann sphere (i.e. the projective transformations: $z \to (az+b)/(cz+d)$). The difference between these two geometric structures is measured by a third-order term, called the Schwarzian derivative. If $U$ is homeomorphic to a disk, then we can think of $U$ as the image of the round unit disk $D$ under a uniformizing map $f: D \to U$. At every point $p \in D$ there is a unique projective transformation $f_p$ that osculates to $f$ to second order at $p$ (i.e. has the same value, first derivative, and second derivative as $f$ at the point $p$); the (scaled) third derivative of the difference $f - f_p$ is the Schwarzian of $f$ at $p$. In local co-ordinates, $Sf = f'''/f' - \frac{3}{2}(f''/f')^2$. Actually, although the Schwarzian is sensitive to third-order information, it should really be thought of as a quadratic form on the (one-dimensional) complex tangent space at $p$.
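The osculating transformation can be written down explicitly, which makes the "scaled third derivative" precise. The following worked computation is a sketch; the notation $f_p$ for the osculating projective transformation and the scaling by $f'(p)$ are my own choices.

```latex
% The unique Mobius map with the same 2-jet as f at p:
f_p(z) \;=\; f(p) + \frac{f'(p)\,(z-p)}{1 - \dfrac{f''(p)}{2 f'(p)}\,(z-p)}
% Expanding the geometric series, with c = f''(p) / (2 f'(p)):
f_p(z) \;=\; f(p) + f'(p)(z-p) + \tfrac{1}{2} f''(p)(z-p)^2 + f'(p)\,c^2\,(z-p)^3 + \cdots
% so f_p'''(p) = 6 f'(p) c^2 = \tfrac{3}{2}\, f''(p)^2 / f'(p), and therefore
\frac{f'''(p) - f_p'''(p)}{f'(p)} \;=\; \frac{f'''(p)}{f'(p)} - \frac{3}{2}\left(\frac{f''(p)}{f'(p)}\right)^2 .
```

In other words, the third-order discrepancy between $f$ and its best projective approximation, divided by $f'(p)$, is exactly the usual formula for the Schwarzian.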
Real projective geometry gives rise to similar invariants. Consider an immersed curve in the (real projective) plane. At every point, there is a unique osculating conic, that agrees with the immersed curve to second order. The projective curvature (really a cubic form) measures the third order deviation between these two immersed submanifolds at this point. See e.g. the book by Ovsienko and Tabachnikov for more details.
Another example is the so-called symplectic curvature. Let $V$ be a flat symplectic space of dimension $2n$; this could be ordinary Euclidean space $\mathbb{R}^{2n}$ with its standard symplectic form, or a quotient of such a space by a discrete group of translations. A linear subspace $\pi$ of $\mathbb{R}^{2n}$ through the origin is a Lagrangian subspace if it has (maximal) dimension $n$, and the restriction of the symplectic form to $\pi$ is identically zero. A smooth submanifold $L$ of $V$ of dimension $n$ is Lagrangian if its tangent space at every point is a Lagrangian subspace. A Lagrangian submanifold of a flat symplectic space inherits a natural cubic form on the tangent space at every point, which can be defined in any of the following equivalent ways:
- If $M$ is a symplectic manifold and $L$ is a Lagrangian submanifold, then near any point $p \in L$ one can find a neighborhood $U$ and choose symplectic coordinates so that $U$ is symplectomorphic to a neighborhood of some point on the zero section of $T^*L$, with $L \cap U$ going to the zero section. Moreover, every other Lagrangian submanifold $L'$ sufficiently close (in $C^1$) to $L$ can be taken in some possibly smaller neighborhood to be of the form $df$, where $f$ is a smooth function on $L$ (well-defined up to a constant), thought of as a section of $T^*L$. In the context above, choose local symplectic coordinates (by a linear symplectic transformation) for which the flat space looks locally like $T^*\mathbb{R}^n$, the tangent space $T_pL$ looks locally like the zero section, and $L$ looks locally like the graph of $df$. The condition that $L$ and $T_pL$ are tangent at the origin means that the $2$-jet of $f$ vanishes (after normalizing $f$ to vanish at the origin). The first nonvanishing terms are the third partial derivatives of $f$, which can be thought of as the coefficients of a (symmetric) cubic form on $T_pL$.
- If we choose a Euclidean metric on $\mathbb{R}^{2n}$ compatible with the flat symplectic structure, the second fundamental form of $L$ at some point $p$ is a quadratic form on $T_pL$ with coefficients in the normal bundle to $L$. The symplectic form identifies the normal bundle to $L$ with the dual $T^*L$, so by contracting indices, one obtains a cubic form on $T_pL$. This form does not depend on the choice of Euclidean metric, since a different metric skews the normal bundle, replacing each normal vector with its sum with some tangent vector. But since $L$ is Lagrangian, the identification of this normal bundle with $T^*L$ is insensitive to the skewed term, and therefore independent of the choices.
- The space of all Lagrangian subspaces of is a symmetric space, homeomorphic to , sometimes called the Shilov boundary of the Siegel upper half-space. If and is a tangent vector to in , then one obtains a symmetric quadratic form on in the following way. If is a transverse Lagrangian to , and is a -parameter family of Lagrangians starting at , then for small the Lagrangians and are transverse, and span . For any there is a unique decomposition . Define . Then is a symmetric bilinear form that vanishes on , and therefore descends to a form on that depends only on . A Lagrangian submanifold maps to by the Gauss map . One obtains a cubic form on associated to as follows: if then is a tangent vector to in , and therefore determines a quadratic form on ; this form is then evaluated on the vectors .
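The first of these descriptions is easy to experiment with. The sketch below takes the illustrative function $f(x,y) = x^3 + xy^2$ on $\mathbb{R}^2$ (my choice, not from the sources cited here), whose $2$-jet vanishes at the origin, checks that the graph of $df$ in $T^*\mathbb{R}^2 = \mathbb{R}^4$ is Lagrangian for the standard symplectic form, and evaluates the cubic form given by the third partial derivatives. All function names are hypothetical.

```python
def hessian(x, y):
    """Hessian of the sample function f(x, y) = x^3 + x y^2 (an illustrative choice)."""
    return [[6 * x, 2 * y],
            [2 * y, 2 * x]]

def tangent_vector(H, u):
    """Tangent vector (u, H u) to the graph of df, written as a pair (q, p)."""
    Hu = [H[0][0] * u[0] + H[0][1] * u[1],
          H[1][0] * u[0] + H[1][1] * u[1]]
    return u, Hu

def omega(a, b):
    """Standard symplectic form sum_i dq_i ^ dp_i evaluated on two (q, p) pairs."""
    (q1, p1), (q2, p2) = a, b
    return sum(q1[i] * p2[i] - p1[i] * q2[i] for i in range(2))

# The tangent space to the graph at a sample point is spanned by these two vectors
H = hessian(0.4, -1.1)
e1 = tangent_vector(H, [1.0, 0.0])
e2 = tangent_vector(H, [0.0, 1.0])
pairing = omega(e1, e2)   # vanishes because the Hessian is symmetric

# Third partial derivatives of f: f_xxx = 6, f_xxy = 0, f_xyy = 2, f_yyy = 0
cubic = {"xxx": 6, "xxy": 0, "xyy": 2, "yyy": 0}

def cubic_form(u):
    """Evaluate the symmetric cubic form sum f_ijk u_i u_j u_k on a vector u."""
    x, y = u
    return (cubic["xxx"] * x**3 + 3 * cubic["xxy"] * x**2 * y
            + 3 * cubic["xyy"] * x * y**2 + cubic["yyy"] * y**3)
```

For example, `cubic_form([1.0, 2.0])` evaluates to `30.0`, which is $6f(1,2)$, as expected for a homogeneous cubic.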
One application of symplectic curvature is to homological mirror symmetry, where the symplectic curvature associated to a Lagrangian family of Calabi-Yau -folds in determines the so-called “Yukawa 3-differential”, whose expression in a certain local coordinate gives the generating function for the number of rational curves of degree in a generic quintic hypersurface in . This geometric picture is described explicitly in the work of Givental (e.g. here). In another more recent paper, Givental shows how the topological recursion relations, the string equation and the dilaton equation in Gromov-Witten theory can be reformulated in terms of the geometry of a certain Lagrangian cone in a formal loop space (the geometric property of this cone is that it is overruled — i.e. each tangent space is tangent to the cone exactly along , where is a formal variable). This geometric condition translates into properties of the symplectic curvature of the Lagrangian cone, from which one can read off the “gravitational descendents” in the theory (let me add that this subject is quite far from my area of expertise, and that I come to this material as an interested outsider).
Cubic forms occur naturally in other “special” geometric contexts, e.g. holomorphic symplectic geometry (Rozansky-Witten invariants), affine differential geometry (related to the discussion of the Schwarzian above), etc. Each of these contexts is the start of a long story, which is best kept for another post.