You are currently browsing the monthly archive for July 2009.
A beautiful identity in Euclidean geometry is the Brianchon-Gram relation (also called the Gram-Sommerville formula, or Gram’s equation), which says the following: let be a convex polytope, and for each face
of
, let
denote the solid angle along the face, as a fraction of the volume of a linking sphere. The relation then says:
Theorem (Brianchon-Gram relation): . In other words, the alternating sum of the (solid) angles of all dimensions of a convex polytope is zero.
Sketch of Proof: we prove the theorem in the case that is a simplex
; the more general case follows by generalizing to pyramids, and then decomposing any polytope into pyramids by coning to an interior point. This argument is due to Shephard.
Associated to each face is a spherical polyhedron
in
; if the span of
is the intersection of a family of half-spaces bounded by hyperplanes
with inward normals
, then
is the set of unit vectors
whose inner product with each
is non-negative. Note further that for each
there is some
that pairs non-negatively with
; consequently to each
one can assign a subset
of indices, so that
pairs non-negatively with
if and only if
. On the other hand, each subset
determines a unique face
of dimension
. By the inclusion-exclusion formula, we conclude that
“equals” zero, thought of as a signed union of spherical polyhedra. Since
, the formula follows. qed.
Another well-known proof starts by approximating the polytope by a rational polytope (i.e. one with rational vertices). The proof then goes via Macdonald reciprocity, using generating functions.
Example: Let be a triangle, with angles
. The solid angle at an interior point is
, and the solid angle at each edge is
. Hence we get
and therefore in this case Brianchon-Gram is equivalent to the familiar angle sum identity for a triangle:
.
Example: Next consider the example of a Euclidean simplex . The contribution from the interior is
, and the contribution from the four facets is
. There are six edges, with angles
, that contribute
. Each vertex contributes one spherical triangle, with (spherical) angles
for certain
, where each
appears as a spherical angle in exactly two spherical triangles. The Gauss-Bonnet theorem implies that the area of a spherical triangle is equal to the angle sum defect:
so the vertices contribute
and the identity is seen to follow in this case too.
Note in fact that the usual proof of Gauss-Bonnet for a spherical triangle is done by an inclusion-exclusion argument involving overlapping lunes, that is very similar to the proof of Brianchon-Gram given above.
The sketch of proof above just as easily proves an identity in the spherical scissors congruence group. For equal to spherical, Euclidean or hyperbolic space of dimension
, the scissors congruence group
is the abelian group generated by formal symbols
where
and
is a choice of orientation, modulo certain relations, namely:
if the
are contained in a hyperplane
- an odd permutation of the points induces multiplication by
; changing the orientation induces multiplication by
- if
is an isometry of
, then
for any set of
points, and any orientation
(Note that this definition of scissors congruence is consistent with that of Goncharov, and differs slightly from another definition consistent with Sah; this difference has to do with orientations, and has as a consequence the vanishing of spherical scissors congruence in even dimensions; whereas with Sah’s definition, for each
)
The argument we gave above shows that for any Euclidean simplex , we have
in
.
Scissors congruence satisfies several fundamental properties:
in
. To see this, “triangulate” the sphere as a pair of degenerate simplices, whose vertices lie entirely on a hyperplane.
- There is a natural multiplication
; to define it on simplices, think of
as the unit sphere in
. A complementary pair of subspaces
and
intersect
in a linked pair of spheres of dimensions
; if
are spherical simplices in these subspaces, the image of
is the join of these two simplices in
.
It follows that the polyhedra in
whenever
is a face of dimension at least
; for in this case,
is the join of a spherical simplex with a sphere of some dimension, and is therefore trivial in spherical scissors congruence. Hence the identity above simplifies to
in
.
One nice application is to extend the definition of Dehn invariants to ideal hyperbolic simplices. We recall the definition of the usual Dehn invariant. Given a simplex , for each face
we let
denote the spherical polyhedron equal to the intersection of
with the link of
. Then
. Ideal scissors congruence makes sense for ideal hyperbolic simplices, except in dimension one (where it is degenerate). For ideal hyperbolic simplices (i.e. those with some vertices at infinity), the formula above for Dehn invariant is adequate, except for the
-dimensional faces (i.e. the edges)
. This problem is solved by the following “regularization” procedure due to Thurston: put a disjoint horoball at each ideal vertex of
, and replace each infinite edge
by the finite edge
which is the intersection of
with the complement of the union of horoballs; hence one obtains terms of the form
in
. This definition apparently depends on the choice of horoballs. However, if
are two different horoballs, the difference is a sum of terms of the form
where
is constant, and
ranges over the edges sharing the common ideal vertex. The intersection of
with a horosphere is a Euclidean simplex
, and the
are exactly the spherical polyhedra
as
ranges over the vertices of
. By what we have shown above, the sum
is trivial in scissors congruence; it follows that
is well-defined.
For more general ideal polyhedra (and finite volume complete hyperbolic manifolds) one first decomposes into ideal simplices, then computes the Dehn invariant on each piece and adds. A minor variation of the usual argument on closed manifolds shows that the Dehn invariant of any complete finite-volume hyperbolic manifold vanishes.
Update(7/29/2009): It is perhaps worth remarking that the Brianchon-Gram relation can be thought of, not merely as an identity in spherical scissors congruence, but in the “bigger” spherical polytope group, in which one does not identify simplices that differ by an isometry. Incidentally, there is an interesting paper on this subject by Peter McMullen, in which he proves generalizations of Brianchon-Gram(-Sommerville), working explicitly in the spherical polytope group. He introduces what amounts to a generalization of the Dehn invariant, with domain the Euclidean translational scissors congruence group, and range a sum of tensor products of Euclidean translational scissors congruence (in lower dimensions) with spherical polytope groups. It appears, from the paper, that McMullen was aware of the classical Dehn invariant (in any case, he was aware of Sah’s book) but he does not refer to it explicitly.
I have just uploaded a paper to the arXiv, entitled “Scl, sails and surgery”. The paper discusses a connection between stable commutator length in free groups and the geometry of sails. This is an interesting example of what sometimes happens in geometry, where a complicated topological problem in low dimensions can be translated into a “simple” geometric problem in high dimensions. Other examples include the Veronese embedding in Algebraic geometry (i.e. the embedding of one projective space into another taking a point with homogeneous co-ordinates to the point whose homogeneous co-ordinates are the monomials of some fixed degree in the
), which lets one exhibit any projective variety as an intersection of a Veronese variety (whose geometry is understood very well) with a linear subspace.
In my paper, the fundamental problem is to compute stable commutator length in free groups, and more generally in free products of Abelian groups. Let’s focus on the case of a group where
are free abelian of finite rank. A
is just a wedge
of tori of dimension equal to the ranks of
. Let
be a free homotopy class of
-manifold in
, which is homologically trivial. Formally, we can think of
as a chain
in
, the vector space of group
-boundaries, modulo homogenization; i.e. quotiented by the subspace spanned by chains of the form
and
. One wants to find the simplest surface
mapping to
that rationally bounds
. I.e. we want to find a map
such that
factors through
, and so that the boundary
wraps homologically
times around each loop of
, in such a way as to infimize
. This infimum, over all maps of all surfaces
of all possible genus, is the stable commutator length of the chain
. Computing this quantity for all such finite chains is tantamount to understanding the bounded cohomology of a free group in dimension
.
Given such a surface , one can cut it up into simpler pieces, along the preimage of the basepoint
. Since
is a surface with boundary, these simpler pieces are surfaces with corners. In general, understanding how a surface can be assembled from an abstract collection of surfaces with corners is a hopeless task. When one tries to glue the pieces back together, one runs into trouble at the corners — how does one decide when a collection of surfaces “closes up” around a corner? The wrong decision leads to branch points; moreover, a decision made at one corner will propogate along an edge and lead to constraints on the choices one can make at other corners. This problem arises again and again in low-dimensional topology, and has several different (and not always equivalent) formulations and guises, including -
- Given an abstract branched surface and a weight on that surface, when is there an unbranched surface carried by the abstract branched surface and realizing the weight?
- Given a triangulation of a
-manifold and a collection of normal surface types in each simplex satisfying the gluing constraints but *not* necessarily satisfying the quadrilateral condition (i.e. there might be more than one quadrilateral type per simplex), when is there an immersed unbranched normal surface in the manifold realizing the weight?
- Given an immersed curve in the plane, when is there an immersion from the disk to the plane whose boundary is the given curve?
- Given a polyhedral surface (arising e.g. in computer graphics), how can one choose smooth approximations of the polygonal faces that mesh smoothly at the vertices?
I think of all these problems as examples of what I like to call the holonomy problem, since all of them can be reduced, in one way or another, to studying representations of fundamental groups of punctured surfaces into finite groups. The fortunate “accident” in this case is that every corner arises by intersecting a cut with a boundary edge of . Consequently, one never wants to glue more than two pieces up at any corner, and the holonomy problem does not arise. Hence in principle, to understand the surface
one just needs to understand the pieces of
that can arise by cutting, and the ways in which they can be reassembled.
This is still not a complete solution of the problem, since infinitely many kinds of pieces can arise by cutting complicated surfaces . The
-manifold
decomposes into a collection of arcs in the tori
and
which we denote
respectively, and the surface
(hereafter abbreviated to
) has edges that alternate between elements of
, and edges mapping to
. Since
is a torus, handles of
mapping to
can be compressed, reducing the complexity of
, and thereby
, so one need only consider planar surfaces
.
Let denote the real vector space with basis the set of ordered pairs
of elements of
(not necessarily distinct), and
the real vector space with basis the elements of
. A surface
determines a non-negative integral vector
, by counting the number of times a given pair of edges
appear in succession on one of the (oriented) boundary components of
. The vector
satisfies two linear constraints. First, there is a map
defined on a basis vector by
. The vector
satisfies
. Second, each element
is a based loop in
, and therefore corresponds to an element in the free abelian group
. Define
on a basis vector by
(warning: the notation obscures the fact that
and
map to quite different vector spaces). Then
; moreover, a non-negative rational vector
satisfying
has a multiple of the form
for some
as above. Denote the subspace of
consisting of non-negative vectors in the kernel of
and
by
. This is a rational polyhedral cone — i.e. a cone with finitely many extremal rays, each spanned by a rational vector.
Although every integral is equal to
for some
, many different
correspond to a given
. Moreover, if we are allowed to consider formal weighted sums of surfaces, then even more possibilities. In order to compute stable commutator length, we must determine, for a given vector
, an expression
where the
are positive real numbers, which minimizes
. Here
denotes orbifold Euler characteristic of a surface with corners; each corner contributes
to
. The reason one counts complexity using this modified definition is that the result is additive:
. The contribution to
from corners is a linear function on
. Moreover, a component
with
can be covered by a surface of high genus and compressed (increasing
); so such a term can always be replaced by a formal sum
for which
. Thus the only nonlinear contribution to
comes from the surfaces
whose underlying topological surface is a disk.
Call a vector a disk vector if
where
is topologically a disk (with corners). It turns out that the set of disk vectors
has the following simple form: it is equal to the union of the integer lattice points contained in certain of the open faces of
(those satisfying a combinatorial criterion). Define the sail of
to be equal to the boundary of the convex hull of the polyhedron
(where
here denotes Minkowski sum). The Klein function
is the unique continuous function on
, linear on rays, that is equal to
exactly on the sail. Then
over expressions
satisfies
where
denotes
norm. To calculate stable commutator length, one minimizes
over
contained in a certain rational polyhedron in
.
Sails are considered elsewhere by several authors; usually, people take to be the set of all integer vectors except the vertex of the cone, and the sail is therefore the boundary of the convex hull of this (simpler) set. Klein introduced sails as a higher-dimensional generalization of continued fractions: if
is a polyhedral cone in two dimensions (i.e. a sector in the plane, normalized so that one edge is the horizontal axis, say), the vertices of the sail are the continued fraction approximations of the boundary slope. Arnold has revived the study of such objects in recent years. They arise in many different interesting contexts, such as numerical analysis (especially diophantine approximation) and algebraic number theory. For example, let
be a matrix with irreducible characteristic equation, and all eigenvalues real and positive. There is a basis for
consisting of eigenvalues, spanning a convex cone
. The cone — and therefore its sail — is invariant under
; moreover, there is a
subgroup of
consisting of matrices with the same set of eigenvectors; this observation follows from Dirichlet’s theorem on the units in a number field, and is due to Tsuchihashi. This abelian group acts freely on the sail with quotient a (topological) torus of dimension
, together with a “canonical” cell decomposition. This connection between number theory and combinatorics is quite mysterious; for example, Arnold asks: which cell decompositions can arise? This is unknown even in the case
.
The most interesting aspect of this correspondence, between stable commutator length and sails, is that it allows one to introduce parameters. An element in a free group can be expressed as a word in letters
, e.g.
, which is usually abbreviated with exponential notation, e.g.
. Having introduced this notation, one can think of the exponents as parameters, and study stable commutator length in families of words, e.g.
. Under the correspondence above, the parameters only affect the coefficients of the linear map
, and therefore one obtains families of polyhedral cones
whose extremal rays depend linearly on the exponent parameters. This lets one prove many facts about the stable commutator length spectrum in a free group, including:
Theorem: The image of a nonabelian free group of rank at least under scl in
is precisely
.
and
Theorem: For each , the image of the free group
under scl contains a well-ordered sequence of values with ordinal type
. The image of
contains a well-ordered sequence of values with ordinal type
.
One can also say things about the precise dependence of scl on parameters in particular families. More conjecturally, one would like to use this correspondence to say something about the statistical distribution of scl in free groups. Experimentally, this distribution appears to obey power laws, in the sense that a given (reduced) fraction appears in certain infinite families of elements with frequency proportional to
for some power
(which unfortunately depends in a rather opaque way on the family). Such power laws are reminiscent of Arnold tongues in dynamics, one of the best-known examples of phase locking of coupled nonlinear oscillators. Heuristically one expects such power laws to appear in the geometry of “random” sails — this is explained by the fact that the (affine) geometry of a sail depends only on its
orbit, and the existence of invariant measures on a natural moduli space; see e.g. Kontsevich and Suhov. The simplest example concerns the (
-dimensional) cone spanned by a random integral vector in
. The
orbit of such a vector depends only on the gcd of the two co-ordinates. As is easy to see, the probability distribution of the gcd of a random pair of integers
obeys a power law:
with probability
. The rigorous justification of the power laws observed in the scl spectrum of free groups remains the focus of current research by myself and my students.
The development and scope of modern biology is often held out as a fantastic opportunity for mathematicians. The accumulation of vast amounts of biological data, and the development of new tools for the manipulation of biological organisms at microscopic levels and with unprecedented accuracy, invites the development of new mathematical tools for their analysis and exploitation. I know of several examples of mathematicians who have dipped a toe, or sometimes some more substantial organ, into the water. But it has struck me that I know (personally) few mathematicians who believe they have something substantial to learn from the biologists, despite the existence of several famous historical examples. This strikes me as odd; my instinctive feeling has always been that intellectual ruts develop so easily, so deeply, and so invisibly, that continual cross-fertilization of ideas is essential to escape ossification (if I may mix biological metaphors . . .)
It is not necessarily easy to come up with profound examples of biological ideas or principles that can be easily translated into mathematical ones, but it is sometimes possible to come up with suggestive ones. Let me try to give a tentative example.
Deoxiribonucleic acid (DNA) is a nucleic acid that contains the genetic blueprint for all known living things. This blueprint takes the form of a code — a molecule of DNA is a long polymer strand composed of simple units called nucleotides; such a molecule is typically imagined as a string in a four character alphabet , which stand for the nucleotides Adenine, Thymine, Guanine, and Cytosine. These molecular strands like to arrange themselves in tightly bound oppositely aligned pairs, matching up nucleotides in one string with complementary nucleotides in the other, so that
matches with
, and
with
.
The geometry of a strand of DNA is very complicated — strands can be tangled, knotted, linked in complicated ways, and the fundamental interactions between strands (e.g. transcription, recombination) are facilitated or obstructed by mechanical processes depending on this geometry. Topology, especially knot theory, has been used in the study of some of these processes; the value of topological methods in this context include their robustness (fault-tolerance) and the discreteness of their invariants (similar virtues motivate some efforts to build topological quantum computers). A complete mathematical description of the salient biochemistry, mechanics, and semantic content of a configuration of DNA in a single cell is an unrealistic goal for the foreseeable future, and therefore attempts to model such systems depends on ignoring, or treating statistically, certain features of the system. One such framework ignores the ambient geometry entirely, and treats the system using symbolic, or combinatorial methods which have some of the flavor of geometric group theory.
One interesting approach is to consider a mapping from the alphabet of nucleotides to a standard generating set for , the free group on two generators; for example, one can take the mapping
where
are free generators for
, and
denote their inverses. Then a pair of oppositely aligned strands of DNA translates into an edge of a van Kampen diagram — the “words” obtained by reading the letters along an edge on either side are inverse in
.
Strands of DNA in a configuration are not always paired along their lengths; sometimes junctions of three or more strands can form; certain mobile four-strand junctions, so-called “Holliday junctions”, perform important functions in the process of genetic recombination, and are found in a wide variety of organisms. A configuration of several strands with junctions of varying valences corresponds in the language of van Kampen diagrams to a fatgraph — i.e. a graph together with a choice of cyclic ordering of edges at each vertex — with edges labeled by inverse pairs of words in (note that this is quite different from the fatgraph model of proteins developed by Penner-Knudsen-Wiuf-Andersen). The energy landscape for branch migration (i.e. the process by which DNA strands separate or join along some segment) is very complicated, and it is challenging to model it thermodynamically. It is therefore not easy to predict in advance what kinds of fatgraphs are more or less likely to arise spontaneously in a prepared “soup” of free DNA strands.
As a thought experiment, consider the following “toy” model, which I do not suggest is physically realistic. We make the assumption that the energy cost of forming a junction of valence is
for some fixed constant
. Consequently, the energy of a configuration is proportional to
, i.e. the negative of Euler characteristic of the underlying graph. Let
be a reduced word, representing an element of
, and imagine a soup containing some large number of copies of the strand of DNA corresponding to the string
. In thermodynamic equilibrium, the partition function has the form
where
is Boltzmann’s constant,
is temperature, and
is the energy of a configuration (which by hypothesis is proportional to
). At low temperature, minimal energy configurations tend to dominate; these are those that minimize
per unit “volume”. Topologically, a fatgraph corresponding to such a configuration can be thickened to a surface with boundary. The words along the edges determine a homotopy class of map from such a surface to a
(e.g. a once-punctured torus) whose boundary components wrap multiply around the free homotopy class corresponding to the conjugacy class of
. The infimum of
where
is the winding degree on the boundary, taken over all configurations, is precisely the stable commutator length of
; see e.g. here for a definition.
Anyway, this example is perhaps a bit strained (and maybe it owes more to thermodynamics than to biology), but already it suggests a new mathematical object of study, namely the partition function as above, and one is already inclined to look for examples for which the partition function obeys a symmetry like that enjoyed by the Riemann zeta function, or to specialize temperature to other values, as in random matrix theory. The introduction of new methods into the study of a classical object — for example, the decision to use thermodynamic methods to organize the study of van Kampen diagrams — bends the focus of the investigation towards those examples and contexts where the methods and tools are most informative. Phenomena familiar in one context (power laws, frequency locking, phase transitions etc.) suggest new questions and modes of enquiry in another. Uninspired or predictable research programs can benefit tremendously from such infusions, whether the new methods are borrowed from other intellectual disciplines (biology, physics), or depend on new technology (computers), or new methods of indexing (google) or collaboration (polymath).
One of my intellectual heroes — Wolfgang Haken — worked for eight years in R+D for Siemens in Munich after completing his PhD. I have a conceit (unsubstantiated as far as I know by biographical facts) that his experience working for a big engineering firm colored his approach to mathematics, and made it possible for him to imagine using industrial-scale “engineering” tools (e.g. integer programming, exhaustive computer search of combinatorial possibilities) to solve two of the most significant “pure” mathematical open problems in topology at the time — the knot recognition problem, and the four-color theorem. It is an interesting exercise to try to imagine (fantastic) variations. If I sit down and decide to try to prove (for example) Cannon’s conjecture, I am liable to try minor variations on things I have tried before, appeal for my intuition to examples that I understand well, read papers by others working in similar ways on the problem, etc. If I imagine that I have been given a billion dollars to prove the conjecture, I am almost certain to prioritize the task in different ways, and to entertain (and perhaps create) much more ambitious or innovative research programs to tackle the task. This is the way in which I understand the following quote by John Dewey, which I used as the colophon of my first book:
Every great advance in science has issued from a new audacity of the imagination.
I have struggled for a long time (and I continue to struggle) with the following question:
Question: Is the group of self-homeomorphisms of the unit disk (in the plane) that fix the boundary pointwise a left-orderable group?
Recall that a group is left-orderable if there is a total order
on the elements satisfying
if and only if
for all
. For a countable group, the property of being left orderable is equivalent to the property that the group admits a faithful action on the interval by orientation-preserving homeomorphisms; however, this equivalence is not “natural” in the sense that there is no natural way to extract an ordering from an action, or vice-versa. This formulation of the question suggests that one is trying to embed the group of homeomorphisms of the disk into the group of homeomorphisms of the interval, an unlikely proposition, made even more unlikely by the following famous theorem of Filipkiewicz:
Theorem: (Filipkiewicz) Let be two compact manifolds, and
two non-negative integers or infinity. Suppose the connected components of the identity of
and
are isomorphic as abstract groups. Then
and the isomorphism is induced by some diffeomorphism.
The hard(est?) part of the argument is to identify a subgroup stabilizing a point in purely algebraic terms. It is a fundamental and well-studied problem, in some ways a natural outgrowth of Klein’s Erlanger programme, to perceive the geometric structure on a space in terms of algebraic properties of its automorphism group. The book by Banyaga is the best reference I know for this material, in the context of “flexible” geometric structures, with big transformation groups (it is furthermore the only math book I know with a pink cover).
Left orderability is inherited under extensions. I.e. if is a short exact sequence, and both
and
are left orderable, then so is
. Furthermore, it is a simple but useful theorem of Burns and Hale that a group
is left orderable if and only if for every finitely generated subgroup
there is a left orderable group
and a surjective homomorphism
. The necessity of this condition is obvious: a subgroup of a left orderable group is left orderable (by restricting the order), so one can take
to be
and the surjection to be the identity. One can exploit this strategy to show that certain transformation groups are left orderable, as follows:
Example: Suppose is a group of homeomorphisms of some space
, with a nonempty fixed point set. If
is a finitely generated subgroup of
, then there is a point
in the frontier of
so that
has a nontrivial image in the group of germs of homeomorphisms of
at
. If this group of germs is left-orderable for all
, then so is
by Burns-Hale.
Example: (Rolfsen-Wiest) Let be the group of PL homeomorphisms of the unit disk (thought of as a PL square in the plane) fixed on the boundary. If
is a finitely generated subgroup, there is a point
in the frontier of
. Note that
has a nontrivial image in the group of piecewise linear homeomorphisms of the projective space of lines through
. Since the fixed point set of a finitely generated subgroup is equal to the intersection of the fixed point sets of a finite generating set, it is itself a polyhedron. Hence
fixes some line through
, and therefore has a nontrivial image in the group of homeomorphisms of an interval. By Burns-Hale,
is left orderable.
Example: Let be the group of diffeomorphisms of the unit disk, fixed on the boundary. If
is a finitely generated subgroup, then at a non-isolated point
in
the group
fixes some tangent vector to
(a limit of short straight lines from
to nearby fixed points). Consequently the image of
in
is reducible, and is conjugate into an affine subgroup, which is left orderable. If the image is nontrivial, we are done by Burns-Hale. If it is trivial, then the linear part of
at
is trivial, and therefore by the Thurston stability theorem, there is a nontrivial homomorphism from
to the (orderable) group of translations of the plane. By Burns-Hale, we conclude that
is left orderable.
The second example does not require infinite differentiability, just , the necessary hypothesis to apply the Thurston stability theorem. This is such a beautiful and powerful theorem that it is worth making an aside to discuss it. Thurston’s theorem says that if
is a finitely generated group of germs of diffeomorphisms of a manifold fixing a common point, then a suitable limit of rescaled actions of the group near the fixed point converge to a nontrivial action by translations. One way to think of this is in terms of power series: if
is a group of real analytic diffeomorphisms of the line, fixing the point
, then every
can be expanded as a power series:
. The function
is a multiplicative homomorphism; however, if the logarithm of
is identically zero, then if
is the first index for which some
is nonzero, then
is an additive homomorphism. The choice of coefficient
is a “gauge”, adapted to
, that sees the most significant nontrivial part; this leading term is a character (i.e. a homomorphism to an abelian group), since the nonabelian corrections have strictly higher degree. Thurston’s insight was to realize that for a finitely generated group of germs of
diffeomorphisms with trivial linear part, one can find some gauge that sees the most significant nontrivial part of the action of the group, and at this gauge, the action looks abelian. There is a subtlety, that one must restrict attention to finitely generated groups of homeomorphisms: on each scale of a sequence of finer and finer scales, one of a finite generating set differs the most from the identity; one must pass to a subsequence of scales for which this one element is constant (this is where the finite generation is used). The necessity of this condition is demonstrated by a theorem of Sergeraert: the group of germs of (
) diffeomorphisms of the unit interval, infinitely tangent to the identity at both endpoints (i.e. with trivial power series at each endpoint) is perfect, and therefore admits no nontrivial homomorphism to an abelian group.
Let us now return to the original question. The examples above suggest that it might be possible to find a left ordering on the group of homeomorphisms of the disk, fixed on the boundary. However, I think this is misleading. The construction of a left ordering in either category (PL or smooth) was ad hoc, and depended on locality in two different ways: the locality of the property of left orderability (i.e. Burns-Hale) and the tameness of groups of PL or smooth homeomorphisms blown up near a common fixed point. Rescaling an arbitrary homeomorphism about a fixed point does not make things any less complicated. Burns-Hale and Filipkiewicz together suggest that one should look for a structural dissimilarity between the group of homeomorphisms of the disk and of the interval that persists in finitely generated subgroups. The simplest way to distinguish between the two spaces algebraically is in terms of their lattices of closed (or equivalently, open) subsets. To a topological space , one can associate the lattice
of (nonempty, for the sake of argument) closed subsets of
, ordered by inclusion. One can reconstruct the space
from this lattice, since points in
correspond to minimal elements. However, any surjective map
defines an embedding
, so there are many structure-preserving morphisms between such lattices. The lattice
is an
-space in an obvious way, and one can study algebraic maps
together with homomorphisms
for which the algebraic maps respect the induced
-structures. A weaker “localization” of this condition asks merely that for points (i.e. minimal elements)
in the same
-orbit, their images in
are in the same
-orbit. This motivates the following:
Proposition: There is a surjective map from the unit interval to the unit disk so that the preimages of any two points are homeomorphic.
Sketch of Proof: This proposition follows from two simpler propositions. The first is that there is a surjective map from the unit interval to itself so that every point preimage is a Cantor set. The second is that there is a surjective map from the unit interval to the unit disk so that the preimage of any point is finite. A composition of these two maps gives the desired map, since a finite union of Cantor sets is itself a Cantor set.
There are many surjective maps from the unit interval to the unit disk so that the preimage of any point is finite. For example, if is a hyperbolic three-manifold fibering over the circle with fiber
, then the universal cover of a fiber
is properly embedded in hyperbolic
-space, and its ideal boundary (a circle) maps surjectively and finitely-to-one to the sphere at infinity of hyperbolic
-space. Restricting to a suitable subinterval gives the desired map.
To obtain the first proposition, one builds a surjective map from the interval to itself inductively; there are many possible ways to do this, and details are left to the reader. qed.
It is not clear how much insight such a construction gives.
Another approach to the original question involves trying to construct an explicit (finitely generated) subgroup of the group of homeomorphisms of the disk that is not left orderable. There is a “cheap” method to produce finitely presented groups with no left-orderable quotients. Let be a group defined by a presentation, where
is a word in the letters
and
, and
is a word in the letters
and
. In any left-orderable quotient in which both
and
are nontrivial, after reversing the orientation if necessary, we can assume that
. If further
then
, contrary to the fact that
. If
, then
, contrary to the fact that
. In either case we get a contradiction. One can try to build by hand nontrivial homeomorphisms
of the unit disk, fixed on the boundary, that satisfy
. Some evidence that this will be hard to do comes from the fact that the group of smooth and PL homeomorphisms of the disk are in fact left-orderable: any such
can be arbitrarily well-approximated by smooth
; nevertheless at least one of the words
evaluated on any smooth
will be nontrivial. Other examples of finitely presented groups that are not left orderable include higher Q-rank lattices (e.g. subgroups of finite index in
when
), by a result of Dave Witte-Morris. Suppose such a group admits a faithful action by homeomorphisms on some closed surface of genus at least
. Since such groups do not admit homogeneous quasimorphisms, their image in the mapping class group of the surface is finite, so after passing to a subgroup of finite index, one obtains a (lifted) action on the universal cover. If the genus of the surface is at least
, this action can be compactified to an action by homeomorphisms on the unit disk (thought of as the universal cover of a hyperbolic surface) fixed pointwise on the boundary. Fortunately or unfortunately, it is already known by Franks-Handel (see also Polterovich) that such groups admit no area-preserving actions on closed oriented surfaces (other than those factoring through a finite group), and it is consistent with the so-called “Zimmer program” that they should admit no actions even without the area-preserving hypothesis when the genus is positive (of course,
admits a projective action on
). Actually, higher rank lattices are very fragile, because of Margulis’ normal subgroup theorem. Every normal subgroup of such a lattice is either finite or finite index, so to prove the results of Franks-Handel and Polterovich, it suffices to find a single element in the group of infinite order that acts trivially. Unipotent elements are exponentially distorted in the word metric (i.e. the cyclic subgroups they generate are not quasi-isometrically embedded), so one “just” needs to show that groups of area-preserving diffeomorphisms of closed surfaces (of genus at least
) do not contain such distorted elements. Some naturally occurring non-left orderable groups include some (rare) hyperbolic
-manifold groups, amenable but not locally indicable groups, and a few others. It is hard to construct actions of such groups on a disk, although certain flows on
-manifolds give rise to actions of the fundamental group on a plane.

Amenability of Thompson’s group F?
July 6, 2009 in Commentary, Groups | Tags: Akhmedov, amenability, Dynamics, Shavgulidze, Thompson's group | by Danny Calegari | 12 comments
Geometric group theory is not a coherent and unified field of enquiry so much as a collection of overlapping methods, examples, and contexts. The most important examples of groups are those that arise in nature: free groups and fundamental groups of surfaces, the automorphism groups of the same, lattices, Coxeter and Artin groups, and so on; whereas the most important properties of groups are those that lend themselves to applications or can be used in certain proof templates: linearity, hyperbolicity, orderability, property (T), coherence, amenability, etc. It is natural to confront examples arising in one context with properties that arise in the other, and this is the source of a wealth of (usually very difficult) problems; e.g. do mapping class groups have property (T)? (no, by Andersen) or: is every lattice in
virtually orderable?
As remarked above, it is natural to formulate these questions, but not necessarily productive. Gromov, in his essay Spaces and Questions remarks that
A famous question of the kind Gromov warns against is the following:
Question: Is Thompson’s group
amenable?
Recall that Thompson’s group is the group of (orientation-preserving) PL homeomorphisms of the unit interval with breakpoints at dyadic rationals (i.e. rational numbers of the form
for integers
) and derivatives all powers of
. This group is a rich source of examples/counterexamples in geometric group theory: it is finitely presented (in fact
) but “looks like” a transformation group; it contains no nonabelian free group (by Brin-Squier), but obeys no law. It is not elementary amenable (i.e. it cannot be built up from finite or abelian groups by elementary operations — subgroups, quotients, extensions, directed unions), so it is “natural” to wonder whether it is amenable at all, or whether it is one of the rare examples of nonamenable groups without nonabelian free subgroups (see this post for a discussion of amenability versus the existence of free subgroups, and von Neumann’s conjecture). This question has attracted a great deal of attention, possibly because of its long historical pedigree, rather than because of the potential applications of a positive (or negative) answer.
Recently, two papers were posted on the arXiv, promising competing resolutions of the question. In February, Azer Akhmedov posted a preprint claiming to show that the group
is not amenable. This preprint was revised, withdrawn, then revised again, and as of the end of April, Akhmedov continues to press his claim. Akhmedov’s argument depends on a new geometric criterion for nonamenability, roughly speaking, the existence of a
-generator subgroup and a subadditive non-negative function on the group whose values grow at a definite rate on words in the subgroup whose exponents satisfy suitable parity conditions and inequalities. The non-negative function (Akhmedov calls it a “height function”) certifies the existence of a sufficiently “bushy” subset of the group to violate Folner’s criterion for amenability. Akhmedov’s paper reads like a “conventional” paper in geometric group theory, using methods from coarse geometry, careful combinatorial and counting arguments to establish the existence of a geometric object with certain large-scale properties, and an appeal to a standard geometric criterion to obtain the desired result. Akhmedov’s paper is part of a series, relating (non)amenability to certain other interesting geometric properties, some related to the so-called “traveling salesman” property, introduced earlier by Akhmedov.
On the other hand, in May, E. Shavgulidze posted a preprint claiming to show that the group
is amenable. Interestingly enough, Shavgulidze’s argument does not apply to the slightly more general class of Stein-Thompson groups in which slopes and denominators of break points can be divisible by an arbitrary (but prescribed) finite set of prime numbers. Moreover, his methods are very unlike any that one would expect to find in the typical geometric group theory paper. The argument depends on the construction, going back (in some sense) to a paper of Shavgulidze from 1978, of a measure on the space
of continuous functions on the interval which is quasi-invariant under the natural action of the group of diffeomorphisms of the interval of regularity at least
. In more detail, let
denote the group of diffeomorphisms of the interval of regularity at least
for each
, and let
denote the Banach space of continuous functions on the interval that vanish at the origin. Define
by the formula
. The space
can be equipped with a natural measure — the Wiener measure
of variance
, and this measure can be pulled back to
by
, which is thought of as a topological space with the
topology. Shavgulidze shows that the left action of
on
quasi-preserves this measure. Here the Wiener measure on
is the probability measure associated to Brownian motion (with given variance). A “sample” trajectory
from
is characterized by three properties: that it starts at the origin (i.e.
), that is it continuous almost surely (this is already implicit in the fact that the measure is supported on the space
and not some more general space), and that increments are independent, with the distribution of
equal to a Gaussian with mean zero and variance
. Shavgulidze’s argument depends first on an argument of Ghys-Sergiescu that shows Thompson’s group is conjugate (by a homeomorphism) to a discrete subgroup of the group of
diffeomorphisms of the interval. A bounded function
on
determines a continuous bounded function
on
(for
) by a certain convolution trick, using both the group structure of
, and its discreteness in
. Roughly, given an element
, the set of elements of
whose (group) composition with
is uniformly bounded in the
norm is finite; so the value of
is obtained by taking a suitable average of the value of
on this finite subset of
. This reduces the problem of the amenability of
to the existence of a suitable functional on the space of bounded continuous functions on
, which is constructed via the pulled back Wiener measure as above.
There are several distinctive features of Shavgulidze’s preprint. One of the most striking is that it depends on very delicate analytic features of the Wiener measure, and the way it transforms under the action of
on
— a transformation law involving the Schwartzian derivative — and suggesting that certain parts of the argument could be clarified (at least from the point of view of a topologist?) by using projective geometry and Sturm-Liouville theory. Another is that the crucial analytic quality — namely differentiability of class
— is also crucial for many other natural problems in
-dimensional analysis and geometry, from regularity estimates in the thin obstacle problem, to Navas’ work on actions of property (T) groups on the circle. At least one of the preprints by Akhmedov and Shavgulidze must be in error (in fact, a real skeptic’s skeptic such as Michael Aschbacher is not even willing to concede that much . . .) but even if wrong, it is possible that they contain things more valuable than a resolution of the question that prompted them.
Update (7/6): Azer Akhmedov sent me a construction of a (nonabelian) free subgroup of
that is discrete in the
topology. This is not quite enough regularity to intersect with Shavgulidze’s program, but it is interesting, and worth explaining. This is my (minor) modification of Azer’s construction (any errors are due to me):
Proposition: The group
contains a discrete nonabelian free subgroup.
Sketch of Proof: First, decompose the interval
into countably many disjoint subintervals accumulating only at the endpoints. Choose a free action on two generators by doing something generic on each subinterval, in such a way that the derivative is equal to
at the endpoints. This can certainly be accomplished; for concreteness, choose the action so that for each subinterval
there is a point
in the interior of
whose stabilizer is trivial.
Second, for each pair of distinct words in the generators, choose a subinterval and modify the action there so that the derivatives of those words in that subinterval differ by at least some definite constant
at some point. In more detail: enumerate the pairs of words somehow
where each
is a pair of words
in the generators, and modify the action on the subinterval
so the words in
differ by at least
in the
norm on the interval
. Since we are modifying the generators infinitely many times, but in such a way that the support of the modification exits any compact subset of the interior, we just need to check that the modifications are
. Since there are only finitely many pairs of words, both of which are of bounded length (for any given bound), when
is sufficiently big, one of the words
,
has length at least
where
goes to infinity as
goes to infinity. Without loss of generality, we can order the pairs so that
is the “long” word.
Now this is how we modify the action in
. Recall that the point
has trivial stabilizer, so the translates
of
under the suffixes of
are distinct. Take disjoint intervals about the
and observe that each
is taken to
by one of the generators. Modify this generator inside this disjoint neighborhood so that
is still taken to
, but the derivative at that point is multiplied by
, and the derivative at nearby points is not multiplied by more than
. Since the neighborhoods of the
are disjoint, these modifications are all compatible, and the derivative of the generators does not change by more than
at any point. Since
goes to infinity as
goes to infinity, we can perform such modifications for each
, and the resulting action is still
. But now the derivative of
at
has been multiplied by
, so
and
differ by at least
in the
norm. qed.
It is interesting to observe that this construction, while
, is not
for any
. For big
, we have
whereas
. Introducing a “bump” which modifies the derivative by
in a subinterval of size
will blow up every Holder norm.
(Update 8/10): Mark Sapir has created a webpage to discuss Shavgulidze’s paper here. Also, Matt Brin has posted notes on Shavgulidze’s paper here. The notes are very nice, and go into great detail, as far as they go. Matt promises to update the notes periodically.
(Update 11/18): Matt Brin has let me know by email that a significant gap has emerged in Shavgulidze’s argument. He writes:
In light of this, it would seem to be reasonable to consider the question of whether
is amenable as wide open.
(Update 9/21/2012): Justin Moore has posted a preprint on the arXiv claiming to prove amenability of
. It is too early to suggest that there is expert consensus on the correctness of the proof, but certainly everything I have heard is promising. I have not had time to look carefully at the argument yet, but hope to get a chance to do so before too long.
(Update 10/2/2012): Justin has withdrawn his claim of a proof. A gap was found by Akhmedov.