You are currently browsing the category archive for the ‘Commentary’ category.
I am Alden, one of Danny’s students. Error/naivete that may (will) be found here is mine. In these posts, I will attempt to give notes from Danny’s class on hyperbolic geometry (157b). This first post covers some models for hyperbolic space.
1. Models
We have a very good natural geometric understanding of , i.e. 3-space with the euclidean metric. Pretty much all of our geometric and topological intuition about manifolds (Riemannian or not) comes from finding some reasonable way to embed or immerse them (perhaps locally) in
. Let us look at some examples of 2-manifolds.
- Example (curvature = 1)
with its standard metric embeds in
; moreover, any isometry of
is the restriction of (exactly one) isometry of the ambient space (this group of isometries being
). We could not ask for anything more from an embedding.
- Example (curvature = 0) Planes embed similarly.
- Example (curvature = -1) The pseudosphere gives an example of an isometric embedding of a manifold with constant curvature -1. Consider a person standing in the plane at the origin. The person holds a string attached to a rock at
, and they proceed to walk due east dragging the rock behind them. The movement of the rock is always straight towards the person, and its distance is always 1 (the string does not stretch). The line traced out by the rock is a tractrix. Draw a right triangle with hypotenuse the tangent line to the curve and vertical side a vertical line to the
-axis. The bottom has length
, which shows that the tractrix is the solution to the differential equation

The Tractrix
The surface of revolution about the
-axis is the pseudosphere, an isometric embedding of a surface of constant curvature -1. Like the sphere, there are some isometries of the pseudosphere that we can understand as isometries of
, namely rotations about the
-axis. However, there are lots of isometries which do not extend, so this embeddeding does not serve us all that well.
- Example (hyperbolic space) By the Nash embedding theorem, there is a
immersion of
in
, but by Hilbert, there is no
immersion of any complete hyperbolic surface.That last example is the important one to consider when thinking about hypobolic spaces. Intuitively, manifolds with negative curvature have a hard time fitting in euclidean space because volume grows too fast — there is not enough room for them. The solution is to find (local, or global in the case of
) models for hyperbolic manfolds such that the geometry is distorted from the usual euclidean geometry, but the isometries of the space are clear.
2. 1-Dimensional Models for Hyperbolic Space
While studying 1-dimensional hyperbolic space might seem simplistic, there are nice models such that higher dimensions are simple generalizations of the 1-dimensional case, and we have such a dimensional advantage that our understanding is relatively easy.
2.1. Hyperboloid Model
Parameterizing
Consider the quadratic form
on
defined by
, where
. This doesn’t give a norm, since
is not positive definite, but we can still ask for the set of points
with
. This is (both sheets of) the hyperbola
. Let
be the upper sheet of the hyperbola. This will be 1-dimensional hyperbolic space.
For any
matrix
, let
. That is, matrices which preserve the form given by
. The condition is equivalent to requiring that
. Notice that if we let
be the identity matrix, we would get the regular orthogonal group. We define
, where
has
positive eigenvalues and
negative eigenvalues. Thus
. We similarly define
to be matricies of determinant 1 preserving
, and
to be the connected component of the identity.
is then the group of matrices preserving both orientation and the sheets of the hyperbolas.
We can find an explicit form for the elements of
. Consider the matrix
. Writing down the equations
and
gives us four equations, which we can solve to get the solutions
Since we are interested in the connected component of the identity, we discard the solution on the right. It is useful to do a change of variables
, so we have (recall that
).
These matrices take
to
. In other words,
acts transitively on
with trivial stabilizers, and in particular we have parmeterizing maps
The first map is actually a Lie group isomorphism (with the group action on
being
) in addition to a diffeomorphism, since
Metric
As mentioned above,
is not positive definite, but its restriction to the tangent space of
is. We can see this in the following way: tangent vectors at a point
are characterized by the form
. Specifically,
, since (by a calculation)
. Therefore,
takes tangent vectors to tangent vectors and preserves the form (and is transitive), so we only need to check that the form is positive definite on one tangent space. This is obvious on the tangent space to the point
. Thus,
is a Riemannian manifold, and
acts by isometries.
Let’s use the parameterization
. The unit (in the
metric) tangent at
is
. The distance between the points
and
is
In other words,
is an isometry from
to
.

1-dimensional hyperbollic space. The hyperboloid model is shown in blue, and the projective model is shown in red. An example of the projection map identifying
with
is shown.
2.2. Projective Model
Parameterizing
Real projective space
is the set of lines through the origin in
. We can think about
as
, where
is associated with the line (point in
) intersecting
in
, and
is the horizontal line. There is a natural projection
by projecting a point to the line it is on. Under this projection,
maps to
.
Since
acts on
preserving the lines
, it gives a projective action on
fixing the points
. Now suppose we have any projective linear isomorphism of
fixing
. The isomorphism is represented by a matrix
with eigenvectors
. Since scaling
preserves its projective class, we may assume it has determinant 1. Its eigenvalues are thus
and
. The determinant equation, plus the fact that
Implies that
is of the form of a matrix in
. Therefore, the projective linear structure on
is the “same” (has the same isometry (isomorphism) group) as the hyperbolic (Riemannian) structure on
.
Metric
Clearly, we’re going to use the pushforward metric under the projection of
to
, but it turns out that this metric is a natural choice for other reasons, and it has a nice expression.
The map taking
to
is
. The hyperbolic distance between
and
in
is then
(by the fact from the previous sections that
is an isometry).
Recall the fact that
. Applying this, we get the nice form
We also recall the cross ratio, for which we fix notation as
. Then
Call the numerator of that fraction by
and the denominator by
. Then, recalling that
, we have
Therefore,
.
3. Hilbert Metric
Notice that the expression on the right above has nothing, a priori, to do with the hyperbolic projection. In fact, for any open convex body in
, we can define the Hilbert metric on
by setting
, where
and
are the intersections of the line through
and
with the boundary of
. How is it possible to take the cross ratio, since
are not numbers? The line containing all of them is projectively isomorphic to
, which we can parameterize as
. The cross ratio does not depend on the choice of parameterization, so it is well defined. Note that the Hilbert metric is not necessarily a Riemannian metric, but it does make any open convex set into a metric space.
Therefore, we see that any open convex body in
has a natural metric, and the hyperbolic metric in
agrees with this metric when
is thought of as a open convex set in
.
4. Higher-Dimensional Hyperbolic Space
4.1. Hyperboloid
The higher dimensional hyperbolic spaces are completely analogous to the 1-dimensional case. Consider
with the basis
and the 2-form
. This is the form defined by the matrix
. Define
to be the positive (positive in the
direction) sheet of the hyperbola
.
Let
be the linear transformations preserving the form, so
. This group is generated by
as symmetries of the
plane, together with
as symmetries of the span of the
(this subspace is euclidean). The group
is the set of orientation preserving elements of
which preserve the positive sheet of the hyperboloid (
). This group acts transitively on
with point stabilizers
: this is easiest to see by considering the point
. Here the stabilizer is clearly
, and because
acts transitively, any stabilizer is a conjugate of this.
As in the 1-dimensional case, the metric on
is
, which is invariant under
.
Geodesics in
can be understood by consdering the fixed point sets of isometries, which are always totally geodesic. Here, reflection in a vertical (containing
) plane restricts to an (orientation-reversing, but that’s ok) isometry of
, and the fixed point set is obviously the intersection of this plane with
. Now
is transitive on
, and it sends planes to planes in
, so we have a bijection
{Totally geodesic subspaces through
}
{linear subspaces of
through
}
By considering planes through
, we can see that these totally geodesic subspaces are isometric to lower dimensional hyperbolic spaces.
4.2. Projective
Analogously, we define the projective model as follows: consider the disk
. I.e. the points in the
plane inside the cone
. We can think of
as
, so this disk is
. There is, as before, the natural projection of
to
, and the pushforward of the hyperbolic metric agrees with the Hilbert metric on
as an open convex body in
.
Geodesics in the projective model are the intersections of planes in
with
; that is, they are geodesics in the euclidean space spanned by the
. One interesting consequence of this is that any theorem which is true in euclidean geometry which does not reply on facts about angles is still true for hyperbolic space. For example, Pappus’ hexagon theorem, the proof of which does not use angles, is true.
4.3. Projective Model in Dimension 2
In the case that
, we can understand the projective isomorphisms of
by looking at their actions on the boundary
. The set
is projectively isomorphic to
as an abstract manifold, but it should be noted that
is not a straight line in
, which would be the most natural way to find
‘s embedded in
.
In addition, any projective isomorphism of
can be extended to a real projective isomorphism of
. In other words, we can understand isometries of 2-dimensional hyperbolic space by looking at the action on the boundary. Since
is not a straight line, the extension is not trivial. We now show how to do this.
The automorphisms of
are
. We will consider
. For any Lie group
, there is an Adjoint action
defined by (the derivative of) conjugation. We can similarly define an adjoint action
by the Lie algebra on itself, as
for any path
with
. If the tangent vectors
and
are matrices, then
.
We can define the Killing form
on the Lie algebra by
. Note that
is a matrix, so this makes sense, and the Lie group acts on the tangent space (Lie algebra) preserving this form.
Now let’s look at
specifically. A basis for the tangent space (Lie algebra) is
,
, and
. We can check that
,
, and
. Using these relations plus the antisymmetry of the Lie bracket, we know
Therefore, the matrix for the Killing form in this basis is
This matrix has 2 positive eigenvalues and one negative eigenvalue, so its signature is
. Since
acts on
preserving this form, we have
, otherwise known at the group of isometries of the disk in projective space
, otherwise known as
.
Any element of
(which, recall, was acting on the boundary of projective hyperbolic space
) therefore extends to an element of
, the isometries of hyperbolic space, i.e. we can extend the action over the disk.
This means that we can classify isometries of 2-dimensional hyperbolic space by what they do to the boundary, which is determined generally by their eigevectors (
acts on
by projecting the action on
, so an eigenvector of a matrix corresponds to a fixed line in
, so a fixed point in
. For a matrix
, we have the following:
(elliptic) In this case, there are no real eigenvalues, so no real eigenvectors. The action here is rotation, which extends to a rotation of the entire disk.
(parabolic) There is a single real eigenvector. There is a single fixed point, to which all other points are attracted (in one direction) and repelled from (in the other). For example, the action in projective coordinates sending
to
: infinity is such a fixed point.
(hyperbolic) There are two fixed point, one attracting and one repelling.
-
5. Complex Hyperbolic Space
We can do a construction analogous to real hyperbolic space over the complexes. Define a Hermitian form
on
with coordinates
by
. We will also refer to
as
. The (complex) matrix for this form is
, where
. Complex linear isomorphisms preserving this form are matrices
such that
. This is our definition for
, and we define
to be those elements of
with determinant of norm 1.
The set of points
such that
is not quite what we are looking for: first it is a
real dimensional manifold (not
as we would like for whatever our definition of “complex hyperbolic
space” is), but more importantly,
does not restrict to a positive definite form on the tangent spaces. Call the set of points
where
by
. Consider a point
in
and
in
. As with the real case, by the fact that
is in the tangent space,
Because
is hermitian, the expression on the right does not mean that
, but it does mean that
is purely imaginary. If
, then
, i.e.
is not positive definite on the tangent spaces.
However, we can get rid of this negative definite subspace.
as the complex numbers of unit length (or
, say) acts on
by multiplying coordinates, and this action preserves
: any phase goes away when we apply the absolute value. The quotient of
by this action is
. The isometry group of this space is still
, but now there are point stabilizers because of the action of
. We can think of
inside
as the diagonal matrices, so we can write
And the projectivized matrices
is the group of isometries of
, where the middle
is all vectors in
with
(which we think of as part of complex projective space). We can also approach this group by projectivizing, since that will get rid of the unwanted point stabilizers too: we have
.
5.1. Case
In the case
, we can actually picture
. We can’t picture the original
, but we are looking at the set of
such that
. Notice that
. After projectivizing, we may divide by
, so
. The set of points
which satisfy this is the interior of the unit circle, so this is what we think of for
. The group of complex projective isometries of the disk is
. The straight horizontal line is a geodesic, and the complex isometries send circles to circles, so the geodesics in
are circles perpendicular to the boundary of
in
.
Imagine the real projective model as a disk sitting at height one, and the geodesics are the intersections of planes with the disk. Complex hyperbolic space is the upper hemisphere of a sphere of radius one with equator the boundary of real hyperbolic space. To get the geodesics in complex hyperbolic space, intersect a plane with this upper hemisphere and stereographically project it flat. This gives the familiar Poincare disk model.
5.2. Real
‘s contained in
contains 2 kinds of real hyperbolic spaces. The subset of real points in
is (real)
, so we have a many
. In addition, we have copies of
, which, as discussed above, has the same geometry (i.e. has the same isometry group) as real
. However, these two real hyperbolic spaces are not isometric. the complex hyperbolic space
has a more negative curvature than the real hyperbolic spaces. If we scale the metric on
so that the real hyperbolic spaces have curvature
, then the copies of
will have curvature
.
In a similar vein, there is a symplectic structure on
such that the real
are lagrangian subspaces (the flattest), and the
are symplectic, the most negatively curved.
An important thing to mention is that complex hyperbolic space does not have constant curvature(!).
6. Poincare Disk Model and Upper Half Space Model
The projective models that we have been dealing with have many nice properties, especially the fact that geodesics in hyperbolic space are straight lines in projective space. However, the angles are wrong. There are models in which the straight lines are “curved” i.e. curved in the euclidean metric, but the angles between them are accurate. Here we are interested in a group of isometries which preserves angles, so we are looking at a conformal model. Dimension 2 is special, because complex geometry is real conformal geometry, but nevertheless, there is a model of
in which the isometries of the space are conformal.
Consider the unit disk
in
dimensions. The conformal automorphisms are the maps taking (straight) diameters and arcs of circles perpendicular to the boundary to this same set. This model is abstractly isomorphic to the Klein model in projective space. Imagine the unit disk in a flat plane of height one with an upper hemisphere over it. The geodesics in the Klein model are the intersections of this flat plane with subspaces (so they are straight lines, for example, in dimension 2). Intersecting vertical planes with the upper hemisphere and stereographically projecting it flat give geodesics in the Poincare disk model. The fact that this model is the “same” (up to scaling the metric) as the example above of
is a (nice) coincidence.

The Klein model is the flat disk inside the sphere, and the Poincare disk model is the sphere. Geodesics in the Klein model are intersections of subspaces (the angled plane) with the flat plane at height 1. Geodesics in the Poincare model are intersections of vertical planes with the upper hemisphere. The two darkened geodesics, one in the Klein model and one in the Poincare, correspond under orthogonal projection. We get the usual Poincare disk model by stereographically projecting the upper hemisphere to the disk. The projection of the geodesic is shown as the curved line inside the disk

The Poincare disk model. A few geodesics are shown.
Now we have the Poincare disk model, where the geodesics are straight diameters and arcs of circles perpendicular to the boundary and the isometries are the conformal automorphisms of the unit disk. There is a conformal map from the disk to an open half space (we typically choose to conformally identify it with the upper half space). Conveniently, the hyperbolic metric on the upper half space
can be expressed at a point
(euclidean coordinates) as
. I.e. the hyperbolic metric is just a rescaling (at each point) of the euclidean metric.
One of the important things that we wanted in our models was the ability to realize isometries of the model with isometries of the ambient space. In the case of a one-parameter family of isometries of hyperbolic space, this is possible. Suppose that we have a set of elliptic isometries. Then in the disk model, we can move that point to the origin and realize the isometries by rotations. In the upper half space model, we can move the point to infinity, and realize them by translations.
Geometric group theory is not a coherent and unified field of enquiry so much as a collection of overlapping methods, examples, and contexts. The most important examples of groups are those that arise in nature: free groups and fundamental groups of surfaces, the automorphism groups of the same, lattices, Coxeter and Artin groups, and so on; whereas the most important properties of groups are those that lend themselves to applications or can be used in certain proof templates: linearity, hyperbolicity, orderability, property (T), coherence, amenability, etc. It is natural to confront examples arising in one context with properties that arise in the other, and this is the source of a wealth of (usually very difficult) problems; e.g. do mapping class groups have property (T)? (no, by Andersen) or: is every lattice in virtually orderable?
As remarked above, it is natural to formulate these questions, but not necessarily productive. Gromov, in his essay Spaces and Questions remarks that
often the mirage of naturality lures us into featureless desert with no clear perspective where the solution, even if found, does not quench our thirst for structural mathematics . . . Another approach . . . has a better chance for a successful outcome with questions following (rather than preceding) construction of new objects.
A famous question of the kind Gromov warns against is the following:
Question: Is Thompson’s group amenable?
Recall that Thompson’s group is the group of (orientation-preserving) PL homeomorphisms of the unit interval with breakpoints at dyadic rationals (i.e. rational numbers of the form for integers
) and derivatives all powers of
. This group is a rich source of examples/counterexamples in geometric group theory: it is finitely presented (in fact
) but “looks like” a transformation group; it contains no nonabelian free group (by Brin-Squier), but obeys no law. It is not elementary amenable (i.e. it cannot be built up from finite or abelian groups by elementary operations — subgroups, quotients, extensions, directed unions), so it is “natural” to wonder whether it is amenable at all, or whether it is one of the rare examples of nonamenable groups without nonabelian free subgroups (see this post for a discussion of amenability versus the existence of free subgroups, and von Neumann’s conjecture). This question has attracted a great deal of attention, possibly because of its long historical pedigree, rather than because of the potential applications of a positive (or negative) answer.
Recently, two papers were posted on the arXiv, promising competing resolutions of the question. In February, Azer Akhmedov posted a preprint claiming to show that the group is not amenable. This preprint was revised, withdrawn, then revised again, and as of the end of April, Akhmedov continues to press his claim. Akhmedov’s argument depends on a new geometric criterion for nonamenability, roughly speaking, the existence of a
-generator subgroup and a subadditive non-negative function on the group whose values grow at a definite rate on words in the subgroup whose exponents satisfy suitable parity conditions and inequalities. The non-negative function (Akhmedov calls it a “height function”) certifies the existence of a sufficiently “bushy” subset of the group to violate Folner’s criterion for amenability. Akhmedov’s paper reads like a “conventional” paper in geometric group theory, using methods from coarse geometry, careful combinatorial and counting arguments to establish the existence of a geometric object with certain large-scale properties, and an appeal to a standard geometric criterion to obtain the desired result. Akhmedov’s paper is part of a series, relating (non)amenability to certain other interesting geometric properties, some related to the so-called “traveling salesman” property, introduced earlier by Akhmedov.
On the other hand, in May, E. Shavgulidze posted a preprint claiming to show that the group is amenable. Interestingly enough, Shavgulidze’s argument does not apply to the slightly more general class of Stein-Thompson groups in which slopes and denominators of break points can be divisible by an arbitrary (but prescribed) finite set of prime numbers. Moreover, his methods are very unlike any that one would expect to find in the typical geometric group theory paper. The argument depends on the construction, going back (in some sense) to a paper of Shavgulidze from 1978, of a measure on the space
of continuous functions on the interval which is quasi-invariant under the natural action of the group of diffeomorphisms of the interval of regularity at least
. In more detail, let
denote the group of diffeomorphisms of the interval of regularity at least
for each
, and let
denote the Banach space of continuous functions on the interval that vanish at the origin. Define
by the formula
. The space
can be equipped with a natural measure — the Wiener measure
of variance
, and this measure can be pulled back to
by
, which is thought of as a topological space with the
topology. Shavgulidze shows that the left action of
on
quasi-preserves this measure. Here the Wiener measure on
is the probability measure associated to Brownian motion (with given variance). A “sample” trajectory
from
is characterized by three properties: that it starts at the origin (i.e.
), that is it continuous almost surely (this is already implicit in the fact that the measure is supported on the space
and not some more general space), and that increments are independent, with the distribution of
equal to a Gaussian with mean zero and variance
. Shavgulidze’s argument depends first on an argument of Ghys-Sergiescu that shows Thompson’s group is conjugate (by a homeomorphism) to a discrete subgroup of the group of
diffeomorphisms of the interval. A bounded function
on
determines a continuous bounded function
on
(for
) by a certain convolution trick, using both the group structure of
, and its discreteness in
. Roughly, given an element
, the set of elements of
whose (group) composition with
is uniformly bounded in the
norm is finite; so the value of
is obtained by taking a suitable average of the value of
on this finite subset of
. This reduces the problem of the amenability of
to the existence of a suitable functional on the space of bounded continuous functions on
, which is constructed via the pulled back Wiener measure as above.
There are several distinctive features of Shavgulidze’s preprint. One of the most striking is that it depends on very delicate analytic features of the Wiener measure, and the way it transforms under the action of on
— a transformation law involving the Schwartzian derivative — and suggesting that certain parts of the argument could be clarified (at least from the point of view of a topologist?) by using projective geometry and Sturm-Liouville theory. Another is that the crucial analytic quality — namely differentiability of class
— is also crucial for many other natural problems in
-dimensional analysis and geometry, from regularity estimates in the thin obstacle problem, to Navas’ work on actions of property (T) groups on the circle. At least one of the preprints by Akhmedov and Shavgulidze must be in error (in fact, a real skeptic’s skeptic such as Michael Aschbacher is not even willing to concede that much . . .) but even if wrong, it is possible that they contain things more valuable than a resolution of the question that prompted them.
Update (7/6): Azer Akhmedov sent me a construction of a (nonabelian) free subgroup of that is discrete in the
topology. This is not quite enough regularity to intersect with Shavgulidze’s program, but it is interesting, and worth explaining. This is my (minor) modification of Azer’s construction (any errors are due to me):
Proposition: The group contains a discrete nonabelian free subgroup.
Sketch of Proof: First, decompose the interval into countably many disjoint subintervals accumulating only at the endpoints. Choose a free action on two generators by doing something generic on each subinterval, in such a way that the derivative is equal to
at the endpoints. This can certainly be accomplished; for concreteness, choose the action so that for each subinterval
there is a point
in the interior of
whose stabilizer is trivial.
Second, for each pair of distinct words in the generators, choose a subinterval and modify the action there so that the derivatives of those words in that subinterval differ by at least some definite constant at some point. In more detail: enumerate the pairs of words somehow
where each
is a pair of words
in the generators, and modify the action on the subinterval
so the words in
differ by at least
in the
norm on the interval
. Since we are modifying the generators infinitely many times, but in such a way that the support of the modification exits any compact subset of the interior, we just need to check that the modifications are
. Since there are only finitely many pairs of words, both of which are of bounded length (for any given bound), when
is sufficiently big, one of the words
,
has length at least
where
goes to infinity as
goes to infinity. Without loss of generality, we can order the pairs so that
is the “long” word.
Now this is how we modify the action in . Recall that the point
has trivial stabilizer, so the translates
of
under the suffixes of
are distinct. Take disjoint intervals about the
and observe that each
is taken to
by one of the generators. Modify this generator inside this disjoint neighborhood so that
is still taken to
, but the derivative at that point is multiplied by
, and the derivative at nearby points is not multiplied by more than
. Since the neighborhoods of the
are disjoint, these modifications are all compatible, and the derivative of the generators does not change by more than
at any point. Since
goes to infinity as
goes to infinity, we can perform such modifications for each
, and the resulting action is still
. But now the derivative of
at
has been multiplied by
, so
and
differ by at least
in the
norm. qed.
It is interesting to observe that this construction, while , is not
for any
. For big
, we have
whereas
. Introducing a “bump” which modifies the derivative by
in a subinterval of size
will blow up every Holder norm.
(Update 8/10): Mark Sapir has created a webpage to discuss Shavgulidze’s paper here. Also, Matt Brin has posted notes on Shavgulidze’s paper here. The notes are very nice, and go into great detail, as far as they go. Matt promises to update the notes periodically.
(Update 11/18): Matt Brin has let me know by email that a significant gap has emerged in Shavgulidze’s argument. He writes:
Lemma 5 is still unproven. It claims a property about the distributions
on the simplexes
that is needed for the second part of the paper. The main result does not need the particular distributions
given in the paper, but does need distributions on the
that satisfy the properties claimed by Lemmas 5, 6 and that cooperate with Lemma 9. Ufe Haagerup claims an argument that the
in the paper does not satisfy the conclusion of Lemma 5. Another distribution was said to be suggested by Shavgulidze, but at last report, it did not seem to be working out.
In light of this, it would seem to be reasonable to consider the question of whether is amenable as wide open.
(Update 9/21/2012): Justin Moore has posted a preprint on the arXiv claiming to prove amenability of . It is too early to suggest that there is expert consensus on the correctness of the proof, but certainly everything I have heard is promising. I have not had time to look carefully at the argument yet, but hope to get a chance to do so before too long.
(Update 10/2/2012): Justin has withdrawn his claim of a proof. A gap was found by Akhmedov.
A few days ago, Joel Friedman posted a paper on the arXiv purporting to give a proof of the (strengthened) Hanna Neumann conjecture, a well-known problem in geometric group theory.
Simply stated, the problem is as follows.
Conjecture (Hanna Neumann): Let be a free group, and let
and
be finitely generated subgroups. For a subgroup
of
, let
. Then there is an inequality
.
This conjecture was further strengthened by Walter Neumann (her son):
Conjecture (strengthened Hanna Neumann): With notation above, there is an inequality where the sum is taken over
, i.e. the double coset representatives.
Notice by the way that since any free group embeds into , the free group of rank
, one can assume that
has rank
above. This fact is implicit in the discussion below.
Friedman’s paper seems to be very carefully written, and contains some new ideas (which I do not yet really understand), namely an approach using sheaf theory. But in this post I want to restrict myself to some simple (and probably well-known) geometric observations.
The first step is to reduce the problem to a completely graph-theoretic one, following Stallings; in fact, Benson Farb tells me that he thinks this reduction was known to Stallings, or at least to Dicks/Formanek (and in any case is very close to some ideas Stallings and Gersten introduced to study the problem; more on that in a later post). Friedman makes the following definition:
Definition: Let be a finite group and
be two elements (that do not necessarily generate
). The directed Cayley graph
is the graph with vertex set
and with a directed edge from
to
labeled
for each
and
.
In other words, is a graph whose edges are oriented and labeled with either
or
in such a way that each vertex has at most one outgoing and one incoming edge with each label, and such that there is a transitive (on the vertices) free action of a group
on
. (Note: for some reason, Friedman wants his group to act on the right, and therefore has directed edges from
to
, but this is just a matter of convention).
For any finite graph , not necessarily connected, let
; i.e.
where the sum is taken over the connected components
of
. Friedman shows (but this reduction is well-known) that the SHNC is equivalent to the following graph-theoretic inequality:
Theorem: The SHNC is equivalent to the following statement. For any graph as above, and any two subgraphs
we have
.
The purpose of this blog entry is to show that there is a very simple proof of this inequality when is replaced with
. This is not such a strange thing to do, since
and
are equal for graphs without acyclic components (i.e. without components that are trees), and for “random” graphs
one does not expect the difference between
and
to be very big. The argument proceeds as follows. Suppose
has
vertices and
edges of kind
respectively, and define
similarly for
. Then
On the other hand, since Euler characteristic is local, we just need to count how many vertices and edges of each kind turn up in each . But this is easy: every vertex of
is equal to exactly one translate of every vertex of
, and similarly for edges of each kind. Hence
So the inequality one wants to show is which simplifies to
On the other hand, each graph has at most two edges at any vertex with either label, and therefore we have inequalities
. Subject to these constraints, the inequality above is straightforward to prove. To see this, first fix some non-negative values of
and let
be the four-dimensional cube of possible values of
. Since both sides of the inequality are linear as a function of each
or
, if the inequality is violated at any point in
one may draw a straight line in
corresponding to varying one of the co-ordinates (e.g.
) while keeping the others fixed, and deduce that the inequality must be violated on one of the faces of
. Inductively, if the inequality is violated at all, it is violated at a vertex of
, which may be ruled out by inspection; qed.
This argument shows that the whole game is to understand the acyclic components of ; i.e. those which are topologically trees, and therefore contribute
to
, but
to
.
Incidentally, for all I know, this simple argument is explicitly contained in either Stallings’ or Gersten’s paper (it is surely not original in any case). If a reader can verify this, please let me know!
Update: Walter Neumann informs me that this observation (that the inequality is true with in place of
) is in his paper in which he introduces the SHNC! He further shows in that paper that for “most”
, the SHNC is true for all
.
Update (6/29): Warren Dicks informs me that he was not aware of the reduction of SHNC to the graph-theoretic formulation described above. Friedman’s webpage acknowledges the existence of an error in the paper, and says that he is working to correct it. One problem that I know of (discovered mostly by my student Steven Frankel) concerns the commutativity of the diagram on page 10.
Update (10/22): It has been a few months since I last edited this page, and Joel Friedman has not updated either the arXiv paper, or the statement on his webpage that he is “trying to fix the error”. Since wikipedia mentions Friedman’s announcement, I thought it would be worth going on record at this point to say that Friedman’s arXiv paper (version 1 — the only version at the point I write this) is definitely in error, and that I believe the error is fundamental, and cannot be repaired (this is not to say that the paper does not contain some things of interest (it does), or that Friedman does not acknowledge the error (he does), just that it is worth clearing up any possible ambiguity about the situation for readers who are wondering about the status of the SHNC). The problem is the “not entirely standard” (quote from Friedman’s paper) diagrams, like the one on page 10. In particular, the claimed proof of Theorem 5.6, that the projections constructed in Lemma 5.5 (by a very general dimension counting argument) fit into a diagram with the desired properties is false. Any construction of projections satisfying the desired properties must be quite special. Nevertheless, one can certainly still define Friedman’s sheaf , and ask whether it has
(in Friedman’s sense); this would, as far as I can tell, prove SHNC; however, I do not know of any reason why it should hold (or whether there are any counterexamples, which might exist even if SHNC is true).

Bill Thurston 1946-2012
August 22, 2012 in Commentary, Uncategorized | Tags: Bill Thurston, obituary | by Danny Calegari | 11 comments
This morning I heard the awful news that Bill Thurston died last night. Many of us knew that Bill was very ill, but we all hoped (or imagined?) that he would still be with us for a while yet, and the suddenness of this is very harsh. As Sarah Koch put it in an email to me, “Although this was not unexpected, it is still shocking.” On the other hand, I am glad to hear that he was surrounded by family, and died peacefully.
I counted Bill as my friend, as well as my mentor, and I have many vivid and happy memories of time I spent with him. I hope that writing down a few of these reminiscences will be cathartic for me, and for others who are coping with this loss.
Read the rest of this entry »