You are currently browsing the tag archive for the ‘hyperbolic groups’ tag.
A few weeks ago, Ian Agol, Vlad Markovic, Ursula Hamenstadt and I organized a “hot topics” workshop at MSRI with the title Surface subgroups and cube complexes. The conference was pretty well attended, and (I believe) was a big success; the organizers clearly deserve a great deal of credit. The talks were excellent, and touched on a wide range of subjects, and to those of us who are mid-career or older it was a bit shocking to see how quickly the landscape of low-dimensional geometry/topology and geometric group theory has been transformed by the recent breakthrough work of (Kahn-Markovic-Haglund-Wise-Groves-Manning-etc.-) Agol. Incidentally, when I first started as a graduate student, I had a vague sense that I had somehow “missed the boat” — all the exciting developments in geometry due to Thurston, Sullivan, Gromov, Freedman, Donaldson, Eliashberg etc. had taken place 10-20 years earlier, and the subject now seemed to be a matter of fleshing out the consequences of these big breakthroughs. 20 years and several revolutions later, I no longer feel this way. (Another slightly shocking aspect of the workshop was for me to realize that I am older or about as old as 75% of the speakers . . .)
The rationale for the workshop (which I had some hand in drafting, and therefore feel comfortable quoting here) was the following:
Recently there has been substantial progress in our understanding of the related questions of which hyperbolic groups are cubulated on the one hand, and which contain a surface subgroup on the other. The most spectacular combination of these two ideas has been in 3-manifold topology, which has seen the resolution of many long-standing conjectures. In turn, the resolution of these conjectures has led to a new point of view in geometric group theory, and the introduction of powerful new tools and structures. The goal of this conference will be to explore the further potential of these new tools and perspectives, and to encourage communication between researchers working in various related fields.
I have blogged a bit about cubulated groups and surface subgroups previously, and I even began this blog (almost 4 years ago now) initially with the idea of chronicling my efforts to attack Gromov’s surface subgroup question. This question asks the following:
Gromov’s Surface Subgroup Question: Does every one-ended hyperbolic group contain a subgroup which is isomorphic to the fundamental group of a closed surface of genus at least 2?
The restriction to one-ended groups is just meant to rule out silly examples, like finite or virtually cyclic groups (i.e. “elementary” hyperbolic groups), or free products of simpler hyperbolic groups. Asking for the genus of the closed surface to be at least 2 rules out the sphere (whose fundamental group is trivial) and the torus (whose fundamental group cannot be a subgroup of a hyperbolic group). It is the purpose of this blog post to say that Alden Walker and I have managed to show that Gromov’s question has a positive answer for “most” hyperbolic groups; more precisely, we show that a random group (in the sense of Gromov) contains a surface subgroup (in fact, many surface subgroups) with probability going to 1 as a certain natural parameter (the “length”
of the random relators) goes to infinity. (update April 8: the preprint is available from the arXiv here.)
Today Jason Manning gave a talk on a vital ingredient in the proof of Agol’s theorem, which is a result in geometric group theory. The theorem is a joint project of Agol-Groves-Manning, and generalizes some earlier work they did a few years ago. Jason referred to the main theorem during his talk as the “Goal Theorem” (I guess it was the goal of his lecture), but I’m going to call it the Weak Separation Theorem, since that is a somewhat more descriptive name. The statement of the theorem is as follows.
Weak Separation Theorem (Agol-Groves-Manning): Let G be a hyperbolic group, let H be a subgroup of G which is quasiconvex, and isomorphic to the fundamental group of a virtually special NPC cube complex, and let g be an element of G which is not contained in H. Then there is a surjection so that
is hyperbolic;
is finite; and
is not contained in
.
In the remainder of this post I will try to explain the proof of this theorem, to the extent that I understand it. Basically, this amounts to my summarizing Manning’s talk (or the part of it that I managed to get down in my notes); again, any errors, foolishness, silly blog post titles etc. are due to me.
Last Friday, Henry Wilton gave a talk at Caltech about his recent joint work with Sang-hyun Kim on polygonal words in free groups. Their work is motivated by the following well-known question of Gromov:
Question(Gromov): Let be a one-ended word-hyperbolic group. Does
contain a subgroup isomorphic to the fundamental group of a closed hyperbolic surface?
Let me briefly say what “one-ended” and “word-hyperbolic” mean.
A group is said to be word-hyperbolic if it acts properly and cocompactly by isometries on a proper -hyperbolic path metric space — i.e. a path metric space in which there is a constant
so that geodesic triangles in the metric space have the property that each side of the triangle is contained in the
-neighborhood of the union of the other two sides (colloquially, triangles are thin). This condition distills the essence of negative curvature in the large, and was shown by Gromov to be equivalent to several other conditions (eg. that the group satisfies a linear isoperimetric inequality; that every ultralimit of the group is an
-tree). Free groups are hyperbolic; fundamental groups of closed manifolds with negative sectional curvature (eg surfaces with negative Euler characteristic) are word-hyperbolic; “random” groups are hyperbolic — and so on. In fact, it is an open question whether a group
that admits a finite
is word hyperbolic if and only if it does not contain a copy of a Baumslag-Solitar group
for
(note that the group
is the special case
); in any case, this is a very good heuristic for identifying the word-hyperbolic groups one typically meets in examples.
If is a finitely generated group, the ends of
really means the ends (as defined by Freudenthal) of the Cayley graph of
with respect to some finite generating set. Given a proper topological space
, the set of compact subsets of
gives rise to an inverse system of inclusions, where
includes into
whenever
is a subset of
. This inverse system defines an inverse system of maps of discrete spaces
, and the inverse limit of this system is a compact, totally disconnected space
, called the space of ends of
. A proper topological space is canonically compactified by its set of ends; in fact, the compactification
is the “biggest” compactification of
by a totally disconnected space, in the sense that for any other compactification
where
is zero dimensional, there is a continuous map
which is the identity on
.
For a word-hyperbolic group , the Cayley graph can be compactified by adding the ideal boundary
, but this is typically not totally disconnected. In this case, the ends of
can be recovered as the components of
.
A group acts on its own ends
. An elementary argument shows that the cardinality of
is one of
(if a compact set
disconnects
then infinitely many translates of
converging to
separate
from infinitely many other ends accumulating on
). A group has no ends if and only if it is finite. Stallings famously showed that a (finitely generated) group has at least
ends if and only if it admits a nontrivial description as an HNN extension or amalgamated free product over a finite group. One version of the argument proceeds more or less as follows, at least when
is finitely presented. Let
be an
-dimensional Riemannian manifold with fundamental group
, and let
denote the universal cover. We can identify the ends of
with the ends of
. Let
be a least (
-dimensional) area hypersurface in
amongst all hypersurfaces that separate some end from some other (here the hypothesis that
has at least two ends is used). Then every translate of
by an element of
is either equal to
or disjoint from it, or else one could use the Meeks-Yau “roundoff trick” to find a new
with strictly lower area than
. The translates of
decompose
into pieces, and one can build a tree
whose vertices correspond to to components of
, and whose edges correspond to the translates
. The group
acts on this tree, with finite edge stabilizers (by the compactness of
), exhibiting
either as an HNN extension or an amalgamated product over the edge stabilizers. Note that the special case
occurs if and only if
has a finite index subgroup which is isomorphic to
.
Free groups and virtually free groups do not contain closed surface subgroups; Gromov’s question more or less asks whether these are the only examples of word-hyperbolic groups with this property.
Kim and Wilton study Gromov’s question in a very, very concrete case, namely that case that is the double of a free group
along a word
; i.e.
(hereafter denoted
). Such groups are known to be one-ended if and only if
is not contained in a proper free factor of
(it is clear that this condition is necessary), and to be hyperbolic if and only if
is not a proper power, by a result of Bestvina-Feighn. To see that this condition is necessary, observe that the double
is isomorphic to the fundamental group of a Seifert fiber space, with base space a disk with two orbifold points of order
; such a group contains a
. One might think that such groups are too simple to give an insight into Gromov’s question. However, these groups (or perhaps the slightly larger class of graphs of free groups with cyclic edge groups) are a critical case for at least two reasons:
- The “smaller” a group is, the less room there is inside it for a surface group; thus the “simplest” groups should have the best chance of being a counterexample to Gromov’s question.
- If
is word-hyperbolic and one-ended, one can try to find a surface subgroup by first looking for a graph of free groups
in
, and then looking for a surface group in
. Since a closed surface group is itself a graph of free groups, one cannot “miss” any surface groups this way.
Not too long ago, I found an interesting construction of surface groups in certain graphs of free groups with cyclic edge groups. In fact, I showed that every nontrivial element of in such a group is virtually represented by a sum of surface subgroups. Such surface subgroups are obtained by finding maps of surface groups into
which minimize the Gromov norm in their (projective) homology class. I think it is useful to extend Gromov’s question by making the following
Conjecture: Let be a word-hyperbolic group, and let
be nonzero. Then some multiple of
is represented by a norm-minimizing surface (which is necessarily
-injective).
Note that this conjecture does not generalize to wider classes of groups. There are even examples of groups
with nonzero homology classes
with positive, rational Gromov norm, for which there are no
-injective surfaces representing a multiple of
at all.
It is time to define polygonal words in free groups.
Definition: Let be free. Let
be a wedge of circles whose edges are free generators for
. A cyclically reduced word
in these generators is polygonal if there exists a van-Kampen graph
on a surface
such that:
- every complementary region is a disk whose boundary is a nontrivial (possibly negative) power of
;
- the (labelled) graph
immerses in
in a label preserving way;
- the Euler characteristic of
is strictly less than the number of disks.
The last condition rules out trivial examples; for example, the double of a single disk whose boundary is labeled by . Notice that it is very important to allow both positive and negative powers of
as boundaries of complementary regions. In fact, if
is not in the commutator subgroup, then the sum of the powers over all complementary regions is necessarily zero (and if
is in the commutator subgroup, then
has nontrivial
, so one already knows that there is a surface subgroup).
Condition 2. means that at each vertex of , there is at most one oriented label corresponding to each generator of
or its inverse. This is really the crucial geometric property. If
is a van-Kampen graph as above, then a theorem of Marshall Hall implies that there is a finite cover of
into which
embeds (in fact, this observation underlies Stallings’s work on foldings of graphs). If we build a
-complex
with
by attaching two ends of a cylinder to suitable loops in two copies of
, then a tubular neighborhood of
in
(i.e. what is sometimes called a “fatgraph” ) embeds in a finite cover
of
, and its double — a surface of strictly negative Euler characteristic — embeds as a closed surface in
, and is therefore
-injective. Hence if
is polygonal,
contains a surface subgroup.
Not every word is polygonal. Kim-Wilton discuss some interesting examples in their paper, including:
- suppose
is a cyclically reduced product of proper powers of the generators or their inverses (e.g a word like
but not a word like
); then
is polygonal;
- a word of the form
is polygonal if
for each
;
- the word
is not polygonal.
To see 3, suppose there were a van-Kampen diagram with more disks than Euler characteristic. Then there must be some vertex of valence at least . Since
is positive, the complementary regions must have boundaries which alternate between positive and negative powers of
, so the degree of the vertex must be even. On the other hand, since
must immerse in a wedge of two circles, the degree of every vertex must be at most
, so there is consequently some vertex of degree exactly
. Since each
is isolated, at least
edges must be labelled
; hence exactly two. Hence exactly two edges are labelled
. But one of these must be incoming and one outgoing, and therefore these are adjacent, contrary to the fact that
does not contain a
.
1 above is quite striking to me. When is in the commutator subgroup, one can consider van-Kampen diagrams as above without the injectivity property, but with the property that every power of
on the boundary of a disk is positive; call such a van-Kampen diagram monotone. It turns out that monotone van-Kampen diagrams always exist when
, and in fact that norm-minimizing surfaces representing powers of the generator of
are associated to certain monotone diagrams. The construction of such surfaces is an important step in the argument that stable commutator length (a kind of relative Gromov norm) is rational in free groups. In my paper scl, sails and surgery I showed that monomorphisms of free groups that send every generator to a power of that generator induce isometries of the
norm; in other words, there is a natural correspondence between certain equivalence classes of monotone surfaces for an arbitrary word in
and for a word of the kind that Kim-Wilton show is polygonal (Note: Henry Wilton tells me that Brady, Forester and Martinez-Pedroza have independently shown that
contains a surface group for such
, but I have not seen their preprint (though I would be very grateful to get a copy!)).
In any case, if not every word is polygonal, all is not lost. To show that contains a surface subgroup is suffices to show that
contains a surface subgroup, where
and
differ by an automorphism of
. Kim-Wilton conjecture that one can always find an automorphism
so that
is polygonal. In fact, they make the following:
Conjecture (Kim-Wilton; tiling conjecture): A word not contained in a proper free factor of shortest length (in a given generating set) in its orbit under
is polygonal.
If true, this would give a positive answer to Gromov’s question for groups of the form .
The purpose of this post is to discuss my recent paper with Koji Fujiwara, which will shortly appear in Ergodic Theory and Dynamical Systems, both for its own sake, and in order to motivate some open questions that I find very intriguing. The content of the paper is a mixture of ergodic theory, geometric group theory, and computer science, and was partly inspired by a paper of Jean-Claude Picaud. To state the results of the paper, I must first introduce a few definitions and some background.
Let be a finite directed graph (hereafter a digraph) with an initial vertex, and edges labeled by elements of a finite set
in such a way that each vertex has at most one outgoing edge with any given label. A finite directed path in
starting at the initial vertex determines a word in the alphabet
, by reading the labels on the edges traversed (in order). The set
of words obtained in this way is an example of what is called a regular language, and is said to be parameterized by
. Note that this is not the most general kind of regular language; in particular, any language
of this kind will necessarily be prefix-closed (i.e. if
then every prefix of
is also in
). Note also that different digraphs might parameterize the same (prefix-closed) regular language
.
If is a set of generators for a group
, there is an obvious map
called the evaluation map that takes a word
to the element of
represented by that word.
Definition: Let be a group, and
a finite generating set. A combing of
is a (prefix-closed) regular language
for which the evaluation map
is a bijection, and such that every
represents a geodesic in
.
The intuition behind this definition is that the set of words in determines a directed spanning tree in the Cayley graph
starting at
, and such that every directed path in the tree is a geodesic in
. Note that there are other definitions of combing in the literature; for example, some authors do not require the evaluation map to be a bijection, but only a coarse bijection.
Fundamental to the theory of combings is the following Theorem, which paraphrases one of the main results of this paper:
Theorem: (Cannon) Let be a hyperbolic group, and let
be a finite generating set. Choose a total order on the elements of
. Then the language
of lexicographically first geodesics in
is a combing.
The language described in this theorem is obviously geodesic and prefix-closed, and the evaluation map is bijective; the content of the theorem is that
is regular, and parameterized by some finite digraph
. In the sequel, we restrict attention exclusively to hyperbolic groups
.
Given a (hyperbolic) group , a generating set
, a combing
, one makes the following definition:
Definition: A function is weakly combable (with respect to
) if there is a digraph
parameterizing
and a function
from the vertices of
to
so that for any
, corresponding to a path
in
, there is an equality
.
In other words, a function is weakly combable if it can be obtained by “integrating” a function
along the paths of a combing. One furthermore says that a function is combable if it changes by a bounded amount under right-multiplication by an element of
, and bicombable if it changes by a bounded amount under either left or right multiplication by an element of
. The property of being (bi-)combable does not depend on the choice of a generating set
or a combing
.
Example: Word length (with respect to a given generating set ) is bicombable.
Example: Let be a homomorphism. Then
is bicombable.
Example: The Brooks counting quasimorphisms (on a free group) and the Epstein-Fujiwara counting quasimorphisms are bicombable.
Example: The sum or difference of two (bi-)combable functions is (bi-)combable.
A particularly interesting example is the following:
Example: Let be a finite set which generates
as a semigroup. Let
denote word length with respect to
, and
denote word length with respect to
(which also generates
as a semigroup). Then the difference
is a bicombable quasimorphism.
The main theorem proved in the paper concerns the statistical distribution of values of a bicombable function.
Theorem: Let be a hyperbolic group, and let
be a bicombable function on
. Let
be the value of
on a random word in
of length
(with respect to a certain measure
depending on a choice of generating set). Then there are algebraic numbers
and
so that as distributions,
converges to a normal distribution with standard deviation
.
One interesting corollary concerns the length of typical words in one generating set versus another. The first thing that every geometric group theorist learns is that if are two finite generating sets for a group
, then there is a constant
so that every word of length
in one generating set has length at most
and at least
in the other generating set. If one considers an example like
, one sees that this is the best possible estimate, even statistically. However, if one restricts attention to a hyperbolic group
, then one can do much better for typical words:
Corollary: Let be hyperbolic, and let
be two finite generating sets. There is an algebraic number
so that almost all words of length
with respect to the
generating set have length almost equal to
with respect to the
generating set, with error of size
.
Let me indicate very briefly how the proof of the theorem goes.
Sketch of Proof: Let be bicombable, and let
be a function from the vertices of
to
, where
is a digraph parameterizing
. There is a bijection between the set of elements in
of word length
and the set of directed paths in
of length
that start at the initial vertex. So to understand the distribution of
, we need to understand the behaviour of a typical long path in
.
Define a component of to be a maximal subgraph with the property that there is a directed path (in the component) from any vertex to any other vertex. One can define a new digraph
without loops, with one vertex for each component of
, in an obvious way. Each component
determines an adjacency matrix
, with
-entry equal to
if there is a directed edge from vertex
to vertex
, and equal to
otherwise. A component
is big if the biggest real eigenvalue
of
is at least as big as the biggest real eigenvalue of the matrices associated to every other component. A random long walk in
will spend most of its time entirely in big components, so these are the only components we need to consider to understand the statistical distribution of
.
A theorem of Coornaert implies that there are no big components of in series; i.e. there are no directed paths in
from one big component to another (one also says that the big components do not communicate). This means that a typical long walk in
is entirely contained in a single big component, except for a (relatively short) path at the start and the end of the walk. So the distribution of
gets independent contributions, one from each big component.
The contribution from an individual big component is not hard to understand: the central limit theorem for stationary Markov chains says that for elements of corresponding to paths that spend almost all their time in a given big component
there is a central limit theorem
where the mean
and standard deviation
depend only on
. The problem is to show that the means and standard deviations associated to different big components are the same. Everything up to this point only depends on weak combability of
; to finish the proof one must use bicombability.
It is not hard to show that if is a typical infinite walk in a component
, then the subpaths of
of length
are distributed like random walks of length
in
. What this means is that the mean and standard deviation
associated to a big component
can be recovered from the distribution of
on a single infinite “typical” path in
. Such an infinite path corresponds to an infinite geodesic in
, converging to a definite point in the Gromov boundary
. Another theorem of Coornaert (from the same paper) says that the action of
on its boundary
is ergodic with respect to a certain natural measure called a Patterson-Sullivan measure (see Coornaert’s paper for details). This means that there are typical infinite geodesics
associated to components
and
for which some
takes
to a geodesic
ending at the same point in
as
. Bicombability implies that the values of
on
and
differ by a bounded amount. Moreover, since
and
are asymptotic to the same point at infinity, combability implies that the values of
on
and
also differ by a bounded amount. This is enough to deduce that
and
, and one obtains a (global) central limit theorem for
on
. qed.
This obviously raises several questions, some of which seem very hard, including:
Question 1: Let be an arbitrary quasimorphism on a hyperbolic group
(even the case
is free is interesting). Does
satisfy a central limit theorem?
Question 2: Let be an arbitrary quasimorphism on a hyperbolic group
. Does
satisfy a central limit theorem with respect to a random walk on
? (i.e. one considers the distribution of values of
not on the set of elements of
of word length
, but on the set of elements obtained by a random walk on
of length
, and lets
go to infinity)
All bicombable quasimorphisms satisfy an important property which is essential to our proof of the central limit theorem: they are local, which is to say, they are defined as a sum of local contributions. In the continuous world, they are the analogue of the so-called de Rham quasimorphisms on where
is a closed negatively curved Riemannian manifold; such quasimorphisms are defined by choosing a
-form
, and defining
to be equal to the integral
, where
is the closed oriented based geodesic in
in the homotopy class of
. De Rham quasimorphisms, being local, also satisfy a central limit theorem.
This locality manifests itself in another way, in terms of defects. Let be a quasimorphism on a hyperbolic group
. Recall that the defect
is the supremum of
over all pairs of elements
. A quasimorphism is further said to be homogeneous if
for all integers
. If
is an arbitrary quasimorphism, one may homogenize it by taking a limit
; one says that
is the homogenization of
in this case. Homogenization typically does not preserve defects; however, there is an inequality
. If
is local, one expects this inequality to be an equality. For, in a hyperbolic group, the contribution to the defect of a local quasimorphism all arises from the interaction of the suffix of (a geodesic word representing the element)
with the prefix of
(with notation as above). When one homogenizes, one picks up another contribution to the defect from the interaction of the prefix of
with the suffix of
; since these two contributions are essentially independent, one expects that homogenizing a local quasimorphism should exactly double the defect. This is the case for bicombable and de Rham quasimorphisms, and can perhaps be used to define locality for a quasimorphism on an arbitrary group.
This discussion provokes the following key question:
Question 3: Let be a group, and let
be a homogeneous quasimorphism. Is there a quasimorphism
with homogenization
, satisfying
?
Example: The answer to question 3 is “yes” if is the rotation quasimorphism associated to an action of
on
by orientation-preserving homeomorphisms (this is nontrivial; see Proposition 4.70 from my monograph).
Example: Let be any homologically trivial group
-boundary. Then there is some extremal homogeneous quasimorphism
for
(i.e. a quasimorphism achieving equality
under generalized Bavard duality; see this post) for which there is
with homogenization
satisfying
. Consequently, if every point in the boundary of the unit ball in the
norm is contained in a unique supporting hyperplane, the answer to question 3 is “yes” for any quasimorphism on
.
Any quasimorphism on can be pulled back to a quasimorphism on a free group, but this does not seem to make anything easier. In particular, question 3 is completely open (as far as I know) when
is a free group. An interesting test case might be the homogenization of an infinite sum of Brooks functions
for some infinite non-nested family of words
.
If the answer to this question is false, and one can find a homogeneous quasimorphism which is not the homogenization of any “local” quasimorphism, then perhaps
does not satisfy a central limit theorem. One can try to approach this problem from the other direction:
Question 4: Given a function defined on the ball of radius
in a free group
, one defines the defect
in the usual way, restricted to pairs of elements
for which
are all of length at most
. Under what conditions can
be extended to a function on the ball of radius
without increasing the defect?
If one had a good procedure for building a quasimorphism “by hand” (so to speak), one could try to build a quasimorphism that failed to satisfy a central limit theorem, or perhaps find reasons why this was impossible.
More ambitious than simply showing that a group is infinite is to show that it contains an infinite subgroup of a certain kind. One of the most important kinds of subgroup to study are free groups. Hence, one is interested in the question:
Question: When does a group contain a (nonabelian) free subgroup?
Again, one can (and does) ask this question both about a specific group, and about certain classes of groups, or for a typical (in some sense) group from some given family.
Example: If is a property of groups that is inherited by subgroups, then if no free group satisfies
, no group that satisfies
can contain a free subgroup. An important property of this kind is amenability. A (discrete) group
is amenable if it admits an invariant mean; that is, if there is a linear map
(i.e. a way to define the average of a bounded function over
) satisfying three basic properties:
if
(i.e. the average of a non-negative function is non-negative)
where
is the constant function taking the value
everywhere on
(i.e. the average of the constant function
is normalized to be
)
for every
and
, where
(i.e. the mean is invariant under the obvious action of
on
)
If is a subgroup of
, there are (many)
-invariant homomorphisms
taking non-negative functions to non-negative functions, and
to
; for example, the (left) action of
on
breaks up into a collection of copies of
acting on itself, right-multiplied by a collection of right coset representatives. After choosing such a choice of representatives
, one for each coset
, we can define
. Composing with
shows that every subgroup of an amenable group is amenable (this is harder to see in the “geometric” definition of amenable groups in terms of Folner sets). On the other hand, as is well-known, a nonabelian free group is not amenable. Hence, amenable groups do not contain nonabelian free subgroups.
The usual way to see that a nonabelian free group is not amenable is to observe that it contains enough disjoint “copies” of big subsets. For concreteness, let denote the free group on two generators
, and write their inverses as
. Let
denote the set of reduced words that start with either
or
, and let
denote the indicator functions of
respectively. We suppose that
is amenable, and derive a contradiction. Note that
, so
. Let
denote the set of reduced words that start with one of the strings
, and let
denote the indicator function of
. Notice that
is made of two disjoint copies of each of
. So on the one hand,
, but on the other hand,
.
Conversely, the usual way to show that a group is amenable is to use the Folner condition. Suppose that
is finitely generated by some subset
, and let
denote the Cayley graph of
(so that
is a homogeneous locally finite graph). Suppose one can find finite subsets
of vertices so that
(here
means the number of vertices in
, and
means the number of vertices in
that share an edge with
). Since the “boundary” of
is small compared to
, averaging a bounded function over
is an “almost invariant” mean; a weak limit (in the dual space to
) is an invariant mean. Examples of amenable groups include
- Finite groups
- Abelian groups
- Unions and extensions of amenable groups
- Groups of subexponential growth
and many others. For instance, virtually solvable groups (i.e. groups containing a solvable subgroup with finite index) are amenable.
Example: No amenable group can contain a nonabelian free subgroup. The von Neumann conjecture asked whether the converse was true. This conjecture was disproved by Olshanskii. Subsequently, Adyan showed that the infinite free Burnside groups are not amenable. These are groups with
generators, and subject only to the relations that the
th power of every element is trivial. When
is odd and at least
, these groups are infinite and nonamenable. Since they are torsion groups, they do not even contain a copy of
, let alone a nonabelian free group!
Example: The Burnside groups are examples of groups that obey a law; i.e. there is a word in finitely many free variables, such that
for every choice of
. For example, an abelian group satisfies the law
. Evidently, a group that obeys a law does not contain a nonabelian free subgroup. However, there are examples of groups which do not obey a law, but which also do not contain any nonabelian free subgroup. An example is the classical Thompson’s group
, which is the group of orientation-preserving piecewise-linear homeomorphisms of
with finitely many breakpoints at dyadic rationals (i.e. points of the form
for integers
) and with slopes integral powers of
. To see that this group does not obey a law, one can show (quite easily) that in fact
is dense (in the
topology) in the group
of all orientation-preserving homeomorphisms of the interval. This latter group contains nonabelian free groups; by approximating the generators of such a group arbitrarily closely, one obtains pairs of elements in
that do not satisfy any identity of length shorter than any given constant. On the other hand, a famous theorem of Brin-Squier says that
does not contain any nonabelian free subgroup. In fact, the entire group
does not contain any nonabelian free subgroup. A short proof of this fact can be found in my paper as a corollary of the fact that every subgroup
of
has vanishing stable commutator length; since stable commutator length is nonvanishing in nonabelian free groups, this shows that there are no such subgroups of
. (Incidentally, and complementarily, there is a very short proof that stable commutator length vanishes on any group that obeys a law; we will give this proof in a subsequent post).
Example: If surjects onto
, and
contains a free subgroup
, then there is a section from
to
(by freeness), and therefore
contains a free subgroup.
Example: The most useful way to show that contains a nonabelian free subgroup is to find a suitable action of
on some space
. The following is known as Klein’s ping-pong lemma. Suppose one can find disjoint subsets
and
of
, and elements
so that
,
, and similarly interchanging the roles of
and
. If
is a reduced word in
, one can follow the trajectory of a point under the orbit of subwords of
to verify that
is nontrivial. The most common way to apply this in practice is when
act on
with source-sink dynamics; i.e. the element
has two fixed points
so that every other point converges to
under positive powers of
, and to
under negative powers of
. Similarly,
has two fixed points
with similar dynamics. If the points
are disjoint, and
is compact, one can take any small open neighborhoods
of
, and then sufficiently large powers of
and
will satisfy the hypotheses of ping-pong.
Example: Every hyperbolic group acts on its Gromov boundary
. This boundary is the set of equivalence classes of quasigeodesic rays in (the Cayley graph of)
, where two rays are equivalent if they are a finite Hausdorff distance apart. Non-torsion elements act on the boundary with source-sink dynamics. Consequently, every pair of non-torsion elements in a hyperbolic group either generate a virtually cyclic group, or have powers that generate a nonabelian free group.
It is striking to see how easy it is to construct nonabelian free subgroups of a hyperbolic group, and how difficult to construct closed surface subgroups. We will return to the example of hyperbolic groups in a future post.
Example: The Tits alternative says that any linear group (i.e. any subgroup of
for some
) either contains a nonabelian free subgroup, or is virtually solvable (and therefore amenable). This can be derived from ping-pong, where
is made to act on certain spaces derived from the linear action (e.g. locally symmetric spaces compactified in certain ways, and buildings associated to discrete valuations on the ring of entries of matrix elements of
).
Example: There is a Tits alternative for subgroups of other kinds of groups, for example mapping class groups, as shown by Ivanov and McCarthy. The mapping class group (of a surface) acts on the Thurston boundary of Teichmuller space. Every subgroup of the mapping class group either contains a nonabelian free subgroup, or is virtually abelian. Roughly speaking, either elements move points in the boundary with enough dynamics to be able to do ping-pong, or else the action is “localized” in a train-track chart, and one obtains a linear representation of the group (enough to apply the ordinary Tits alternative). Virtually solvable subgroups of mapping class groups are virtually abelian.
Example: A similar Tits alternative holds for . This was shown by Bestvina-Feighn-Handel in these three papers (the third paper shows that solvable subgroups are virtually abelian, thus emphasizing the parallels with mapping class groups).
Example: If is a finitely generated group of homeomorphisms of
, then there is a kind of Tits alternative, first proposed by Ghys, and proved by Margulis: either
preserves a probability measure on
(which might be singular), or it contains a nonabelian free subgroup. To see this, first note that either
has a finite orbit (which supports an invariant probability measure) or the action is semi-conjugate to a minimal action (one with all orbits dense). In the second case, the proof depends on understanding the centralizer of the group action: either the centralizer is infinite, in which case the group is conjugate to a group of rotations, or it is finite cyclic, and one obtains an action of
on a “smaller” circle, by quotienting out by the centralizer. So one may assume the action is minimal with trivial centralizer. In this case, one shows that the action has the property that for any nonempty intervals
in
, there is some
with
; i.e. any interval may be put inside any other interval by some element of the group. For such an action, it is very easy to do ping-pong. Incidentally, a minor variation on this result, and with essentially this argument, was established by Thurston in the context of uniform foliations of
-manifolds before Ghys proposed his question.
Example: If is an (algebraic) family of representations of a (countable) free group
into an algebraic group, then either some element
is in the kernel of every
, or the set of faithful representations is “generic”, i.e. the intersection of countably many open dense sets. This is because the set of representations for which a given element is in the kernel is Zariski closed, and therefore its complement is open and either empty or dense (one must add suitable hypotheses or conditions to the above to make it rigorous).
As an experiment, I plan to spend the next five weeks documenting my current research on this blog. This research comprises several related projects, but most are concerned in one way or another with the general program of studying the geometry of a space by probing it with surfaces. Since I am nominally a topologist, these surfaces are real -manifolds, and I am usually interested in working in the homotopy category (or some rational “quotient” of it). I am especially concerned with surfaces with boundary, and even (occasionally) with corners.
Since it is good to have a “big question” lurking somewhere in the background (for the purposes of motivation and advertising, if nothing else), I should admit from the start that I am interested in Gromov’s well-known question about surface subgroups, which asks:
Question (Gromov): Does every one-ended word-hyperbolic group contain a closed hyperbolic surface subgroup?
I don’t have strong feelings about whether the answer to this question is “yes” or “no”, but I do think the question can be sharpened usefully in many ways, and it is my intention to do so. Gromov’s question is certainly inspired by questions such as Waldhausen’s conjecture and the virtual fibration conjecture in -manifold topology, but it is hard to imagine that a proof of one of these conjectures would shed much light on Gromov’s question in general. At least one essential tool in
-manifold topology — namely Dehn’s lemma — has no meaningful analogue in geometric group theory, and I think it is important to try to imagine different methods of constructing surface groups from “first principles”.
Another long-term project that informs much of my current research is the problem of understanding stable commutator length in free groups. The interested reader can learn something about this from my monograph (which can be downloaded from this page). I hope to explain why this is a fundamental and interesting problem, with rich structure and many potential applications.

Recent Comments