You are currently browsing the tag archive for the ‘hyperbolic groups’ tag.
A few weeks ago, Ian Agol, Vlad Markovic, Ursula Hamenstadt and I organized a “hot topics” workshop at MSRI with the title Surface subgroups and cube complexes. The conference was pretty well attended, and (I believe) was a big success; the organizers clearly deserve a great deal of credit. The talks were excellent, and touched on a wide range of subjects, and to those of us who are mid-career or older it was a bit shocking to see how quickly the landscape of low-dimensional geometry/topology and geometric group theory has been transformed by the recent breakthrough work of (Kahn-Markovic-Haglund-Wise-Groves-Manning-etc.-) Agol. Incidentally, when I first started as a graduate student, I had a vague sense that I had somehow “missed the boat” — all the exciting developments in geometry due to Thurston, Sullivan, Gromov, Freedman, Donaldson, Eliashberg etc. had taken place 10-20 years earlier, and the subject now seemed to be a matter of fleshing out the consequences of these big breakthroughs. 20 years and several revolutions later, I no longer feel this way. (Another slightly shocking aspect of the workshop was for me to realize that I am older or about as old as 75% of the speakers . . .)
The rationale for the workshop (which I had some hand in drafting, and therefore feel comfortable quoting here) was the following:
Recently there has been substantial progress in our understanding of the related questions of which hyperbolic groups are cubulated on the one hand, and which contain a surface subgroup on the other. The most spectacular combination of these two ideas has been in 3-manifold topology, which has seen the resolution of many long-standing conjectures. In turn, the resolution of these conjectures has led to a new point of view in geometric group theory, and the introduction of powerful new tools and structures. The goal of this conference will be to explore the further potential of these new tools and perspectives, and to encourage communication between researchers working in various related fields.
I have blogged a bit about cubulated groups and surface subgroups previously, and I even began this blog (almost 4 years ago now) initially with the idea of chronicling my efforts to attack Gromov’s surface subgroup question. This question asks the following:
Gromov’s Surface Subgroup Question: Does every one-ended hyperbolic group contain a subgroup which is isomorphic to the fundamental group of a closed surface of genus at least 2?
The restriction to one-ended groups is just meant to rule out silly examples, like finite or virtually cyclic groups (i.e. “elementary” hyperbolic groups), or free products of simpler hyperbolic groups. Asking for the genus of the closed surface to be at least 2 rules out the sphere (whose fundamental group is trivial) and the torus (whose fundamental group cannot be a subgroup of a hyperbolic group). It is the purpose of this blog post to say that Alden Walker and I have managed to show that Gromov’s question has a positive answer for “most” hyperbolic groups; more precisely, we show that a random group (in the sense of Gromov) contains a surface subgroup (in fact, many surface subgroups) with probability going to 1 as a certain natural parameter (the “length” of the random relators) goes to infinity. (update April 8: the preprint is available from the arXiv here.)
Today Jason Manning gave a talk on a vital ingredient in the proof of Agol’s theorem, which is a result in geometric group theory. The theorem is a joint project of Agol-Groves-Manning, and generalizes some earlier work they did a few years ago. Jason referred to the main theorem during his talk as the “Goal Theorem” (I guess it was the goal of his lecture), but I’m going to call it the Weak Separation Theorem, since that is a somewhat more descriptive name. The statement of the theorem is as follows.
Weak Separation Theorem (Agol-Groves-Manning): Let G be a hyperbolic group, let H be a subgroup of G which is quasiconvex, and isomorphic to the fundamental group of a virtually special NPC cube complex, and let g be an element of G which is not contained in H. Then there is a surjection so that
- is hyperbolic;
- is finite; and
- is not contained in .
In the remainder of this post I will try to explain the proof of this theorem, to the extent that I understand it. Basically, this amounts to my summarizing Manning’s talk (or the part of it that I managed to get down in my notes); again, any errors, foolishness, silly blog post titles etc. are due to me.
Last Friday, Henry Wilton gave a talk at Caltech about his recent joint work with Sang-hyun Kim on polygonal words in free groups. Their work is motivated by the following well-known question of Gromov:
Question(Gromov): Let be a one-ended word-hyperbolic group. Does contain a subgroup isomorphic to the fundamental group of a closed hyperbolic surface?
Let me briefly say what “one-ended” and “word-hyperbolic” mean.
A group is said to be word-hyperbolic if it acts properly and cocompactly by isometries on a proper -hyperbolic path metric space — i.e. a path metric space in which there is a constant so that geodesic triangles in the metric space have the property that each side of the triangle is contained in the -neighborhood of the union of the other two sides (colloquially, triangles are thin). This condition distills the essence of negative curvature in the large, and was shown by Gromov to be equivalent to several other conditions (eg. that the group satisfies a linear isoperimetric inequality; that every ultralimit of the group is an -tree). Free groups are hyperbolic; fundamental groups of closed manifolds with negative sectional curvature (eg surfaces with negative Euler characteristic) are word-hyperbolic; “random” groups are hyperbolic — and so on. In fact, it is an open question whether a group that admits a finite is word hyperbolic if and only if it does not contain a copy of a Baumslag-Solitar group for (note that the group is the special case ); in any case, this is a very good heuristic for identifying the word-hyperbolic groups one typically meets in examples.
If is a finitely generated group, the ends of really means the ends (as defined by Freudenthal) of the Cayley graph of with respect to some finite generating set. Given a proper topological space , the set of compact subsets of gives rise to an inverse system of inclusions, where includes into whenever is a subset of . This inverse system defines an inverse system of maps of discrete spaces , and the inverse limit of this system is a compact, totally disconnected space , called the space of ends of . A proper topological space is canonically compactified by its set of ends; in fact, the compactification is the “biggest” compactification of by a totally disconnected space, in the sense that for any other compactification where is zero dimensional, there is a continuous map which is the identity on .
For a word-hyperbolic group , the Cayley graph can be compactified by adding the ideal boundary , but this is typically not totally disconnected. In this case, the ends of can be recovered as the components of .
A group acts on its own ends . An elementary argument shows that the cardinality of is one of (if a compact set disconnects then infinitely many translates of converging to separate from infinitely many other ends accumulating on ). A group has no ends if and only if it is finite. Stallings famously showed that a (finitely generated) group has at least ends if and only if it admits a nontrivial description as an HNN extension or amalgamated free product over a finite group. One version of the argument proceeds more or less as follows, at least when is finitely presented. Let be an -dimensional Riemannian manifold with fundamental group , and let denote the universal cover. We can identify the ends of with the ends of . Let be a least (-dimensional) area hypersurface in amongst all hypersurfaces that separate some end from some other (here the hypothesis that has at least two ends is used). Then every translate of by an element of is either equal to or disjoint from it, or else one could use the Meeks-Yau “roundoff trick” to find a new with strictly lower area than . The translates of decompose into pieces, and one can build a tree whose vertices correspond to to components of , and whose edges correspond to the translates . The group acts on this tree, with finite edge stabilizers (by the compactness of ), exhibiting either as an HNN extension or an amalgamated product over the edge stabilizers. Note that the special case occurs if and only if has a finite index subgroup which is isomorphic to .
Free groups and virtually free groups do not contain closed surface subgroups; Gromov’s question more or less asks whether these are the only examples of word-hyperbolic groups with this property.
Kim and Wilton study Gromov’s question in a very, very concrete case, namely that case that is the double of a free group along a word ; i.e. (hereafter denoted ). Such groups are known to be one-ended if and only if is not contained in a proper free factor of (it is clear that this condition is necessary), and to be hyperbolic if and only if is not a proper power, by a result of Bestvina-Feighn. To see that this condition is necessary, observe that the double is isomorphic to the fundamental group of a Seifert fiber space, with base space a disk with two orbifold points of order ; such a group contains a . One might think that such groups are too simple to give an insight into Gromov’s question. However, these groups (or perhaps the slightly larger class of graphs of free groups with cyclic edge groups) are a critical case for at least two reasons:
- The “smaller” a group is, the less room there is inside it for a surface group; thus the “simplest” groups should have the best chance of being a counterexample to Gromov’s question.
- If is word-hyperbolic and one-ended, one can try to find a surface subgroup by first looking for a graph of free groups in , and then looking for a surface group in . Since a closed surface group is itself a graph of free groups, one cannot “miss” any surface groups this way.
Not too long ago, I found an interesting construction of surface groups in certain graphs of free groups with cyclic edge groups. In fact, I showed that every nontrivial element of in such a group is virtually represented by a sum of surface subgroups. Such surface subgroups are obtained by finding maps of surface groups into which minimize the Gromov norm in their (projective) homology class. I think it is useful to extend Gromov’s question by making the following
Conjecture: Let be a word-hyperbolic group, and let be nonzero. Then some multiple of is represented by a norm-minimizing surface (which is necessarily -injective).
Note that this conjecture does not generalize to wider classes of groups. There are even examples of groups with nonzero homology classes with positive, rational Gromov norm, for which there are no -injective surfaces representing a multiple of at all.
It is time to define polygonal words in free groups.
Definition: Let be free. Let be a wedge of circles whose edges are free generators for . A cyclically reduced word in these generators is polygonal if there exists a van-Kampen graph on a surface such that:
- every complementary region is a disk whose boundary is a nontrivial (possibly negative) power of ;
- the (labelled) graph immerses in in a label preserving way;
- the Euler characteristic of is strictly less than the number of disks.
The last condition rules out trivial examples; for example, the double of a single disk whose boundary is labeled by . Notice that it is very important to allow both positive and negative powers of as boundaries of complementary regions. In fact, if is not in the commutator subgroup, then the sum of the powers over all complementary regions is necessarily zero (and if is in the commutator subgroup, then has nontrivial , so one already knows that there is a surface subgroup).
Condition 2. means that at each vertex of , there is at most one oriented label corresponding to each generator of or its inverse. This is really the crucial geometric property. If is a van-Kampen graph as above, then a theorem of Marshall Hall implies that there is a finite cover of into which embeds (in fact, this observation underlies Stallings’s work on foldings of graphs). If we build a -complex with by attaching two ends of a cylinder to suitable loops in two copies of , then a tubular neighborhood of in (i.e. what is sometimes called a “fatgraph” ) embeds in a finite cover of , and its double — a surface of strictly negative Euler characteristic — embeds as a closed surface in , and is therefore -injective. Hence if is polygonal, contains a surface subgroup.
Not every word is polygonal. Kim-Wilton discuss some interesting examples in their paper, including:
- suppose is a cyclically reduced product of proper powers of the generators or their inverses (e.g a word like but not a word like ); then is polygonal;
- a word of the form is polygonal if for each ;
- the word is not polygonal.
To see 3, suppose there were a van-Kampen diagram with more disks than Euler characteristic. Then there must be some vertex of valence at least . Since is positive, the complementary regions must have boundaries which alternate between positive and negative powers of , so the degree of the vertex must be even. On the other hand, since must immerse in a wedge of two circles, the degree of every vertex must be at most , so there is consequently some vertex of degree exactly . Since each is isolated, at least edges must be labelled ; hence exactly two. Hence exactly two edges are labelled . But one of these must be incoming and one outgoing, and therefore these are adjacent, contrary to the fact that does not contain a .
1 above is quite striking to me. When is in the commutator subgroup, one can consider van-Kampen diagrams as above without the injectivity property, but with the property that every power of on the boundary of a disk is positive; call such a van-Kampen diagram monotone. It turns out that monotone van-Kampen diagrams always exist when , and in fact that norm-minimizing surfaces representing powers of the generator of are associated to certain monotone diagrams. The construction of such surfaces is an important step in the argument that stable commutator length (a kind of relative Gromov norm) is rational in free groups. In my paper scl, sails and surgery I showed that monomorphisms of free groups that send every generator to a power of that generator induce isometries of the norm; in other words, there is a natural correspondence between certain equivalence classes of monotone surfaces for an arbitrary word in and for a word of the kind that Kim-Wilton show is polygonal (Note: Henry Wilton tells me that Brady, Forester and Martinez-Pedroza have independently shown that contains a surface group for such , but I have not seen their preprint (though I would be very grateful to get a copy!)).
In any case, if not every word is polygonal, all is not lost. To show that contains a surface subgroup is suffices to show that contains a surface subgroup, where and differ by an automorphism of . Kim-Wilton conjecture that one can always find an automorphism so that is polygonal. In fact, they make the following:
Conjecture (Kim-Wilton; tiling conjecture): A word not contained in a proper free factor of shortest length (in a given generating set) in its orbit under is polygonal.
If true, this would give a positive answer to Gromov’s question for groups of the form .
The purpose of this post is to discuss my recent paper with Koji Fujiwara, which will shortly appear in Ergodic Theory and Dynamical Systems, both for its own sake, and in order to motivate some open questions that I find very intriguing. The content of the paper is a mixture of ergodic theory, geometric group theory, and computer science, and was partly inspired by a paper of Jean-Claude Picaud. To state the results of the paper, I must first introduce a few definitions and some background.
Let be a finite directed graph (hereafter a digraph) with an initial vertex, and edges labeled by elements of a finite set in such a way that each vertex has at most one outgoing edge with any given label. A finite directed path in starting at the initial vertex determines a word in the alphabet , by reading the labels on the edges traversed (in order). The set of words obtained in this way is an example of what is called a regular language, and is said to be parameterized by . Note that this is not the most general kind of regular language; in particular, any language of this kind will necessarily be prefix-closed (i.e. if then every prefix of is also in ). Note also that different digraphs might parameterize the same (prefix-closed) regular language .
If is a set of generators for a group , there is an obvious map called the evaluation map that takes a word to the element of represented by that word.
Definition: Let be a group, and a finite generating set. A combing of is a (prefix-closed) regular language for which the evaluation map is a bijection, and such that every represents a geodesic in .
The intuition behind this definition is that the set of words in determines a directed spanning tree in the Cayley graph starting at , and such that every directed path in the tree is a geodesic in . Note that there are other definitions of combing in the literature; for example, some authors do not require the evaluation map to be a bijection, but only a coarse bijection.
Fundamental to the theory of combings is the following Theorem, which paraphrases one of the main results of this paper:
Theorem: (Cannon) Let be a hyperbolic group, and let be a finite generating set. Choose a total order on the elements of . Then the language of lexicographically first geodesics in is a combing.
The language described in this theorem is obviously geodesic and prefix-closed, and the evaluation map is bijective; the content of the theorem is that is regular, and parameterized by some finite digraph . In the sequel, we restrict attention exclusively to hyperbolic groups .
Given a (hyperbolic) group , a generating set , a combing , one makes the following definition:
Definition: A function is weakly combable (with respect to ) if there is a digraph parameterizing and a function from the vertices of to so that for any , corresponding to a path in , there is an equality .
In other words, a function is weakly combable if it can be obtained by “integrating” a function along the paths of a combing. One furthermore says that a function is combable if it changes by a bounded amount under right-multiplication by an element of , and bicombable if it changes by a bounded amount under either left or right multiplication by an element of . The property of being (bi-)combable does not depend on the choice of a generating set or a combing .
Example: Word length (with respect to a given generating set ) is bicombable.
Example: Let be a homomorphism. Then is bicombable.
Example: The Brooks counting quasimorphisms (on a free group) and the Epstein-Fujiwara counting quasimorphisms are bicombable.
Example: The sum or difference of two (bi-)combable functions is (bi-)combable.
A particularly interesting example is the following:
Example: Let be a finite set which generates as a semigroup. Let denote word length with respect to , and denote word length with respect to (which also generates as a semigroup). Then the difference is a bicombable quasimorphism.
The main theorem proved in the paper concerns the statistical distribution of values of a bicombable function.
Theorem: Let be a hyperbolic group, and let be a bicombable function on . Let be the value of on a random word in of length (with respect to a certain measure depending on a choice of generating set). Then there are algebraic numbers and so that as distributions, converges to a normal distribution with standard deviation .
One interesting corollary concerns the length of typical words in one generating set versus another. The first thing that every geometric group theorist learns is that if are two finite generating sets for a group , then there is a constant so that every word of length in one generating set has length at most and at least in the other generating set. If one considers an example like , one sees that this is the best possible estimate, even statistically. However, if one restricts attention to a hyperbolic group , then one can do much better for typical words:
Corollary: Let be hyperbolic, and let be two finite generating sets. There is an algebraic number so that almost all words of length with respect to the generating set have length almost equal to with respect to the generating set, with error of size .
Let me indicate very briefly how the proof of the theorem goes.
Sketch of Proof: Let be bicombable, and let be a function from the vertices of to , where is a digraph parameterizing . There is a bijection between the set of elements in of word length and the set of directed paths in of length that start at the initial vertex. So to understand the distribution of , we need to understand the behaviour of a typical long path in .
Define a component of to be a maximal subgraph with the property that there is a directed path (in the component) from any vertex to any other vertex. One can define a new digraph without loops, with one vertex for each component of , in an obvious way. Each component determines an adjacency matrix , with -entry equal to if there is a directed edge from vertex to vertex , and equal to otherwise. A component is big if the biggest real eigenvalue of is at least as big as the biggest real eigenvalue of the matrices associated to every other component. A random long walk in will spend most of its time entirely in big components, so these are the only components we need to consider to understand the statistical distribution of .
A theorem of Coornaert implies that there are no big components of in series; i.e. there are no directed paths in from one big component to another (one also says that the big components do not communicate). This means that a typical long walk in is entirely contained in a single big component, except for a (relatively short) path at the start and the end of the walk. So the distribution of gets independent contributions, one from each big component.
The contribution from an individual big component is not hard to understand: the central limit theorem for stationary Markov chains says that for elements of corresponding to paths that spend almost all their time in a given big component there is a central limit theorem where the mean and standard deviation depend only on . The problem is to show that the means and standard deviations associated to different big components are the same. Everything up to this point only depends on weak combability of ; to finish the proof one must use bicombability.
It is not hard to show that if is a typical infinite walk in a component , then the subpaths of of length are distributed like random walks of length in . What this means is that the mean and standard deviation associated to a big component can be recovered from the distribution of on a single infinite “typical” path in . Such an infinite path corresponds to an infinite geodesic in , converging to a definite point in the Gromov boundary . Another theorem of Coornaert (from the same paper) says that the action of on its boundary is ergodic with respect to a certain natural measure called a Patterson-Sullivan measure (see Coornaert’s paper for details). This means that there are typical infinite geodesics associated to components and for which some takes to a geodesic ending at the same point in as . Bicombability implies that the values of on and differ by a bounded amount. Moreover, since and are asymptotic to the same point at infinity, combability implies that the values of on and also differ by a bounded amount. This is enough to deduce that and , and one obtains a (global) central limit theorem for on . qed.
This obviously raises several questions, some of which seem very hard, including:
Question 1: Let be an arbitrary quasimorphism on a hyperbolic group (even the case is free is interesting). Does satisfy a central limit theorem?
Question 2: Let be an arbitrary quasimorphism on a hyperbolic group . Does satisfy a central limit theorem with respect to a random walk on ? (i.e. one considers the distribution of values of not on the set of elements of of word length , but on the set of elements obtained by a random walk on of length , and lets go to infinity)
All bicombable quasimorphisms satisfy an important property which is essential to our proof of the central limit theorem: they are local, which is to say, they are defined as a sum of local contributions. In the continuous world, they are the analogue of the so-called de Rham quasimorphisms on where is a closed negatively curved Riemannian manifold; such quasimorphisms are defined by choosing a -form , and defining to be equal to the integral , where is the closed oriented based geodesic in in the homotopy class of . De Rham quasimorphisms, being local, also satisfy a central limit theorem.
This locality manifests itself in another way, in terms of defects. Let be a quasimorphism on a hyperbolic group . Recall that the defect is the supremum of over all pairs of elements . A quasimorphism is further said to be homogeneous if for all integers . If is an arbitrary quasimorphism, one may homogenize it by taking a limit ; one says that is the homogenization of in this case. Homogenization typically does not preserve defects; however, there is an inequality . If is local, one expects this inequality to be an equality. For, in a hyperbolic group, the contribution to the defect of a local quasimorphism all arises from the interaction of the suffix of (a geodesic word representing the element) with the prefix of (with notation as above). When one homogenizes, one picks up another contribution to the defect from the interaction of the prefix of with the suffix of ; since these two contributions are essentially independent, one expects that homogenizing a local quasimorphism should exactly double the defect. This is the case for bicombable and de Rham quasimorphisms, and can perhaps be used to define locality for a quasimorphism on an arbitrary group.
This discussion provokes the following key question:
Question 3: Let be a group, and let be a homogeneous quasimorphism. Is there a quasimorphism with homogenization , satisfying ?
Example: The answer to question 3 is “yes” if is the rotation quasimorphism associated to an action of on by orientation-preserving homeomorphisms (this is nontrivial; see Proposition 4.70 from my monograph).
Example: Let be any homologically trivial group -boundary. Then there is some extremal homogeneous quasimorphism for (i.e. a quasimorphism achieving equality under generalized Bavard duality; see this post) for which there is with homogenization satisfying . Consequently, if every point in the boundary of the unit ball in the norm is contained in a unique supporting hyperplane, the answer to question 3 is “yes” for any quasimorphism on .
Any quasimorphism on can be pulled back to a quasimorphism on a free group, but this does not seem to make anything easier. In particular, question 3 is completely open (as far as I know) when is a free group. An interesting test case might be the homogenization of an infinite sum of Brooks functions for some infinite non-nested family of words .
If the answer to this question is false, and one can find a homogeneous quasimorphism which is not the homogenization of any “local” quasimorphism, then perhaps does not satisfy a central limit theorem. One can try to approach this problem from the other direction:
Question 4: Given a function defined on the ball of radius in a free group , one defines the defect in the usual way, restricted to pairs of elements for which are all of length at most . Under what conditions can be extended to a function on the ball of radius without increasing the defect?
If one had a good procedure for building a quasimorphism “by hand” (so to speak), one could try to build a quasimorphism that failed to satisfy a central limit theorem, or perhaps find reasons why this was impossible.
More ambitious than simply showing that a group is infinite is to show that it contains an infinite subgroup of a certain kind. One of the most important kinds of subgroup to study are free groups. Hence, one is interested in the question:
Question: When does a group contain a (nonabelian) free subgroup?
Again, one can (and does) ask this question both about a specific group, and about certain classes of groups, or for a typical (in some sense) group from some given family.
Example: If is a property of groups that is inherited by subgroups, then if no free group satisfies , no group that satisfies can contain a free subgroup. An important property of this kind is amenability. A (discrete) group is amenable if it admits an invariant mean; that is, if there is a linear map (i.e. a way to define the average of a bounded function over ) satisfying three basic properties:
- if (i.e. the average of a non-negative function is non-negative)
- where is the constant function taking the value everywhere on (i.e. the average of the constant function is normalized to be )
- for every and , where (i.e. the mean is invariant under the obvious action of on )
If is a subgroup of , there are (many) -invariant homomorphisms taking non-negative functions to non-negative functions, and to ; for example, the (left) action of on breaks up into a collection of copies of acting on itself, right-multiplied by a collection of right coset representatives. After choosing such a choice of representatives , one for each coset , we can define . Composing with shows that every subgroup of an amenable group is amenable (this is harder to see in the “geometric” definition of amenable groups in terms of Folner sets). On the other hand, as is well-known, a nonabelian free group is not amenable. Hence, amenable groups do not contain nonabelian free subgroups.
The usual way to see that a nonabelian free group is not amenable is to observe that it contains enough disjoint “copies” of big subsets. For concreteness, let denote the free group on two generators , and write their inverses as . Let denote the set of reduced words that start with either or , and let denote the indicator functions of respectively. We suppose that is amenable, and derive a contradiction. Note that , so . Let denote the set of reduced words that start with one of the strings , and let denote the indicator function of . Notice that is made of two disjoint copies of each of . So on the one hand, , but on the other hand, .
Conversely, the usual way to show that a group is amenable is to use the Folner condition. Suppose that is finitely generated by some subset , and let denote the Cayley graph of (so that is a homogeneous locally finite graph). Suppose one can find finite subsets of vertices so that (here means the number of vertices in , and means the number of vertices in that share an edge with ). Since the “boundary” of is small compared to , averaging a bounded function over is an “almost invariant” mean; a weak limit (in the dual space to ) is an invariant mean. Examples of amenable groups include
- Finite groups
- Abelian groups
- Unions and extensions of amenable groups
- Groups of subexponential growth
and many others. For instance, virtually solvable groups (i.e. groups containing a solvable subgroup with finite index) are amenable.
Example: No amenable group can contain a nonabelian free subgroup. The von Neumann conjecture asked whether the converse was true. This conjecture was disproved by Olshanskii. Subsequently, Adyan showed that the infinite free Burnside groups are not amenable. These are groups with generators, and subject only to the relations that the th power of every element is trivial. When is odd and at least , these groups are infinite and nonamenable. Since they are torsion groups, they do not even contain a copy of , let alone a nonabelian free group!
Example: The Burnside groups are examples of groups that obey a law; i.e. there is a word in finitely many free variables, such that for every choice of . For example, an abelian group satisfies the law . Evidently, a group that obeys a law does not contain a nonabelian free subgroup. However, there are examples of groups which do not obey a law, but which also do not contain any nonabelian free subgroup. An example is the classical Thompson’s group , which is the group of orientation-preserving piecewise-linear homeomorphisms of with finitely many breakpoints at dyadic rationals (i.e. points of the form for integers ) and with slopes integral powers of . To see that this group does not obey a law, one can show (quite easily) that in fact is dense (in the topology) in the group of all orientation-preserving homeomorphisms of the interval. This latter group contains nonabelian free groups; by approximating the generators of such a group arbitrarily closely, one obtains pairs of elements in that do not satisfy any identity of length shorter than any given constant. On the other hand, a famous theorem of Brin-Squier says that does not contain any nonabelian free subgroup. In fact, the entire group does not contain any nonabelian free subgroup. A short proof of this fact can be found in my paper as a corollary of the fact that every subgroup of has vanishing stable commutator length; since stable commutator length is nonvanishing in nonabelian free groups, this shows that there are no such subgroups of . (Incidentally, and complementarily, there is a very short proof that stable commutator length vanishes on any group that obeys a law; we will give this proof in a subsequent post).
Example: If surjects onto , and contains a free subgroup , then there is a section from to (by freeness), and therefore contains a free subgroup.
Example: The most useful way to show that contains a nonabelian free subgroup is to find a suitable action of on some space . The following is known as Klein’s ping-pong lemma. Suppose one can find disjoint subsets and of , and elements so that , , and similarly interchanging the roles of and . If is a reduced word in , one can follow the trajectory of a point under the orbit of subwords of to verify that is nontrivial. The most common way to apply this in practice is when act on with source-sink dynamics; i.e. the element has two fixed points so that every other point converges to under positive powers of , and to under negative powers of . Similarly, has two fixed points with similar dynamics. If the points are disjoint, and is compact, one can take any small open neighborhoods of , and then sufficiently large powers of and will satisfy the hypotheses of ping-pong.
Example: Every hyperbolic group acts on its Gromov boundary . This boundary is the set of equivalence classes of quasigeodesic rays in (the Cayley graph of) , where two rays are equivalent if they are a finite Hausdorff distance apart. Non-torsion elements act on the boundary with source-sink dynamics. Consequently, every pair of non-torsion elements in a hyperbolic group either generate a virtually cyclic group, or have powers that generate a nonabelian free group.
It is striking to see how easy it is to construct nonabelian free subgroups of a hyperbolic group, and how difficult to construct closed surface subgroups. We will return to the example of hyperbolic groups in a future post.
Example: The Tits alternative says that any linear group (i.e. any subgroup of for some ) either contains a nonabelian free subgroup, or is virtually solvable (and therefore amenable). This can be derived from ping-pong, where is made to act on certain spaces derived from the linear action (e.g. locally symmetric spaces compactified in certain ways, and buildings associated to discrete valuations on the ring of entries of matrix elements of ).
Example: There is a Tits alternative for subgroups of other kinds of groups, for example mapping class groups, as shown by Ivanov and McCarthy. The mapping class group (of a surface) acts on the Thurston boundary of Teichmuller space. Every subgroup of the mapping class group either contains a nonabelian free subgroup, or is virtually abelian. Roughly speaking, either elements move points in the boundary with enough dynamics to be able to do ping-pong, or else the action is “localized” in a train-track chart, and one obtains a linear representation of the group (enough to apply the ordinary Tits alternative). Virtually solvable subgroups of mapping class groups are virtually abelian.
Example: A similar Tits alternative holds for . This was shown by Bestvina-Feighn-Handel in these three papers (the third paper shows that solvable subgroups are virtually abelian, thus emphasizing the parallels with mapping class groups).
Example: If is a finitely generated group of homeomorphisms of , then there is a kind of Tits alternative, first proposed by Ghys, and proved by Margulis: either preserves a probability measure on (which might be singular), or it contains a nonabelian free subgroup. To see this, first note that either has a finite orbit (which supports an invariant probability measure) or the action is semi-conjugate to a minimal action (one with all orbits dense). In the second case, the proof depends on understanding the centralizer of the group action: either the centralizer is infinite, in which case the group is conjugate to a group of rotations, or it is finite cyclic, and one obtains an action of on a “smaller” circle, by quotienting out by the centralizer. So one may assume the action is minimal with trivial centralizer. In this case, one shows that the action has the property that for any nonempty intervals in , there is some with ; i.e. any interval may be put inside any other interval by some element of the group. For such an action, it is very easy to do ping-pong. Incidentally, a minor variation on this result, and with essentially this argument, was established by Thurston in the context of uniform foliations of -manifolds before Ghys proposed his question.
Example: If is an (algebraic) family of representations of a (countable) free group into an algebraic group, then either some element is in the kernel of every , or the set of faithful representations is “generic”, i.e. the intersection of countably many open dense sets. This is because the set of representations for which a given element is in the kernel is Zariski closed, and therefore its complement is open and either empty or dense (one must add suitable hypotheses or conditions to the above to make it rigorous).
As an experiment, I plan to spend the next five weeks documenting my current research on this blog. This research comprises several related projects, but most are concerned in one way or another with the general program of studying the geometry of a space by probing it with surfaces. Since I am nominally a topologist, these surfaces are real -manifolds, and I am usually interested in working in the homotopy category (or some rational “quotient” of it). I am especially concerned with surfaces with boundary, and even (occasionally) with corners.
Since it is good to have a “big question” lurking somewhere in the background (for the purposes of motivation and advertising, if nothing else), I should admit from the start that I am interested in Gromov’s well-known question about surface subgroups, which asks:
Question (Gromov): Does every one-ended word-hyperbolic group contain a closed hyperbolic surface subgroup?
I don’t have strong feelings about whether the answer to this question is “yes” or “no”, but I do think the question can be sharpened usefully in many ways, and it is my intention to do so. Gromov’s question is certainly inspired by questions such as Waldhausen’s conjecture and the virtual fibration conjecture in -manifold topology, but it is hard to imagine that a proof of one of these conjectures would shed much light on Gromov’s question in general. At least one essential tool in -manifold topology — namely Dehn’s lemma — has no meaningful analogue in geometric group theory, and I think it is important to try to imagine different methods of constructing surface groups from “first principles”.
Another long-term project that informs much of my current research is the problem of understanding stable commutator length in free groups. The interested reader can learn something about this from my monograph (which can be downloaded from this page). I hope to explain why this is a fundamental and interesting problem, with rich structure and many potential applications.