First Week of Knot Theory in MasterMath
Course on Quantum Groups and Knot Theory
This lecture
(September 12, 2007)
provides a first introduction to knot theory.
The following notions are treated:
At the end you will find Exercises.
A knot is a closed curve
embedded in real Euclidean space E. At least,
this is a first approximation of what we mean.
In fact, we want the knot to be tame, not wild.
The adjective
closed used above indicates that the curve has no end
point and no starting point. But it could also be interpreted as the knot
being a closed subset of E with respect to the
usual topology on E. Thus, a knot is the image
of a continuous embedding of the circle that is closed. The picture below
shows that this is still not enough to guarantee that the knot is tame.
It would suffice to require that the embedding is smooth. A better way
of avoiding degeneracy (for our purposes) uses tubular neighborhoods: demand
that the knot has a tubular neighborhood that is not self-intersecting; this
means that there is a positive constant e such that the intersection
of the knot with the ball of radius e around each point of the knot
is a single segment of the knot. A knot with this property is called
tame (as opposed to wild). We will restrict
attention to tame knots, and so, when referring to a knot we implicitly assume
it is tame.
By the way, a segment
of the knot is often referred to as a strand.
If we move the curve in a smooth manner, without ever self-intersecting
it along the way, we do not want to think of the result as a different curve
from the original one. Such a smooth deformation, during which the curve is
the closed image of a continuous embedding of the circle at all time, is
called an isotopy. Two knots are said to be isotopic if
one can be obtained from the other by such a smooth deformation. Isotopy is an
equivalence relation.
In every isotopy class of a knot, there is a representative consisting
of a finite number of line segments. Such a knot is called piecewise linear.
Moreover, given two piecewise linear knots that are isotopic, one can deform
one to the other by means of a homotopy within a simplicial complex inside
E in which both knots occur as 1-dimensional
subcomplexes. Therefore, another approach to handle the nondegeneracy
condition for knots is to consider piecewise linear knots and simplicial
complexes only.
The embedding of a circle in E as the
boundary of a disk is called the unknot or the
trivial knot. In many senses, it is the easiest of all
knots.
The question whether the unkot is a knot is similar to issue whether
zero should be counted as a natural number.
A link
is the union of a finite number of disjoint knots in
E. The comments on
knots all apply for
links after circle has been replaced by disjoint union of a finite
number of circles.
In particular, two links are considered equal if they are isotopic.
Many statements about knots also apply to links. This may be a good
reason to extend the study to links. However, the most important rationale
for including links is that certain operations that we will consider leave
invariant the class of links, but not the class of knots.
The knots making up the link are called the components
of the link.
Example 1. The Borromean rings
The Borromean rings are depicted below. This link has three components.
None of the three rings can be separated from the other two.
But take away one, and the other two fall apart.
The embedding of a set of n
circles in E as
the boundaries of a set of disjoint disks is called the
trivial link on n components.
The fundamental problem of knot theory is to determine, when given two
knots, whether or not they are isotopic. It is the quest for an algorithm
that, on input two knots, decides the issue whether one can be smoothly
transformed into the other. Of course, the same question can be asked for
links. A theoretic description of an algorithm
determining if two knots are isotopic has been given by
Hemion.
A more modest problem of knot theory is to determine, when given
a knot, whether or not it is isotopic to the unknot.
Haken has given
a description of an algorithm for this purpose.
Both
algorithm descriptions
are too involved to program even for simple cases;
see
Hass.
In this course we mainly discuss the approach through invariants: functions defined on the set of
knots, preferably easy to compute, for the purpose of distinguishing
nonisotopic links from each other. The big issue then is to find a complete
set of invariants, that is, enough in order to tell any two non-isotopic knots
apart. In order to be able to compute with links at all, we work with link
diagrams.
A
link diagram is a planar graph whose nodes have valency
four, supplied with a little extra information. The diagram is interpreted as a
projection on the plane of a knot in 3-dimensional space. During the
projection crossings may occur: points in the plane lying on the projection of
two parts of the curves making up the link. The projection is always chosen so
that no more than two parts project onto arcs going through the same point of
the plane and the points lying in two projected parts are
isolated. Conversely, given a planar graph of valency four, the link can be
reconstructed by connecting opposite edges at each node and separating the two
segments, except that it is not clear which part passes over and which goes
under the crossing. This brings us to the little extra information alluded
to: it tells us exactly that. Usually, the crossings are drawn in the plane
in such a way that it is immediately clear which segment is going over and
which is going under at each node.
Example 2. The Whitehead link
Two link diagrams are said to be the same when they are equal up to
planar isotopy.
Here, a planar isotopy is a smooth deformation
of the plane
such that, at each stage of the deformation, the link diagram represent the
same planar graph as the original link diagram.
Each link diagram represents a unique link.
Each link can have many link diagrams representing it.
Of course, a knot diagram is just a link diagram
corresponding to a knot.
An
orientation of a link is a direction of travelling along each of its
components. If such an orientation is given, we speak of an
oriented link.
A link can be oriented in
{2}^{c} ways,
where
c
is the number of its components.
Example 3. The Whitehead link
The reflection (or
mirror image) of a link is its
image under reflection in a mirror. It is convenient, but not essential
to place the mirror in such a way that the whole link is at one side of
it. A link is called achiral
if it is isotopic to its reflection.
A link diagram of the reflection of a link with link diagram D
can be obtained by reversing the over and under passes at each crossing
of D.
The reverse of an oriented link is obtained by
reversing the orientation of each of its components. A link is called
invertible if it is isotopic to its reverse.
The signed Gauss code of a link is a particular encoding of a link diagram
for algorithmic use. This is how it works for an oriented link.
-
Label the nodes of the link.
-
Order the components of the link.
-
Pick the first (in the chosen ordering) component not yet dealt with.
Traverse it in the direction of the orientation,
starting at an arbitrary node.
At each node, record its label, whether the
travel is an over or an undercrossing, and record the orientation of the node.
-
Stop when all components are traversed.
The result is the so-called signed Gauss code.
The signed Gauss code is not unique. Not even after the link is
oriented, the nodes are labelled, and the components are ordered. There is
still the freedom of choosing the starting node for each component. By viewing
the sequences as cycles (as in permutation cycles), this ambiguity disappears.
Not every Gauss code lookalike leads to a link diagram. An obvious necessary
condition is that every label occurs exactly twice, but there are more.
Each oriented link diagram is determined uniquely up to planar isotopy
by its signed Gauss code.
Example 6. The Whitehead link
A Reidemeister move of type I, II, or III is a transformation of a link
diagram by means of substitution of a subconfiguration of the knot as
occurring at the top or bottom of a column in the above picture by the
other subconfiguration in the same column, where the ends of the
strands are matched to the link as they were before; the reflected versions of
these rules also count as Reidemeister rules of the same type.
A Reidemeister move of type O is a transformation of a link
diagram by means of substitution of a strand without crossings
at the outside of the diagram by a strand without crossings at the
other side of the diagram.
This transformation is symbolized in the two pictures below.
A Reidemeister 0 move can be easily seen as a sequence
of Reidemeister II
and III moves, porting the strand to the other side by moving it
under (or over, but stay with one choice) all other strands and crossings. It
can be interpreted by viewing the link diagram as embedded in the
2-dimensional sphere rather than the Euclidean plane, by just closing up the
plane with an additional point at infinity. The result of the Reidemeister 0
move corresponds to letting the point at infinity cross the strand
that is moved.
Peter Vos (TU Eindhoven) wrote a Java program to help understand the Reidemeister
moves. If you have Java installed, it can be (web)started by clicking
on the Knotweaver link.
It is easy to see that if a link diagram D is obtained from
the link diagram E by a Reidemeister move, then D and
E represent the same link. The surprise of the result below is that
the converse also holds.
Two link diagrams represent the same link up to isotopy if and only if
one can be obtained from the other by a sequence of planar isotopies and
Reidemeister moves.
The nontrivial implication is a rather straightforward but tedious proof.
We omit it here.
Seifert circles form a step towards the construction of a surface whose
boundary is a given link.
Start from an oriented link diagram D.
At each crossing follow each arc coming into the crossing and
join it to the adjacent arc leaving the crossing.
The resulting strands are disjoint circles,
called Seifert circles.
Example 7. The Whitehead link
A Seifert surface of a link L is an
orientable surface whose boundary is L.
Here, a surface is the closure of a 2-dimensional subset of E such that the intersection of a small enough sphere around
each of its points with the surface is a disk. It is called orientable if the
orientation of a disk around an inner point never changes after following a
continuous closed path containing it.
Each link has a Seifert surface.
Let L be a link. Draw a link diagram of L.
Construct the Seifert circles of
L. Fill each Seifert circle by a disk whose boundary is the circle.
If, in the plane, one Seifert circle is circumscribed by the other, we think
of the corresponding disks as stacked on top of each other in Euclidean space.
Now reinstate the crossings as strands in space and fill up the part between
the strands to a sheet glued to the Seifert circles connected by the strands.
The resulting surface is a Seifert surface for L.
The orientation of the Seifert surface is indicated by the orientation
of the Seifert circles.
QED
Example 8. The Whitehead link
Jack van
Wijk's pages provide a pleasant introduction
to Seifert surfaces and many ways to visualize them.
A link invariant is a map defined on the set of
links that is invariant under isotopy. The origin of this name lies in the
problem of defining such maps and computing the image of a link under the map.
Often, Reidemeister's theorem is used for this purpose as follows.
-
A map is defined on all link diagrams.
-
The
map is shown to be constant on the equivalence class generated by planar
isotopy and the Reidemeister moves I, II, and III.
Reidemeister's
theorem then allows us to conclude that the map is actually defined on
links. So, a link invariant is often given as a map on link diagrams.
The importance of such maps is that it
may help to tell two links apart. Some link invariants are defined on
oriented links instead of ordinary links. The number of components is a very
simple example of a link invariant. The number of tricolorings of a link
diagram is a relatively simple example of a link invariant.
Markov's theorem
can be also used to define link invariants, by use of
braids instead of link diagrams.
The crossing number of a link diagram is simply the number of
crossings appearing in that diagram. The
crossing number of a link L is
the minimum over all crossing numbers of link diagrams
representing L.
The number of components of a link is a link invariant. This invariant is not
helpful in telling knots apart.
By definition, the crossing number is a link invariant. But there is no
explicit method of generating, for a given link, all link diagrams with
crossing numbers less than a given number. So in terms of telling two knots
apart, the crossing number is of no great use. Still it is an intuitively
appealing quantity and is often used in classifications of
lists with a limited number of crossings.
The
genus of a surface is a well-known invariant. Together
with its orientability (a Boolean variable) and the number of boundary
components, it characterizes the surface up to homotopy.
A
Seifert surface is orientable and has just as many boundary components as it
has link components. So it is determined by its genus.
The genus of a surface with boundary is, by definition the genus of the
closed surface obtained by capping off each boundary component with a disk.
The genus
g\left(L\right)
of a Seifert surface of a link diagram L with
s Seifert circles, c crossings
and m components is given by
g\left(L\right)=\frac{2+c-s-m}{2}.
Given a triangulation of the surface
by f polygons, e edges, and
v vertices, its genus g
is given by
2-2·g\left(L\right)=v-e+f.
Now apply this to a triangulation based on
s Seifert circles and m caps for
polygons, the
2·c strands between crossings for edges, and
the c crossings for vertices.
QED
The genus of a link is the minimal of all genera
of Seifert surfaces (of diagrams for) the link.
Using an orientation of a link diagram we can orient its crossings as
follows. If, along the travelling direction of the upper segments, the under
segment is oriented right-to-left, the crossing is positively oriented and we
will say that the sign of the crossing is 1.
Otherwise, it is negatively oriented and the sign
of the crossing is
-1.
The writhe of an oriented link diagram is the
sum of the signs over all crossings.
Clearly, the writhe of an oriented link diagram changes
under a Reidemeister I move.
The writhe of an oriented link diagram is invariant under Reidemeister moves
II and III.
In a Reidemeister II move, the orientations of the two crossings occurring
add up to zero. In a Reidemeister III move,
the orientations of the crossings, marked by the strands that
cross, are equal before and after the move.
QED
The writhe is not an invariant as it changes under Reidemeister I moves.
By selecting those crossings that involve two distinct components of a link,
we guarantee that the self-crossing of Reidemeister I is never counted. The
linking number of an oriented link diagram is defined as
half the sum of the crossing signs, taking only crossings involving two
distinct components of the link diagram. Notice that this is an integer.
As an immediate consequence, we have a link invariant.
The linking number is an oriented link invariant.
This invariant does
not distinguish between knots.
An arc of a link diagram is a connected component
of the link diagram in the plane; it is bounded by ends where the link
is not drawn as it passes under a crossing. A tricoloring of a link diagram
is a coloring of all of its arcs with (at most) three colors in such a way
that, at each node, an odd number of colors appears.
The number of tricolorings of a link diagram is a link invariant.
At each crossing three arcs pass. Label the colors by the number 0, 1, 2
modulo 3. The requirement that, at a given crossing,
an odd number of colors appears
means that the labels of the arcs passing through that crossing add up to 0
modulo 3. Therefore, solving the resulting linear equations, one for each node
and having three terms each, we find that the number of solutions is a power
of 3.
Example 9. The Whitehead link
The arcs are labelled A, ..., F.
These labels represent integers modulo 3 and, in order
to be a tricoloring, need to satisfy the following equations, one for each of
the 6 crossings.
A+D+E=0
A+B+F=0
B+E+F=0
B+C+E=0
C+D+F=0
D+E+F=0
These equations force all variables to be equal and so there are
exactly three solutions. So this invariant distinguishes the Whitehead link
from the unlink on two components but does not distinguish the Whitehead link
from the simple chain of two rings.
A knot is said to be composed if, by a single cut
with scissors, cutting two adjacent parallel strands of the knot diagram, and
subsequently joining the ends at the same sides of the cut, we obtain two
disjoint nontrivial knots. A link that is composed this way of two links
(possibly trivial) L and K
is called the connected sum
of L and K and denoted
L+K.
If a knot is not trivial and not composed, then it
is called prime.
The connected sum turns the set of links into a commutative monoid,
with the trivial knot as its zero.
Every knot has a unique (up to isotopy) decomposition as a connected sum
of prime knots.
Example 10.
The knot
is the connected sum of two trivial knots.
The genus of links is additive with respect to connected sum, that is,
g\left(K+L\right)=g\left(K\right)+g\left(L\right)
for any two links K and L.
It follows that a knot is trivial if and only if its genus is trivial.
- Knot (1.5 pts)
It has been stated that every isotopy class of a knot
has a representative consisting of a finite number of line segments.
Verify this for the case of the trefoil,
by drawing it as a closed path on the points of
{\mathbb{Z}}^{3},
only connecting vertices at Euclidean distance 1.
- Link (1 pt)
Draw the simple link of two rings as a
1-dimensional subcomplex of a simply connected
simplicial complex in the Euclidean space E.
- Link diagram (1 pt)
Prove the following statements.
-
Every knot having a diagram with at most two crossings is trivial.
-
Every nontrivial knot having a diagram with three crossings
is a trefoil.
- Reflection and reverse (1 pt)
Prove that the operations reflection and reverse commute.
- Gauss code (1.5 pts)
Each Gauss code for a link diagram with n crossings
has the following properties.
-
Each nonzero integer in the range
\left\{-n,\mathrm{...},n\right\}
occurs exactly once.
-
There is no subsequence of the form
i,j,-i,-j.
Show that these conditions do not guarantee that the code is actually the
Gauss code of a link diagram.
- Gauss code (1 pt)
Let G be the signed Gauss code of an oriented link.
Express the signed Gauss code of the reflection of the oriented link
and of the reverse of the oriented link in terms of G.
- Gauss code (1.5 pts)
For each of the Gauss codes below, determine all extensions to
signed Gauss codes
-
1,-2,3,-1,2,-3
-
1,-2,-3,3,2,-1
- Reidemeister moves (2 pts)
Consider the logo of the International Mathematical Olympiad 1995.
It has two components. Call the horizontal compontent with a
self-crossing K and the vertical one without self-crossings
L.
Clearly, the IMO link is isotopic to the link whose diagram is
For, this is just a clockwise rotation of the original diagram by 90
degrees. But, in this isotopy, the components K and L are
interchanged. Give a chain of Reidemeister moves that transforms the IMO
diagram into the other one without interchanging the components. So at the
end, K should be without self-crossing and L should have a
single self-crossing.
- Reidemeister theorem (2 pts)
Consider the knot diagram D below.
-
Prove that the knot of D is trivial.
-
Verify that there are no instances of Reidemeister III
in D.
-
Notice that D has 28 crossings.
Show that, in each chain of Reidemeister moves from D to the
unknot, there is a diagram with more than 28 crossings.
- Seifert surface (1 pt)
Show that the Seifert surface of a knot diagram
does not depend on the choice of
orientation of the diagram.
How about links?
- Crossing number (1 pt)
Show that
there is at most one nontrivial knot up to isotopy that is not a trefoil
and has a knot diagram with no more than four crossings.
- Genus (1 pt)
Determine the genus of the trefoil.
- Linking number (.5 pts)
Show, by means of the linking number, that the link consisting of
two chained rings is not isotopic to the link consisting of two separated
rings.
- Linking number (1 pt)
Consider the oriented link diagrams
with signed Gauss code
1, -2, 3, -3, 4, -1, 2, -4 / ++-+
and
-1, 2, -3, 1, -2, 3, 4, -4 / ++++
Clearly their writhes are +2 and +4, respectively. Prove that the two link
diagrams represent isotopic knots.
- Tricoloring a link diagram (1 pt)
Determine the number of tricolorings of the trefoil. Conclude that the trefoil
is not isotopic to the unknot.
- Prime knot (1 pt)
Prove
that the number of tricolorings of the connected sum of two knots
is one third of the product of the numbers of tricolorings of the two knots.