math120_wim.pdf

An Introduction to Group Theory and the Rubik’s Cube
Lucas Garron
June 8, 2011
1
Introduction and Motivation
If you’ve done enough math or speedcubing, you’ve probably heard that the Rubik’s Cube is a
very useful example for studying the mathematical subject of group theory. The states of the
Rubik’s Cube form a group, which means that they satisfy certain properties; the Rubik’s Cube is
also moderately complicated, which means that a lot of parts of it have interesting properties that
can be used to visualize important concepts in group theory. In particular, the Rubik’s cube group
also contains many subgroups that we can use to understand parts of the cube. This paper is
intended to be a layman’s informal introduction to the Rubik’s Cube Group through a look at its
subgroups. I am assuming that you have some experience with Rubik’s Cubes and mathematical
thinking, but I will try to assume little about what you actually know.
2
2.1
Groups and the Rubik’s Cube
Groups
Groups are not some esoteric and complicated mathematical concept; in fact, the reason gruop
theory is so important is that many mathematical structures you might be familiar with are actually
groups.
1
2.1 Groups
2 GROUPS AND THE RUBIK’S CUBE
Example 1 “The integers under addition” is a description of a common group. Since the set1
{..., −3, −2, −1, 0, 1, 2, 3, ...} is sometimes called Z, this group is often written as (Z, +).
The fact that (Z, +) is a group means that it is possible to add integers to each other, and that
addition in this sense is “well-behaved”. Let’s look at some examples:
• If we add two integers, we always get another integer – the integers are closed under addition.
(If this seems to obvious, notice that 1/2 is not an integer, so we would say the integers are
not closed under division – (Z, ÷) is not a group.)
• If we’re adding 1 + 2 + 3, it also doesn’t make a difference whether we consider it (1 + 2) + 3
or 1 + (2 + 3) – this is because the integers are associative.
• In fact, it doesn’t matter in what order we add two integers: x + y = y + x, no matter what
integers x and y we pick – the integers also happen to be commutative.
• There is also a very important number in the integers: 0. If we add 0 to anything, we always
get back the original number. 0 is called the identity.
• Every integer x also has an inverse −x, and if we add them together we always get the
identity: 0.
Each of these properties might seem obvious, but addition would be more confusing if these
properties were not true – and in fact there are similar mathematical structures that don’t have
some of these properties. However, a group is a structure (a set with an operation) that has the
following properties from above: being closed, associative, having an identity, and having
inverses. Groups also may or may not be commutative.
Definition 1 A group (G, ?) is a set G with an operation ?, such that:
1. Identity: There is an element e such that e ? g = g ? e for every e in G.
2. Closure: If g and h are both in G, then g ? h is also in G
3. Inverses: If g is in G, then there is some other element h such that g ? h = h ? g = e. This
value of h is given the notation g −1 (or −g, when ? is addition).
4. Associativity: If we combine three elements g, h, and k (each in G, then (g ? h) ? k is always
the same as g ? (h ? k)
If g ? h = h ? g is true for any g and h in G, then the group is called commutative or abelian.
1
In mathematics, a set of objects is simply a collection of objects, often written like {2, 5, 20} or
{house, carrot, imagination}. It can be useful to think of it as a “list of things”, except that the order doesn’t
matter – something is either in a certain set or not. For example, {1, 2, 3} and {3, 1, 2, 1} are descriptions of the
same set, because “Is X in the set?” has the same answer for both sets, no matter what X is. Sets can also be
infinite and complicated, so that it wouldn’t make sense to write them down – which is why it’s good to think of a
set in terms of what’s in them and what’s not. For example, “the set of prime numbers” is infinite, but it’s easy to
tell whether something is in the set or not, assuming we can tell what a prime number is.
2
2.1 Groups
2 GROUPS AND THE RUBIK’S CUBE
For the integers, we can see that G = Z, “ ? ” = “+00 , and e = 0, making these statements agree
with our earlier look at the integers.
There is also one more important concept which shows up commonly in the study of the cube:
the order of a group or an element.
Definition 2 The order of a group (G, ?) is the number of elements in G, written as |G|.
Definition 3 The order of an element g in a group (G, ?) is the smallest number of times we
have to repeat it to get back to the identity. If we write g ? g ? ... ? g as g n , then the order of g is
{z
}
|
n
the smallest positive integer i such that g i = e. We write the order of g as |g|.
(In case you’re wondering, the reason that these definitions use the same word is that the
powers g 0 , g 1 , ... form a subgroup of of G generated by g which has the same order as g. This
should become more clear when we define those terms below.)
It might be the case that an element doesn’t have an order; for example, 1 doesn’t have an
order in (Z, +); no multiple of 1 is equal to 0. In this case, we often say that the order of 1 is
infinite. However, the following is true:
Theorem 1 If (G, ?) is a finite group (i.e. G has only finitely many elements), then every element
has an order.
Proof: Since G is finite, it can’t be the case that g 0 , g 1 , g 2 , g 3 , ... are all different, so there
must be some element g i that is equal to another element g j that comes after it. Using some
implicit, but hopefully logical, notation: g −i is the inverse of g i , so we can “unwind” to show that
g j−i = g j g −i = g i g −i = e. Thus, j − i (which is positive) is an upper bound for the order of g. It
might be the order, or the order is some smaller number.
Since the Rubik’s Cube has a finite number of states, this means that every algorithm has an
order. For example |R| = 4. However, we’re getting ahead of ourselves; to understand what this
all should mean, let’s take a look at the Rubik’s Cube.
(a) A Rubik’s Cube in
its unique solved state.
(b) A solved Rubik’s
Cube after one move
performed on its right
side: R.
3
(c) An arbitrary arrangement of the cube,
reached by performing
successive turns.
2.2 A Crash Course in Rubik’s Cube Terminology and2 Notation
GROUPS AND THE RUBIK’S CUBE
2.2
A Crash Course in Rubik’s Cube Terminology and Notation
A 3x3x3 Rubik’s Cube has 26 cubies with 54 stickers on 6 faces (9 stickers), which we call {back,
front, up, down, left, right}, abbreviated respectively as {B, F, U, D, L, R}. Of these cubies, there
are 8 edges, 12 corners, and 6 centers. Each side is composed of 9 “cubies” (4 edges, 4 corners,
and 1 center).
A Rubik’s Cube has 432520032744898560002 distinct states that can be reached by performing
moves on the sides of the cube.
If we look the side R and turn it clockwise around the center, then this is also the move denoted
R. If we turn it counter-clockwise, the move is R0 , and if we turn it either way twice it is R2.
Exactly the same holds for other sides in place of R. Note that “do an R move followed by a U 0
move” is traditionally denoted as R U 0 when we are studying the cube. Such a move sequence is
also called an algorithm. These definitions are contrary to mathematical convention, but they
are clearly established for the study of the Rubik’s Cube.
2.3
The Rubik’s Cube
I stated earlier that the Rubik’s Cube is a group. The states of a Rubik’s cube are a set, which
we’ll call G (and which has order |G| = 43252003274489856000). but if we want an operation ?,
what should it in this case? How do you add two states? One reasonable suggestion might be to
consider what happens if we perform some moves on a cube. Lets say we perform some algorithm
A that puts the cube into some state X, and another algorithm B that puts the cube into state Y .
It would be nice if “X ? Y ” happened to be the state you get if you perform the algorithms after
each other, A B. Since every position can be created by some sequence of moves, this allows us
to assume that you can add any two positions, even if you don’t know what moves might create
those positions. It turns out that this exactly defines the group (G, ?).
As with any such definition, we should check whether it is well-defined. There are multiple
algorithms for each state, and it is not guaranteed that the definition always gives us the same
“sum”. To see why this is an issue, consider the algorithms T1 = RU R0 U 0 R0 F R2U 0 R0 U 0 RU R0 F 0
and T2 = R2U 0 R2U R2U D0 R2U R2U 0 R2D on the 4x4x4 cube. Both of these seem to be algorithms
for the same state – the so-called T-Permutation. Now consider the algorithm r (an inner layer),
and what happens if we perform r followed by a T-Permutation: it matters which T-Permutation
we use, because the center pieces are affected differently! r ? T1 6= r ? T2 ; the action is not faithful
unless we can distinguish all the cubies, e.g. when the center cubies are labeled on a supercube.
Therefore, the states of the 4x4x4 cube do not form a group unless we are careful about the
definition of “state”.
It turns out that this is not an issue on the 3x3x3 cube. This cube is small enough (each
cubie) that it is possible to tell where every sticker in a state came from. Any algorithms for
the same state move the stickers in the same permutation. Therefore, on a 3x3x3 cube it is
justifiable to think of RU R0 U 0 R0 F R2U 0 R0 U 0 RU R0 F 0 and R2U 0 R2U R2U D0 R2U R2U 0 R2D, as well
as the resulting state, all as representations of the T-Permutation. A state and an algorithm
describe exactly the same thing, although there are only a finite number of states, and an infinite
number of algorithms for each state. Both algorithms result in the same state starting from solved,
2
It is an amusing curiosity that the number of unsolved states of the Rubik’s Cube, 43252003274489855999,
is a prime number, as observed by Chris Hardwick and discussed in June of 2007. By the prime number theorem,
this is not particularly likely. I found this property to hold for the 3x3x3, 8x8x8, and 11x11x11 cube, but no others
until 40x40x40.
4
2.4 Getting Specificational: Permutations
2 GROUPS AND THE RUBIK’S CUBE
and it is guaranteed that “applying” these algorithms to other states will always give the same
result. In fact, we can omit the ? symbol because composing moves by writing them one after the
other is well-defined. In group theory, the operation of such a group (where we take an element
and apply it to another one to get a new element) is often called a composition. This is why
cube theorists sometimes use descriptions like “the superflip position composed with four dots”.
It is possible to think about cube moves mainly as permutations, but it is often nice to understand
that we could always just “do the moves” because the cube is a group. For example, R is a 1-move
algorithm, and R R0 is an alternate description for the solved state
Now, our definitions for the Rubik’s Cube satisfy all the properties of a group:
• The set of elements, G, is simply the set of Rubik’s Cube states (all of which which can be
described by multiple algorithms), and ? is composition
• Identity: e is the solved state. Composing the solved state with something is idempotent (it
does nothing to the other state).
• Closure: Two states can always be composed.
• Inverses: Every state has an inverse. For example, the inverse of (RU R0 F 0 U 2) is (U 2F RU 0 R0 ).
However, (L0 U 0 LF U 2) is also an inverse. Thus it must be the case that (RU R0 F 0 U 2) =
(L0 U 0 LF U 2); both of these are descriptions of the inverse state.
• Associativity: If we have compose three states as A1 A2 A3 , it makes no difference if we
compute an algorithm for A2 A3 and apply it to A1 , or if we apply A3 to an algorithm for A1 ;
the result is the same.
2.4
Getting Specificational: Permutations
Although it is good to establish the intuitive grounds behind the definitions, it is important to
understand how puzzles like the Rubik’s Cube are mathematically defined, and that is through
permutations. A permutation is an operation that takes some arrangement elements, and shuffles
them around. For example, when we scramble a Rubik’s Cube we move its 26 cubies around, and
when we are done each cubie has moved to some new position (possibly the same). We say that
our scramble induces a permutation on the cubies. Similarly, it induces a permutation on the
stickers of the cube. Ignoring the centers of each side (because they never move), the Rubik’s Cube
(a) The result of either T-Permutation algorithm
(b) The algorithm r
(c) The algorithm (r) ?
(R U R’ U’ R’ F R2 U’
R’ U’ R U R’ F’)
5
(d) The algorithm (r) ?
(R2 U’ R2 U R2 U D’
R2 U R2 U’ R2 D)
2.4 Getting Specificational: Permutations
2 GROUPS AND THE RUBIK’S CUBE
has 48 stickers that can move. If we perform the move R, then the sticker at position 25 moves
to the one at position 27, which moves to 32, which moves to 30, which moves back to 25. This
is called a cycle of stickers, and it would be written as (25 27 32 30). It should make sense that
every state can be described in terms of how it moves the stickers. For example, R induces the
permutation with the following cycle structure:
R = (25 27 32 30)(26 29 31 28)(3 38 43 19)(5 36 45 21)(8 33 48 24)
We can figure out where any sticker goes by looking at the cycles: sticker 38 moves to the
position of sticker 43; sticker 28 to 26 (since a cycle permutes the stickers in a loop), and sticker 20
stays in place because it is not moved by any cycle. Instead of writing down the cycle structure,
it would also have been useful
to write down the actual permutation map (1 7→ 1, 2 7→ 2, 3 7→
1 2 3 4 5 ...
38, 4 7→ 5, 7→ 36, ...), or
that tells us where each sticker goes, but this
1 2 38 4 36 ...
takes up more space.
It should now make more sense to say what a state is: A state of the Rubik’s Cube is a
permutation of stickers. An algorithm that turns a solved cube into that state will induce exactly
the same permutation of stickers. If we want to compose two permutations, we simply figure
out where each sticker is moved by the first permutation, then where the new position of that
sticker is moved by the second permutation. Thus, R U is the state defined by the permutation
that results when we compose R with U – which exactly corresponds to how the stickers permute
when we apply those two moves. However, we can now do something cooler: If we have two
arbitrary states of the Rubik’s Cube, we can compose them just by composing their successive
permutations, without having to find an algorithm for either one! That we can also do this
through algorithms (such that the answer is well-defined) comes from the fact that an algorithm is
really just an explicit composition of the permutations that are basic moves of the Rubik’s Cube.
To be precise, the permutations of the stickers are not the Rubik’s Cube group; instead, the
Rubik’s Cube group acts on the set of stickers, and the permutations of this group action give
us full information about the Rubik’s Cube group.
1
2
3
4
U
5
6
7
8
9
10 11 17 18 19 25 26 27 33 34 35
12
L
13 20
F
21 28
R
29 36
B
37
14 15 16 22 23 24 30 31 32 38 39 40
41 42 43
44
D
45
46 47 48
(a) Labeled Stickers in a flattened view
6
3 SUBGROUPS
3
Subgroups
A subgroup is simply a group that is contained in another group. We can use the definition of a
group to find out the criteria for a subgroup.
Definition 4 (H, ?) is a subgroup of (G, ?) if H is a subset of G (that is, all the elements of H
are also in G), and the following are true:
• e is in H, since every group must have an identity, and e is the only candidate in G
• H is closed: if h1 and h2 are in H, then h1 ? h2 must also be in H for H to be a group.
• H has inverses: for any h in H, the inverse of h must also be. Else, again, the corresponding
condition of a group is violated.
• The elements of H are associative: This is automatically true, since
Proposition 1 If R is in a certain subgroup of the Rubik’s Cube, so are R2 and R0 .
Proof: Since a subgroup is closed, R ? R = R2 must also be in the group. The inverse, R0 ,
must also be in the group.
What if we consider a subgroup containing both R and U ? It must be the case that R2 and R0
are in the subgroup, as well as R U and U 0 R2... In fact, any algorithm composed of U -turns and
R-turns must be in the subgroup. There may be other states in the subgroup (we didn’t say L is
not in the group), but there are at least some elements that must be guaranteed to be in such a
subgroup. In fact, mathematicians have a name for “the smallest subgroup containing R and U ”;
it is called “the subgroup generated by R and U ”.
Definition 5 Given a set of elements of a group G (e.g. cube states) g1 , g2 , g3 , ..., the smallest
subgroup containing all these elements is denoted hg1 , g2 , g3 , ...i. Such a subgroup always exists.
Proof: Note that such a set is always guaranteed to exist, since G is such a set. However, it
is not immediate that this set is a subgroup. However, let’s consider all subgroups that contain
g1 , g2 , g3 , .... There must be some minimum set that they all contain (their intersection); let’s
call it X. If we go through the conditions of a group again, we can show that X must be a group:
• X contains the identity, since it is an element of all these subgroups.
• X is closed: If g and h are in X, they are in every one of these subgroups, as is g ? h
• X contains the inverses of all its elements. Similarly, if g is in X, it is in all these subgroup,
as is gı – thus gı is in X.
• X is obviously associative, since the original group was.
Thus, by definition, X “forces” itself to be a group, and it is unambiguous. to refer to a
subgroup generated by any arbitrary elements.
Theorem 2 Any subgroup can be written using generators.
Proof: If we take a subgroup, and consider the subgroup generated by its elements, we obviously get. . . itself. There may be a shorter representation, but there is guaranteed to be a
presentation using generators.
7
3.1 Examples of Rubik’s Cube Subgroups
3.1
3 SUBGROUPS
Examples of Rubik’s Cube Subgroups
Here are some examples of subgroups of the cube:
• Cubers will be familiar with hR, U i as a two-gen subgroup.
• The so-called PLLs (Permutations of the Last Layer, preserving sticker orientation) form
a subgroup. It can be written as h U , R0 U R0 U 0 R0 U 0 R0 U RU R2 , RB 0 RF 2R0 BRF 2R2 i,
which might be characterized as “the PLLs are generated by U , the U b-Permutation, and
the Ab-Permutation”. (If you have a cube in front of you, try it out! Set up an arbitrary
PLL and check that it can be solved by U b and Ab and AUFs (U -turns).)
• hei is the trivial subgroup. It might seem silly, but it it is good to know by name. For
example, the trivial subgroup is a subgroup of every other subgroup.
• Every valid state of the Rubik’s Cube can be reached by turning the six sides. Thus, G =
hB, F, U, D, L, Ri.
• It is possible to generate any move using the 5 others, as with D = (L0 RF 2B2L0 R)U (L0 RF 2B2L0 R).
Thus, it is also true that G = hB, F, U, L, Ri, since any state has algorithm using all six generators that can be rewritten by replacing D using that algorithm. Informally, we might say
that the cube is five-gen.
• It is a little-known fact (among cubers) that the entire Rubik’s Cube Group is actually twogen. For example, G = hU BLU L0 U 0 B 0 , R2F LD0 R0 i. There are some interesting properties
which were discussed on the speedsolving.com forum in March 2010. In particular, Tom
Rokicki found that if you take any two random states of the cube, there is about a 50%
chance that these two states generate the entire group. It is also possible to generate the
cube using two elements of orders as low as 2 and 4; this demonstrates how easily two
algorithms can “tangle”.
• If we take any state X of the Rubik’s Cube, then hXi is a subgroup. Notice that it is
always commutative, since all the elements can be written as X i for some integer i, and
that |X| = |hXi|. It is sometimes unclear to cubers that repeating an algorithm repeatedly
will always return to the original state, but this is a simple fact from a group-theoretic
perspective.
We can ask what possible subgroups we can get from this, and the most important question
to answer is what possible orders subgroups generated by one element can have (note from
above that two elements can already generate the entire group). By analyzing all possible
cycle decompositions of states, we can show that there are 73 possible orders:
{1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 18, 20, 21, 22,
24, 28, 30, 33, 35, 36, 40, 42, 44, 45, 48, 55, 56, 60, 63, 66, 70, 72,
77, 80, 84, 90, 99, 105, 110, 112, 120, 126, 132, 140, 144, 154, 165, 168, 180, 198,
210, 231, 240, 252, 280, 315, 330, 336, 360, 420, 462, 495, 504, 630, 720, 840, 990, 1260}
The solved state is obviously the only state with order 1. There are many states with order
1260, such as RF 2B 0 U B 0 . There was a discussion in August 2010 that covers a little more
8
3.1 Examples of Rubik’s Cube Subgroups
3 SUBGROUPS
of this. It is also known how many states of each order, as reported in the Cubic Circular by
David Singmaster in 1982, but the combinatorics behind this are more complicated.
• Sylow Subgroups are important subgroups of any group. The order of the Rubik’s Cube
factors as:
|G| = 43252003274489856000 = 227 · 314 · 53 · 72 · 111
One of the Sylow’s theorems states that G contains certain Sylow subgroups of each of the
following orders: 22 7, 31 4, 53 , 72 , 111 , and that the Sylow subgroups of any given order are
all “similar” in an important sense (they are conjugate). I constructed explicit generators
for these groups in March 2010.
9

Download Report

math120_wim.pdf

Paperzz.com

Your Paperzz