A Tale of Two Subgroups

Two subgroups and semi-direct products
1
First remarks
Throughout, we shall keep the following notation: G is a group, written
multiplicatively, and H and K are two subgroups of G. We define the
subset HK of G as follows:
HK = {hk : h ∈ H, k ∈ K}.
It is not in general a subgroup. Note the following:
Proposition 1.1. Suppose that H ∩ K = {1}. Then the function F : H ×
K → G defined by
F (h, k) = hk
is injective and its image is HK. In particular, if H and K are both finite
and H ∩ K = {1}, then
#(HK) = #(H) · #(K).
Proof. Clearly, the image of F is HK by definition. To see that F is injective, suppose that
F (h1 , k1 ) = F (h2 , k2 ).
−1
Then by definition h1 k1 = h2 k2 . Thus h−1
2 h1 = k2 k1 . Since H is a
−1
subgroup, h2 h1 ∈ H, and since K is a subgroup, k2 k1−1 ∈ K. Thus
−1
−1
h−1
∈ H ∩ K = {1}, and so h−1
= 1. It follows
2 h1 = k2 k1
2 h1 = k2 k1
−1
that h−1
h
=
1,
so
that
h
=
h
,
and
similarly
k
k
=
1
so
that k2 = k1 .
1
1
2
2
2
1
Thus (h1 , k1 ) = (h2 , k2 ) and F is injective.
Corollary 1.2. Suppose that G is finite, that H and K are two subgroups
of G with H ∩ K = {1} and that #(H) · #(K) = #(G). Then the function
F : H × K → G defined by F (h, k) = hk is a bijection from H × K to G.
1
Proof. Proposition 1.1 says that F : H ×K → G is injective, but both H ×K
and G have the same number of elements, so F is surjective as well, hence
a bijection.
The next proposition provides some examples where the hypothesis H ∩
K = {1} is satisfied.
Proposition 1.3. Suppose that H and K are finite subgroups of a group G.
(i) If the orders #(H) and #(K) are relatively prime, then H ∩ K = {1}.
(ii) If #(H) = p is a prime number and H is not contained in K, then
H ∩ K = {1}. For example, if #(H) = #(K) = p and H 6= K, then
H ∩ K = {1}.
Proof. (i) was a homework problem (by Lagrange’s theorem, the order of
H ∩ K must divide both #(H) and #(K) and hence must be 1). To see (ii),
note that, since #(H) is prime, it is isomorphic to Z/pZ, and so the only
subgroups of H are either H or {1}. But H ∩ K is a subgroup of H, not
equal to H since otherwise H ⊆ K. So H ∩ K = {1}.
We now consider the case where one of the subgroups, say H, is a normal
subgroup of G. In this case, we have the following:
Proposition 1.4. Suppose that H C G and that K ≤ G. Then HK is a
subgroup of G, not necessarily normal. Moreover, H C HK and K ≤ HK.
Proof. HK is closed under multiplication since
(h1 k1 )(h2 k2 ) = h1 (k1 h2 k1−1 )k1 k2 ,
and this is in HK since, as H is normal, k1 h2 k1−1 ∈ H. Identity: since 1 ∈ H
and 1 ∈ K, 1 = 1·1 ∈ HK. Inverses: (hk)−1 = k −1 h−1 = (k −1 h−1 k)k −1 and
this is in HK since, again by normality, k −1 h−1 k ∈ H. Clearly H ≤ HK
and K ≤ HK: for example, 1 ∈ K and, for all h ∈ H, h = h · 1 ∈ HK.
Thus H ⊆ HK, hence H ≤ HK, and the proof that K ≤ HK is similar.
Since H C G, H C HK as well.
The question as to what the quotient group HK/H looks like is answered
by the Second Isomorphism Theorem:
Theorem 1.5. Let H and K be two subgroups of G, with H C G. Then
HK is a subgroup of G, H C HK, H ∩ K is a normal subgroup of K, and
HK/H ∼
= K/(H ∩ K).
2
In particular, if H and K are both finite , then
#(HK) =
#(H) · #(K)
.
#(H ∩ K)
Proof. We have seen that HK ≤ G and that H C HK. Also, H ∩ K ≤ K,
and, if x ∈ H ∩ K, then, for all k ∈ K, kxk −1 ∈ H since x ∈ H and H is
normal, and kxk −1 ∈ K since x ∈ K and K is a subgroup. Thus H ∩ K is
a normal subgroup of K.
To prove the theorem, we shall find a surjective homomorphism f from
K to HK/H whose kernel is H ∩ K. The First Isomorphism Theorem
then says that K/(H ∩ K) ∼
= HK/H. Define f (k) = kH, the H-coset
containing k. Thus f is the composition π ◦ i, where i : K → HK is the
inclusion and π : HK → HK/H is the quotient homomorphism. Hence f
is a homomorphism since it is a composition of two homomorphisms, and
clearly Ker f = Ker π ∩ K = H ∩ K.
We will be done if we show that f is surjective, in other words that
every coset xH, with x ∈ HK, is equal to some coset of the form kH with
k ∈ K. But since x ∈ HK, x = hk for some h ∈ H and k ∈ K. Since
H is normal, hk = kh0 for some h0 ∈ H, and clearly kh0 H = kH. Thus
xH = hkH = kh0 H = kH = f (k), so that f is surjective. The final equality
holds since then
#(HK)/#(H) = #(K)/#(H ∩ K).
In case neither of H, K is normal, a somewhat more involved argument
along the lines of the proof of Proposition 1.1 shows the following counting
formula:
Proposition 1.6. Let H and K be two subgroups of G, and define F : H ×
K → G as before by F (h, k) = hk. Then the image of F is HK. Moreover,
for all x ∈ HK, there is a bijection from the inverse image F −1 (x) to the
subgroup H ∩ K. In particular, if H and K are both finite , then
#(HK) =
2
#(H) · #(K)
.
#(H ∩ K)
Direct products and semi-direct products
We turn now to a construction which generalizes the direct product of two
groups. To set the background, we start with the following:
3
Theorem 2.1. Suppose that H and K are two subgroups of a group G such
that
(i) H ∩ K = {1};
(ii) HK = G;
(iii) For all h ∈ H and k ∈ K, hk = kh.
Then G ∼
= H × K.
Proof. Define F : H × K → G by: F (h, k) = hk. We shall show that F is an
isomorphism. By Proposition 1.1, the function F is injective, and its image
is HK = G by assumption (ii). Thus F is a bijection. To see that F is an
isomorphism, note that
F ((h1 , k1 )(h2 , k2 )) = F (h1 h2 , k1 k2 ) = (h1 h2 )(k1 k2 ) = h1 (h2 k1 )k2
= h1 (k1 h2 )k2 = (h1 k1 )(h2 k2 )
= F (h1 , k1 )F (h2 , k2 ).
Thus F is an isomorphism.
Example 2.2. The abelian group C∗ contains two subgroups, R>0 and
U (1), the subgroup of positive real numbers (under multiplication). Clearly
R>0 ∩ U (1) = {1}, since a positive real number of absolute value 1 is 1.
z
Also, every nonzero complex number z can be written as z = |z| · , where
|z|
|z| ∈ R>0 and z/|z| ∈ U (1). Hence C∗ = R>0 U (1). By the theorem,
C∗ ∼
= R>0 × U (1).
Let us give an application to finite groups where we use this criterion:
Proposition 2.3. Let p be a prime and let G be a group of order p2 . The
either G ∼
= Z/p2 Z or G ∼
= (Z/pZ) × (Z/pZ).
Proof. We have seen previously that G is abelian. By Lagrange’s theorem,
every element of G, not the identity, has order p or p2 . If there exists an
element of G whose order is p2 , then G is cyclic of order p2 and hence G ∼
=
Z/p2 Z. Thus we may assume that every element of G, not the identity, has
order p. Choose x ∈ G, x 6= 1. Then hxi has order p. In particular, hxi =
6 G.
Choose an element y ∈ G, y ∈
/ hxi. Then y has order p as well. Since hxi and
hyi are two different subgroups of G, by Proposition 1.3, hxi ∩ hyi = {1}. By
Corollary 1.2, as #(hxi)#(hyi) = p2 = #(G), hxihyi = G. Thus Conditions
(i) and (ii) above are satisfied, and Condition (iii) is automatic since G is
abelian. Thus G ∼
= hxi × hyi ∼
= (Z/pZ) × (Z/pZ).
4
In general, the condition above that, for all h ∈ H and k ∈ K, hk = kh
(i.e. the condition that every element of H commutes with every element
of K), or equivalently the condition that G ∼
= H × K, is too restrictive,
and we want to relax it a bit and as a by-product describe a new class of
interesting groups. To motivate the construction, consider the following set
of assumptions on a pair of subgroups H and K of a group G:
Assumption. Let G be a group and let H and K be subgroups of G such
that:
(i) H ∩ K = {1};
(ii) HK = G;
(iii) H is a normal subgroup of G.
Note: We do not require that K is also a normal subgroup of G. In fact, if
K is also a normal subgroup of G, then, as we have seen in the homework,
for all h ∈ H and k ∈ K, hk = kh (by considering the product hkh−1 k −1
which lies in both H and K), so we are back in the situation where G is
isomorphic to the direct (Cartesian) product H × K of H and K.
Using the data above, let us see how to understand the group structure
on H. By Proposition 1.1 and the assumptions (i) and (ii) above, we see that
every element of G can be written uniquely as hk for h ∈ H and k ∈ K,
i.e. the function F : H × K → G defined by F (h, k) = hk is a bijection. To
understand the group law on G, or equivalently on H × K, note that
(h1 k1 )(h2 k2 ) = (h1 · (k1 h2 k1−1 ))(k1 k2 ),
which is of the form h0 k 0 for some (in fact unique) h0 ∈ H, k 0 ∈ K since
H is normal. To rewrite this operation in another way, define a function
f : K → Aut H by f (k) = ik , where ik denotes conjugation by k, i.e. f (k)
is the automorphism of H defined by f (k)(h) = khk −1 . Then we see that
multiplication in G is given by the rule
(h1 k1 )(h2 k2 ) = (h1 · f (k1 )(h2 ))(k1 k2 ).
We can ask if it is possible to reverse this process. Concretely, given two
abstract groups H and K and a homomorphism f : K → Aut H, define a
binary operation on the Cartesian product H × K by the rule
(h1 , k1 ) ∗ (h2 , k2 ) = (h1 · f (k1 )(h2 ), k1 k2 ).
5
Note that this operation depends on the choice of f . In particular, ∗ is the
usual product operation in H × K ⇐⇒ f is the trivial homomorphism
from K to Aut H, i.e. f (k) = Id for all k ∈ K.
Proposition 2.4. Let H and K be two groups and let f : K → Aut H be a
homomorphism. Define a binary operation ∗ on H × K by the formula
(h1 , k1 ) ∗ (h2 , k2 ) = (h1 · f (k1 )(h2 ), k1 k2 ).
Then H × K, together with this operation, is a group H o K, the semi-direct
product of H and K. Under this operation, H × {1} is a normal subgroup
of H o K isomorphic to H, {1} × K is a subgroup of H o K isomorphic
to K, and the conjugation action of K ∼
= {1} × K on H × {1} ∼
= H can be
identified with the action given by the automorphism f .
The notation o is meant to suggest both the product × and the fact that
H corresponds to a normal subgroup of H o K. Of course, the definition of
the group operation depends not just on the groups H and K but also on
the homomorphism f : K → Aut H. To make this dependence explicit, we
sometimes write H of K for the semi-direct product.
Proof of Proposition 2.4. As usual, the hardest part to check is associativity:
((h1 , k1 ) ∗ (h2 , k2 )) ∗ (h3 , k3 ) = (h1 · f (k1 )(h2 ), k1 k2 ) ∗ (h3 , k3 )
= (h1 · f (k1 )(h2 ) · f (k1 k2 )(h3 ), k1 k2 k3 )
= (h1 · f (k1 )(h2 ) · f (k1 )(f (k2 )(h3 )), k1 k2 k3 ),
where we have freely used associativity in G and the fact that f is a homomorphism. On the other hand,
(h1 , k1 ) ∗ ((h2 , k2 ) ∗ (h3 , k3 )) = (h1 , k1 ) ∗ (h2 · f (k2 )(h3 ), k2 k3 )
= (h1 · f (k1 )(h2 · f (k2 )(h3 )), k1 k2 k3 )
= (h1 · f (k1 )(h2 ) · f (k1 )(f (k2 )(h3 )), k1 k2 k3 ),
where we have also used the fact that f (k1 ) is an automorphism of H. Hence
∗ is associative. Clearly
(1, 1) ∗ (h, k) = (1 · f (1)(h), 1 · k) = (Id(h), k) = (h, k),
and
(h, k) ∗ (1, 1) = (h · f (k)(1), k · 1) = (h, k),
6
since f (k) is an automorphism and hence f (k)(1) = 1. Finally, it is straightforward to check that the inverse of (h, k) is (f (k)−1 (h−1 ), k −1 ), where the element f (k)−1 (h−1 ) is the inverse function of the automorphism f (k) (which
is f (k −1 ) since f is a homomorphism) applied to the element k −1 of K:
(h, k) ∗ (f (k)−1 (h−1 ), k −1 ) = (h · f (k)(f (k)−1 (h−1 )), kk −1 )
= (hh−1 , kk −1 ) = (1, 1),
whereas
(f (k)−1 (h−1 ), k −1 ) ∗ (h, k) = (f (k)−1 (h−1 ) · f (k −1 )(h), k −1 k)
= (f (k)−1 (h−1 h), 1) = ((f (k)−1 (1), 1) = (1, 1),
where we have again used the fact that f (k −1 ) is an automorphism, as well
as f (k −1 ) = f (k)−1 .
Note that, given h1 and h2 in H,
(h1 , 1) ∗ (h2 , 1) = (h1 · f (1)(h2 ), 1 · 1) = (h1 · Id(h2 ), 1) = (h1 h2 , 1).
Hence the function g1 : H → H o K defined by g1 (h) = (h, 1) is a homomorphism from H to H o K whose image is clearly H × {1}, showing that
H × {1} is a subgroup of H o K. Since g1 is clearly injective, it induces an
isomorphism from H to H × {1}. Likewise, since, given k1 , k2 ∈ K,
(1, k1 ) ∗ (1, k2 ) = (1 · f (k1 )(1), k1 k2 ) = (1, k1 k2 ),
the function g2 : K → H o K is an injective homomorphism whose image is
{1} × K, so that {1} × K is a subgroup of H o K isomorphic to K. Note
that
(h, 1) ∗ (1, k) = (h · f (1)(1), 1 · k) = (h, k),
so that every element of H o K can be (uniquely) written as a product of
an element in H × {1} times an element in {1} × K. The same statement
is true in the other order, but it is more complicated since (1, k) ∗ (h, 1) =
(f (k)(h), k). We then have the formula
(1, k) ∗ (h, 1) ∗ (1, k)−1 = (f (k)(h), k) ∗ (1, k −1 )
= (f (k)(h) · f (k)(1), kk −1 ) = (f (k)(h), 1).
This says that conjugation by an element of {1} × K takes H × {1} to itself,
and since H × {1} is a subgroup, conjugation by an element of H × {1}
also takes H × {1} to itself. Since every element of H o K is a product of
7
an element in H × {1} and an element in {1} × K, H × {1} is a normal
subgroup of H o K. Moreover, identifying H × {1} with H via g1−1 and
identifying {1} × K with K via g2−1 , the conjugation action of {1} × K on
H × {1} is identified with the action of K on H via the homomorphism f .
This concludes the proof of Proposition 2.4.
Then the remarks before the statement of Proposition 2.4 show the following:
Theorem 2.5. Let G be a group and let H and K be two subgroups of G
such that
(i) H ∩ K = {1};
(ii) HK = G;
(iii) H is a normal subgroup of G.
Then there is a homomorphism from K to Aut H defined by conjugation.
Using this homomorphism to define H oK, there is an isomorphism F : G →
H o K such that F (H) is the normal subgroup H × {1} and F (K) is the
subgroup {1} × K.
Remark 2.6. If G is isomorphic to a semi-direct product of H and K, then
identifying H with the corresponding subgroup of G, H is a normal subgroup
of G and G/H ∼
= K by the second isomorphism theorem, Theorem 1.5 (or
directly). This says that, in the case where G contains a normal subgroup
H such that G/H ∼
= K, and moreover G is isomorphic to the semi-direct
product H o K, then we can reconstruct G from the smaller building blocks
H and K, together with the knowledge of the homomorphism f : K →
Aut H.
However, while the construction of a semi-direct product is important in
group theory (and we shall give many examples below), one should not be
too optimistic for its applicability. To say that a group G is a semi-direct
product, one needs a normal subgroup H and another subgroup K such that
the quotient homomorphism π induces an isomorphism from K to G/H.
This is often expressed as follows: we can “lift” the quotient group G/H to
a subgroup of G. This usually turns out to be impossible, even in very simple
cases. For example, the quotient of Z/4Z by the subgroup H = h2i ∼
= Z/2Z
is isomorphic to Z/2Z, but there is no subgroup K of Z/4Z which maps
isomorphically onto the quotient Z/2Z, and Z/4Z is not isomorphic to a
product, semi-direct or otherwise, of Z/2Z with Z/2Z. There is a gadget
8
(group cohomology) which classifies (in a certain sense) all groups G which
have a normal subgroup H and such that the quotient is isomorphic to K,
in terms of H and K and a homomorphism f : K → Aut H, provided that
H is abelian, but there is no mechanism for handling the general case.
Example 2.7. (1) Every direct product H × K is a semi-direct product
with f : K → Aut H the trivial homomorphism (f (k) = Id for all k ∈ K).
(2) The dihedral group Dn of order 2n is a semi-direct product. Here H =
hρi is the cyclic subgroup of order n consisting of the rotations, and K = hτ i
is the order 2 subgroup generated by a reflection. Via the discussion of HW
6, Problem 2, H = hA2π/n i and K = hRi . Since H has index 2, it is normal,
and τ · ρ · τ −1 = τ · ρ · τ = ρ−1 . In terms of the matrices A2kπ/n and R, since
R has order 2 and hence R−1 = R, we have
RA2kπ/n R−1 = RA2kπ/n R = A−2kπ/n .
Thus Dn ∼
= Z/nZoZ/2Z, where the homomorphism f : Z/2Z → Aut(Z/nZ)
is given by f (0) = Id and f (1) = − Id.
(3) Similarly, the group O2 contains
the
normal subgroup SO2 and the
1 0
, in the notation of Problem 2 of
subgroup hRi, where R = B0 =
0 −1
HW 3. Since R has order 2 and, and, for all A ∈ SO2 , RAR−1 = A−1 , we
see that O2 is a semi-direct product SO2 o {±1} ∼
= U (1) o {±1}. Here, the
corresponding homomorphism f : {±1} → Aut U (1) is defined by f (1) = Id
and f (−1) is the automorphism of U (1) sending z to z −1 .
(4) Let G be the group of linear-affine functions ` : R → R under function
composition. Here a linear affine function is a function of the form `(x) =
ax+b with a 6= 0. There are two very natural subgroups of G: the translation
subgroup H consisting of all functions of the form tb (x) = x + b, and the
similitudes K, consisting of all functions of the form ma (x) = ax, a 6= 0.
Note that H ∼
= R and K ∼
= R∗ . Clearly H ∩ K = {Id} and, if `(x) = ax + b,
then ` = tb ◦ma . Finally, a calculation shows that ma ◦tb ◦m−1
a = tab . Hence
H is normal, and the action of K ∼
= R∗ on H ∼
= R is by sending ma to the
automorphism of the (additive) group R given by multiplication by a. Thus
G∼
= R o R∗ using the action described above.
A very similar argument works for the group G of affine-linear transformations of Rn ; here L : Rn → Rn is affine-linear if it is of the form L(x) =
Ax + b, where A ∈ GLn (R) and b ∈ Rn , with L−1 (x) = A−1 x − A−1 b.
Then G has a subgroup isomorphic to Rn (the translation subgroup) and
a subgroup isomorphic to GLn (R). In this case, if Tp is translation by p,
9
so that Tp (x) = x + p, and L(x) = Ax + b is as above, then a calculation
shows that
(L ◦ Tp ◦ L−1 )(x) = x + Ap = TAp (x).
Hence the translation subgroup is a normal subgroup isomorphic to Rn ,
GLn (R) is a (non-normal) subgroup of G, and G ∼
= Rn o GLn (R), where the
n
action of GLn (R) on R is the usual one (matrix multiplication).
Finally, one often (especially in physics, and for n = 3) considers the
Euclidean group, the subgroup E of G, where A ∈ SOn , so an element L
of E is of the form L(x) = Ax + b, A ∈ SOn . Thus A is a rigid motion
of Rn fixing the origin followed by a translation. In case n = 4 replacing
SOn by the Lorentz group gives a group of symmetries well-adapted to
electromagnetism and special relativity.
(5) As in the homework, let B be the group of upper triangular 2×2 matrices:
a b
: a, b, d ∈ R, ad 6= 0 .
B=
0 d
Inside B there are two natural subgroups: the subgroup T of strictly upper
triangular matrices,
1 b
:b∈R ,
T =
0 1
and the subgroup D of diagonal matrices D:
a 0
D=
: a, d ∈ R, ad 6= 0 .
0 d
Clearly T ∩ D = {Id}, and since
a 0
a b
1 bd−1
,
=
0 d
0 d
0
1
we see that T · D = B. We have seen in the homework that T C B. Hence
B ∼
= T o D. In this case, the action of D ∼
= (R∗ )2 on T ∼
= R is given by:
∗
2
(a, d) ∈ (R ) is sent to the automorphism of R given by multiplication by
ad−1 .
(6) Let G be a group of order pq, where p and q are distinct primes and
p < q. If P is a p-Sylow subgroup and Q is the q-Sylow subgroup, then we
have seen in class that QCG and P ∩Q = {1} since their orders are relatively
prime. Finally, QP = G since #(G) = #(Q)#(P ), by Corollary 1.2. Hence
G∼
= Q o P.
10
The additional piece of information is the homomorphism f : P ∼
= Z/pZ →
Aut Q ∼
= Aut(Z/qZ). Clearly f is trivial ⇐⇒ G is abelian ⇐⇒ P is normal ⇐⇒ there is a unique p-Sylow subgroup. Otherwise, f corresponds
to a non-trivial homomorphism from Z/pZ to Aut(Z/qZ). As we have seen,
Aut(Z/qZ) = (Z/qZ)∗ is a group of order q − 1, and since Z/pZ is simple,
f is non-trivial ⇐⇒ f is injective. This is only possible if p divides q − 1,
by Lagrange’s theorem, since the image of f is then a subgroup of (Z/qZ)∗
of order p. Of course, by part (3) of the Sylow theorem, we know that the
number of p-Sylow subgroups is a divisor of pq and ≡ 1 mod p, so if this
number is not 1, it must be q and we must have q ≡ 1 mod p, i.e. p divides
q − 1. In fact, using the fact that (Z/qZ)∗ is cyclic, one can show that every
two nonabelian groups of order pq are isomorphic.
11