How to Count
November 19, 2023
In the map of mathematics, combinatorics (the study of counting) is a continent. Counting is fundamental to everything we do, so it's not a shock that it would occupy such a prominent spot in the collective consciousness of mathematicians.
You might think counting is easy; indeed, we learn to count from a young age. But counting the number of possible configurations of a Rubik's Cube, for example, is anything but easy. In such a scenario, the combinatorist's primary tool is symmetry.
Consider how multiple different configurations of a Rubik's Cube are the same when you spin the cube around -- how we handle these symmetric arrangements has everything to do with counting problems.
Groups and Symmetry
You do not need to know anything about groups to understand this article. Everything you need to know is in this box.
Groups
A group is a collection of (typically abstract) objects and some way of combining them. For example, consider the set of all ways to rotate a cube:
- Combining two rotations gives you another rotation
- We consider rotating by 0 degrees, i.e. doing nothing, as a rotation (the identity rotation)
It's always helpful to give names, so let's call the rotations of a cube , and let be a 90° rotation, so (the symbol means "is contained in"). Then, we use the to apply the rotation to the cube.
Importantly, observe that takes us from one coloring of the cube to a different coloring (we care about orientation, so red in front vs. blue in front matters). We need to make the distinction between rotations of the cube and the cube colorings themselves; continuing to name things, we'll call the set of cube colorings, and the set of rotations as before. Convince yourself that, if we have colors, there are colorings in .[1]
In the above diagram, we have two colorings and , related by the equation . In particular, we have
- and .
- eats an and an , and spits out an , in the form .
Obviously, if you have a cube that's all red, rotating it around isn't gonna change the coloring. But it gets more complicated: for example, a cube that's green on the top and bottom, and yellow on the sides. Rotating up or down changes the coloring, but left and right doesn't.
Algebraically, we'd write this as , and we'd say that fixes , or that is a fixed point of .
Orbits and Stabilizers
Let's flesh out this idea of fixed points. If we call the green-and-yellow cube from before , it might be helpful to think about which rotations fix . Counting them, there are 8 total (identity, spinning around, flipping). We call this collection of "fixing" rotations the stabilizer of , or , or just (I'm partial to the latter for readability).
What about everything else not in the stabilizer? Well, there are two other colorings that we could rotate to; we call this collection of "neighbor" colorings the orbit of , or , or just (I'm partial to the latter, orb is a cool word). In this case, the orbit of has three elements:
In general, orbit and stabilizer are inversely proportional; when stabilizer is big, orbit is small, and when stabilizer is small, orbit is big. Let's work through a couple examples.
Consider a cube that's all red. Every rotation gives you the same coloring, so the stabilizer is all of . However, we're not able to get to any other colorings, so the orbit is just the cube itself.
Consider a cube that's different colors on all six sides. Every rotation changes the coloring, so the stabilizer is just the identity rotation. For the same reason, the orbit has one coloring for each rotation, so it's much larger.
The orbit-stabilizer theorem tells us that orbit and stabilizer vary inversely to each other.[2] This should be intuitive: if lots of rotations fix your coloring, then they can't bring you to many new ones. Inversely, if your rotations generally don't keep you at the same coloring, they must bring you to lots of new ones.
Burnside's Lemma
This is the crown jewel. The usual formula looks like a bunch of notation, so I'll write it out in English:
In other words,
There's a lot going on here, but the underlying reasoning is pretty simple. If the size of stabilizers is large, then the size of orbits must be small (by the orbit-stabilizer theorem). If the orbits are small, there must be many of them, since each coloring is in some orbit. So large stabilizers equals many orbits.[3]
The upshot is that we can use this formula to count orbits while only thinking about stabilizers. For example, say I want to count the number of colorings of the cube up to rotation, which if you think about it, is just asking how many orbits there are. This is normally a hard problem, but Burnside's Lemma lets us do it by counting fixed points, which is way easier.
- For each face we have colors to choose from, i.e. there are ways to do it. Since there are six faces, there are ways color the cube. ↩
- It actually makes a stronger statement. For any , we have . ↩
- If you're paying close attention, you'll notice I'm waving my hands over the difference beteween "colorings fixed by " and "rotations that fix ". These end up summing to the same thing, and the intuition works either way, but it's an important point. If you want to prove to yourself why they add up to the same thing, imagine drawing a table where the rows are elements of , the columns are rotations in , and each cell has a checkmark depending on if is fixed by . It should be clear that the order of summation doesn't matter. ↩