Sets of Measure Zero and the Cantor Set
Learning objectives
- Construct the Cantor middle-thirds set as the limit of an iterated removal process
- Compute its Lebesgue measure and prove that it is zero
- Show that the Cantor set is uncountable via base-3 expansions, hence “small in measure, large in cardinality”
- Identify the topological properties of the Cantor set: closed, compact, perfect, totally disconnected, nowhere dense
The Cantor set is a single example that demolishes most of the intuitions you brought from calculus. It is uncountable, the same cardinality as , yet has Lebesgue measure zero. It is closed and compact, yet contains no intervals. It is the prototypical fractal: self-similar at every scale. Every counterexample-driven argument in real analysis eventually traces back to this set or a relative of it, and it is a useful sanity check for any new conjecture: "does it still work for the Cantor set?"
The Cantor set is intrinsically diagrammatic. We describe its self-similar structure in the prose below; an interactive Cantor-construction widget is not currently available in lang-core, so the construction is presented as a static description with explicit numerical checks.
The construction
Start with C_0 = [0, 1]. To get , remove the open middle third:
C_1 = [0, 1/3] \cup [2/3, 1]
To get , remove the open middle third of each of the two intervals in :
C_2 = [0, 1/9] \cup [2/9, 1/3] \cup [2/3, 7/9] \cup [8/9, 1]
Continue indefinitely. The Cantor set is the intersection:
C = \bigcap_{n=0}^\infty C_n
At step , the set consists of closed intervals each of length . The total length of is , which tends to as .
The Cantor set has measure zero
The set is the nested intersection of the , so for every . Sending :
So . Equivalently, the total length removed is
\sum_{n=0}^\infty \frac{2^n}{3^{n+1}} = \frac{1/3}{1 - 2/3} = 1,
which exhausts the original interval [0, 1] exactly.
The Cantor set is uncountable
Every real number x \in [0, 1] has a base-3 expansion with each . The middle-thirds construction removes exactly the numbers whose base-3 expansion is forced to contain a :
- removes numbers starting (where the cannot be avoided by choosing the alternate expansion).
- The next step removes numbers with a in the second digit, and so on.
What remains is precisely the set of numbers in [0, 1] that have a base-3 expansion using only digits and . Map (with ) to where . This is a surjection C \to [0, 1], so |C| \geq |[0, 1]| = |\mathbb{R}|. Since C \subseteq [0, 1], we get . The Cantor set is uncountable.
Topology: closed, compact, perfect, totally disconnected
Each is a finite union of closed intervals, hence closed. The intersection of closed sets is closed, so is closed. Being a closed and bounded subset of , is compact (Heine-Borel). It is perfect, every point of is a limit point, because the endpoints of the intervals at every level are arbitrarily close to other points of . It is totally disconnected: between any two distinct points of lies a removed open interval, so contains no interval. Finally, it is nowhere dense: its closure (itself) has empty interior.
- Fractal geometry: The Cantor set has Hausdorff dimension , a fractional dimension, the prototype example of fractal geometry. It is the simplest non-trivial example used to teach scaling and self-similarity.
- Dynamical systems: The attractor of the doubling map on the circle, restricted to certain invariant subsets, is a Cantor set. Strange attractors in chaotic systems (the Smale horseshoe) are Cantor-set-like in cross-section.
- Probability and percolation: The set of paths in a percolation cluster at criticality, after suitable scaling, often has Cantor-like measure-zero limits. The classical Cantor distribution, uniform on , is the canonical "singular" probability distribution, neither discrete nor absolutely continuous.
- Null events in probability: The event "an infinite coin-flip sequence avoids every other digit in the limit" has Cantor-set probability structure. Measure-zero events that are nonetheless not impossible are how probabilists discuss zero-probability outcomes.
Pause and think: Is the number in the Cantor set? (Hint: write in base . It is . Only digits and appear, so yes.) What does this tell you about the structure of : does it have any nonempty interior as a subset of ? (Equivalently: does it contain any open interval, however small?)
Try it
- Predict first: after step of the Cantor construction, how many disjoint intervals does have, and what is the total length? (Hint: intervals of length .)
- Show that the endpoints of the construction intervals are all in . Is the cardinality of these endpoints countable or uncountable?
- "Fat Cantor set" challenge: modify the construction to remove the middle fraction at step (instead of always the middle third). Show that the resulting set has positive Lebesgue measure but is still closed and nowhere dense. (Hint: compute the geometric sum of removed lengths.)
- True or false: the Cantor set is countable because it consists of "endpoints" of removed intervals. (Hint: this is wrong, the endpoints are countable but they are a strict subset of .)
- Show that the Cantor function (the "devil's staircase"), a continuous, non-decreasing function from [0, 1] to [0, 1] that is constant on every removed interval, has derivative zero almost everywhere yet rises from to . (Sketch only: the derivative is zero on the complement of , which has full measure.)
A trap to watch for
It is tempting to think the Cantor set "is" the endpoints, the points that mark the construction stages. These endpoints form a countable subset of . But itself is uncountable: most of its points are not endpoints, but interior limit points of the infinite nesting. The number is in and never appears as an endpoint at any finite stage. Confusing the set of endpoints with the Cantor set is the single most common source of "the Cantor set is countable" errors in student work.
What you now know
You can construct the Cantor set, prove it has Lebesgue measure zero, prove it is uncountable, and list its core topological properties. More broadly, you understand that "small in measure" and "small in cardinality" are independent notions, the Cantor set is the canonical example. The next section finally builds the Lebesgue integral on top of this measure theory, the modern replacement for the Riemann integral.
Mark section complete →
References
- Garrity, T. (2002). All the Mathematics You Missed: But Need to Know for Graduate School. Cambridge University Press, ch. 12.
- Royden, H. L., Fitzpatrick, P. M. (2010). Real Analysis (4th ed.). Pearson, ch. 2.
- Folland, G. B. (1999). Real Analysis: Modern Techniques and Their Applications (2nd ed.). Wiley, ch. 1.
- Falconer, K. (2014). Fractal Geometry: Mathematical Foundations and Applications (3rd ed.). Wiley, ch. 2.
- Stein, E. M., Shakarchi, R. (2005). Real Analysis. Princeton UP, ch. 1.