In a first course in differential geometry/topology we are often taught the following version of Sard's theorem:
Let $f:\mathbb{R}^m \supset M\to N\subset \mathbb{R}^n$ be a mapping of class $C^k$ (all partial derivatives up to order $k$ exists and are continuous) for $k \geq 1$. Let $S\subset N$ be defined as above. And if either
- $m \leq n$; or
- $m > n$ and $k \geq m - n + 1$,
then $S$ has measure 0.
Over at MathOverflow, Sergei Ivanov has shown that in the case $m = n = 1$, the statement can be modified so that $C^1$ is not necessary.
In this post, we will describe counterexamples in the cases $m > n$ if the condition on $k$ is not satisfied. We will also sketch a natural generalisation of Ivanov's argument for higher dimensions.
Extremely degenerate critical points
The argument given by Ivanov was implicitly contained in Sard's original 1942 paper. It contains the following observation based on Taylor's Theorem:
Now we prove
Ivanov's argument is the special case $m = n = k = 1$. Note that we did not assume that $f$ is $C^k$; we merely assumed $k$ times differentiability (and the vanishing of the top $k$ derivatives at the "critical" points.
Low regularity counterexample
The counterexample to Sard's theorem when $k$ is too small was already constructed by Hassler Whitney 7 years prior1. It is an interesting application of the Whitney extension theorem.
In view of the extension theorem, to construct a counterexample to Sard's theorem in dimensions $m = 2, n = 1, k = 1$ it suffices to construct a closed subset $A$ of $\mathbb{R}^2$ on which $f$ is non-constant but is compatible with having vanishing partial derivatives everywhere. (That this is possible is due to the fact that along non-rectifiable curves, the value of $f$ at two points cannot necessarily be represented by "an integral of the derivative".)
To be more precise, it suffices that we construct a set $A\subset \mathbb{R}^2$ and a function $f:A \to\mathbb{R}$ such that
- $A$ is closed
- $f(A)$ has positive measure
- for $x,x'\in A$ we have that $|f(x) - f(x')| = o(|x - x'|)$
The third condition ensures that the function $f$ is compatible with having vanishing derivatives everywhere along $A$. Note that the Whitney extension of such a function $f$ would necessarily have the set $S$ containing $f(A)$, which has positive measure.
As with often the case, the construction of $A$ and $f$ relies on a self-similar procedure that is familiar from fractal constructions. Indeed, a self-similar construction allows for $|f(x) - f(x')|$ and $|x - x'|$ to scale differently, which is also behind such constructions as the Cantor staircase.
We start by taking the unit square and removing four smaller squares from it, each of the four squares have side length $\alpha$. On the remaining set we draw line segments connecting the midpoints of the squares as shown in red. This gives our construction at step 1. We iterate now inside each of the small squares by taking a copy of the step 1 construction, scaling it by $\alpha$, and pasting it inside so the path connects. We show the procedure for one of the small squares in blue.
The set $A$ will consist of the closure of the union of the red paths with all of its scaled copies. It is clear that this is a closed, connected set that is topologically identical to the closed unit interval. We explicitly parametrise the set $A$ as follows: divide the unit interval into nine equal portions. The odd numbered portions correspond to the red parts in the image above. The even numbered portions correspond to the portions within the small squares. Each of the even numbered portions we divide into nine parts and repeat. In other words, we use a construction similar to the Cantor staircase. Express the numbers in the interval $[0,1]$ in base 9. Let $p:(0,1)\to \mathbb{N}\cup \infty$ be the function that assigns to a number $s$ the position (after the "decimal" point) in base nine of the first appearance of a even "digit". So $p(0.21345\ldots_9) = 1$, and $p(0.1357802\ldots_9) = 5$. Then we can conclude that $x$ belongs to one of the line segments added at the $p(x)$th step of the construction if $p(x)$ is finite. The points where $p(x)$ are infinite are the limit points added during the closure.
To define $f$ on $A$, first we send $f(0) = 0$ and $f(1) = 1$. For $x\in (0,1)$, we consider its base-9 expansion and truncate it after the first even digit. So $0.13572841\ldots_9 \mapsto 0.13572$. Now that the remaining digits are all odd except the last, we replace them by the rule $n \mapsto (n-1)/2$; the last digit we replace by $n\mapsto n/2$. So that $0.13572 \mapsto 0.01231$. This last number we interpret in base-4 (where $0.\cdots \beta 4_4 = 0.\cdots (\beta+1)_4$ by convention), which we regard as the value of $f(x)$. So as a function on $[0,1]$, the construction of $f$ is exactly analogous to the Cantor staircase function, and is uniformly continuous but not absolutely continuous. It is almost everywhere constant and not differentiable at the "limit points" when considered as a function on $[0,1]$. On the other hand, its differentiability property improves when considered as a function on $A$, because of the change of scale built into the construction.
We see immediately on each of the line segments $f$ is constant. Furthermore, a bit more reflection shows that if $\ell$ and $\ell'$ are two line segments of $A$ which touch each other, we have that $f(\ell) = f(\ell')$. This implies that if $x\neq x'$ are such that $f(x)\neq f(x')$, either (a) at least one of $x,x'$ has $p$ that is infinite, or (b) there is some $x_0$ between them with $p(x_0) = \infty$. If $x, x'$ in base-9 expansion differs at the $j$th digit, the difference of their base-9 expansions must be strictly greater than $9^{-j-1}$. Or, in other words, after the $j+1$th stage of the construction of $A$, the two points must be separated by one whole step. This means that the planar distance $|x-x'|$ scales like $\alpha^{j}$. On the other hand the difference $|f(x) - f(x')|$ scales like $4^{-j}$, hence we have that $|f(x) - f(x')| = O(|x - x'|^{\log_\alpha 1/4})$. So if $\alpha$ is chosen to be $\alpha > 1/4$, we would have that $|f(x) - f(x')| = o(|x - x'|)$ as desired. (In Whitney's original paper, $\alpha$ is chosen to be exactly a third.)
Note that the "path length" of the set $A$ is precisely $r\sum_j (4\alpha)^j$ where $r$ is the total length of the red segments in the image above. We see that when $\alpha < \frac14$ the path length is finite. This means that $A$ represents a rectifiable path and hence we can express $f$ as an integral of its derivatives along $A$, and hence the desired property is not possible. (Or, in other words, we cannot "cancel" the singularity in the Cantor staircase with singularities in the embedding [as there are none].) When $\alpha > \frac14$ we see that "path length" diverges strongly. This non-rectifiability is what allows us to finish this construction.
Lastly, as a sanity check, we can make sure that $f$ cannot be extended to a $C^2$ function with first derivative vanishing everywhere along $A$. Were that to be possible we need $|f(x) - f(x')| = o(|x - x'|^2)$. From the construction above we see that this requires $\alpha^2 > 1/4$. But to fit four identical squares into a bigger one we need necessarily $\alpha < 1/2$, and so $C^2$ is not possible, in agreement with Sard's theorem.