Differential Geometry (6): Smooth Manifolds — Geometry Beyond Embedded Surfaces

The first five chapters of this series lived inside $\mathbb{R}^3$ . We had curves and surfaces, parametrized explicitly, with all the geometric data — first and second fundamental forms, principal curvatures, Christoffel symbols, the Theorema Egregium, Gauss-Bonnet — built up from coordinates we could write down. The Theorema Egregium revealed that the intrinsic story can be told without reference to the embedding. But “without reference to the embedding” still meant “the embedding exists; we just choose not to use it.”

This chapter takes the next step: we cut the surface loose from $\mathbb{R}^3$ entirely. Smooth manifolds are spaces that locally look like $\mathbb{R}^n$ but need not be subsets of any larger space. They are the natural setting for general relativity, gauge theory, the topology of moduli spaces, and most of modern geometry. The price is some upfront axiomatics; the payoff is a framework that scales to arbitrary dimensions and arbitrary topologies, handling everything from 4-dimensional spacetime to infinite-dimensional function spaces.

Once you accept the abstract framework, everything from the previous chapters becomes a special case. Curves are 1-manifolds. Surfaces are 2-manifolds (embedded in $\mathbb{R}^3$ or not). The intrinsic apparatus — metric, Christoffel symbols, geodesics, curvature — generalizes seamlessly. The extrinsic apparatus (shape operator, Gauss map, principal curvatures) does not, because there is no “outside” to compare to. This is a feature, not a bug: general relativity needs geometry on 4-dimensional spacetime, which does not sit inside any physical 5-dimensional space.

Charts, Atlases, and the Definition of a Smooth Manifold#

Stereographic projection: covering the circle with two charts

The intuition behind a manifold: a space that, near any point, looks indistinguishable from a patch of $\mathbb{R}^n$ . The surface of the Earth looks flat from ground level — this is the local-Euclidean property. But globally it wraps around, has no edges, and is compact. A manifold is the mathematical framework that captures this: locally Euclidean, globally possibly complicated.

A topological $$n$$ -manifold is a topological space $$M$$ satisfying three conditions: (1) Hausdorff — any two distinct points have disjoint neighborhoods; (2) second-countable — the topology has a countable basis; (3) locally Euclidean of dimension $$n$$ — every point $p \in M$ has an open neighborhood $$U$$ homeomorphic to an open subset of $\mathbb{R}^n$ . A pair $(U, \varphi)$ where $\varphi: U \to V \subseteq \mathbb{R}^n$ is a homeomorphism is called a coordinate chart. The map $\varphi$ assigns $$n$$ real coordinates to each point in $$U$$ .

Charts and atlas making a topological space into a smooth manifold

A single chart rarely covers the whole manifold. The sphere $$S^2$$ is the canonical example: it is compact, but any chart maps to an open subset of $\mathbb{R}^2$ (which cannot be compact), so no single chart suffices. We need multiple charts whose domains cover $$M$$ . Where two charts $(U_\alpha, \varphi_\alpha)$ and $(U_\beta, \varphi_\beta)$ overlap, the transition map $\varphi_\beta \circ \varphi_\alpha^{-1}: \varphi_\alpha(U_\alpha \cap U_\beta) \to \varphi_\beta(U_\alpha \cap U_\beta)$ moves between the two coordinate descriptions of the overlap. This is a map from one open subset of $\mathbb{R}^n$ to another — a familiar object from multivariable calculus — and we can ask about its differentiability.

Transition map between two overlapping charts

A smooth atlas is a collection of charts $\{(U_\alpha, \varphi_\alpha)\}$ covering $$M$$ such that all transition maps are $C^\infty$ (infinitely differentiable). A smooth structure on $$M$$ is a maximal smooth atlas — one that contains every chart smoothly compatible with the given ones. A smooth manifold is a topological manifold equipped with a smooth structure.

The two technical conditions (Hausdorff, second-countable) exclude pathological spaces while keeping all geometrically natural examples. Without Hausdorff: the “line with two origins” (two copies of $\mathbb{R}$ identified everywhere except at $$0$$ ) is locally Euclidean but not a manifold — it has two “versions” of zero that cannot be separated by neighborhoods. Without second-countable: the “long line” (an uncountable well-ordered sequence of unit intervals) is Hausdorff and locally Euclidean but too large to be useful — it cannot embed in any $\mathbb{R}^N$ . Both conditions are necessary for partitions of unity to exist (a technical tool we will need shortly).

This definition captures exactly what is needed for calculus on $$M$$ . A function $f: M \to \mathbb{R}$ is smooth if $f \circ \varphi^{-1}: V \to \mathbb{R}$ is smooth (as an ordinary function on an open subset of $\mathbb{R}^n$ ) for every chart $\varphi$ . The smooth compatibility of transition maps guarantees that this is well-defined: if $f \circ \varphi_\alpha^{-1}$ is smooth and the transition map $\varphi_\beta \circ \varphi_\alpha^{-1}$ is smooth, then $f \circ \varphi_\beta^{-1} = (f \circ \varphi_\alpha^{-1}) \circ (\varphi_\alpha \circ \varphi_\beta^{-1})$ is smooth by composition. No chart is privileged; smoothness is a chart-independent notion.

A concrete grounding: take $$S^2$$ with stereographic projection from the north pole, $\varphi_N(p_1, p_2, p_3) = (p_1/(1-p_3), p_2/(1-p_3))$ , and from the south pole, $\varphi_S(p_1, p_2, p_3) = (p_1/(1+p_3), p_2/(1+p_3))$ . The transition map on the overlap $S^2 \setminus \{N, S\}$ is the inversion $(u, v) \mapsto (u, v)/(u^2 + v^2)$ , smooth on $\mathbb{R}^2 \setminus \{0\}$ . Two charts, one transition map. The sphere is a smooth 2-manifold.

Examples That Ground the Abstraction#

$\mathbb{R}^n$ and its open subsets. The identity map gives a single global chart. Open subsets inherit the smooth structure. Example: $\mathrm{GL}(n, \mathbb{R}) = \{A \in M_n(\mathbb{R}) : \det A \neq 0\}$ is an open subset of $\mathbb{R}^{n^2}$ (since $\det$ is continuous and $\mathrm{GL}(n)$ is the preimage of $\mathbb{R} \setminus \{0\}$ ), hence a smooth manifold of dimension $$n^2$$ . The space of invertible matrices is a manifold — this is the starting point for Lie group theory.

Two-chart atlas for the 2-sphere from stereographic projection

The sphere $$S^n$$ . As above, two stereographic charts suffice. More generally, the $$n$$ -sphere is defined as $\{x \in \mathbb{R}^{n+1} : |x|^2 = 1\}$ , and it gets its smooth structure either from stereographic charts or from the regular value theorem (see below). It is compact, orientable, and simply connected for $n \geq 2$ .

The torus $T^n = \mathbb{R}^n / \mathbb{Z}^n$ . Identify points that differ by integer vectors. Charts are small open cubes in $\mathbb{R}^n$ (smaller than the period), projected to the quotient. Transition maps are translations (trivially smooth). The 2-torus is a compact orientable surface of genus 1 with Euler characteristic 0.

Real projective space $\mathbb{RP}^n$ . The set of lines through the origin in $\mathbb{R}^{n+1}$ , equivalently $$S^n$$ with antipodal points identified. Charts: where $x_i \neq 0$ , use the $$n$$ ratios $$x_j/x_i$$ ( $j \neq i$ ) as coordinates. Transition maps are rational functions (smooth on their domains). $\mathbb{RP}^n$ is compact; it is non-orientable for $$n$$ even and orientable for $$n$$ odd.

Lie groups. A Lie group is a smooth manifold $$G$$ that is simultaneously a group, with smooth multiplication $G \times G \to G$ and smooth inversion $G \to G$ . The prototypes: $\mathrm{GL}(n, \mathbb{R})$ (dimension $$n^2$$ ), $\mathrm{O}(n)$ (dimension $$n(n-1)/2$$ , the orthogonal matrices), $\mathrm{SO}(n)$ (the connected component of $\mathrm{O}(n)$ containing the identity), $\mathrm{U}(n)$ (dimension $$n^2$$ , unitary matrices), $\mathrm{SU}(n)$ (dimension $$n^2 - 1$$ ). The rotation group $\mathrm{SO}(3)$ is a 3-manifold diffeomorphic to $\mathbb{RP}^3$ . The gauge group of the Standard Model of particle physics is $\mathrm{U}(1) \times \mathrm{SU}(2) \times \mathrm{SU}(3)$ , a 12-dimensional manifold.

Configuration and phase spaces. The configuration space of a rigid body in 3D is $\mathbb{R}^3 \times \mathrm{SO}(3)$ (position of center of mass times orientation), a 6-manifold. The phase space of a system of $$n$$ particles is $T^*(\mathbb{R}^{3n}) \cong \mathbb{R}^{6n}$ , a $$6n$$ -dimensional manifold. Classical mechanics lives on these spaces, and Hamiltonian mechanics requires their manifold structure (specifically, a symplectic structure on the phase space).

Grassmannians and moduli spaces. The Grassmannian $\mathrm{Gr}(k, n)$ — the set of all $$k$$ -dimensional subspaces of $\mathbb{R}^n$ — is a smooth manifold of dimension $$k(n-k)$$ . For $$k = 1$$ , it is projective space $\mathbb{RP}^{n-1}$ . The Grassmannian appears in optimization (low-rank matrix approximation), in control theory (as a state space), and in algebraic geometry (as a parameter space for linear subspaces). It is a compact manifold with a rich geometric structure.

Moduli spaces — parameter spaces for geometric objects — are among the most important manifolds in modern mathematics. The moduli space of flat connections on a surface, the moduli space of Riemann surfaces of genus $$g$$ , the moduli space of instantons on a 4-manifold: these are all smooth manifolds (or orbifolds, if symmetry creates singularities), and their geometry encodes deep information about the objects they parametrize. Donaldson’s revolutionary work on 4-manifolds (1983) extracted topological invariants from the geometry of moduli spaces of instantons. Witten’s topological quantum field theories reinterpret these invariants in physical language.

Surfaces as 2-manifolds. Every regular surface $S \subset \mathbb{R}^3$ from the previous chapters is a smooth 2-manifold (the parametrization patches are charts, and their overlap maps are smooth). But now we can also consider abstract 2-manifolds that do not embed in $\mathbb{R}^3$ : the flat torus $\mathbb{R}^2/\mathbb{Z}^2$ with its flat metric (it admits no smooth isometric embedding in $\mathbb{R}^3$ , only a $$C^1$$ one by Nash-Kuiper), or exotic surfaces constructed by gluing polygons with unusual identifications.

One more illuminating example: the Klein bottle. Take a square $$[0,1]^2$$ and identify the top edge with the bottom edge (same direction) and the left edge with the right edge (opposite direction). The resulting space is a compact 2-manifold that is non-orientable — it has no consistent “inside” or “outside.” It cannot be embedded in $\mathbb{R}^3$ without self-intersection, but it is a perfectly well-defined smooth 2-manifold (charts are small patches of the fundamental square, and transition maps at the identified edges include a reflection). Its Euler characteristic is $\chi = 0$ , and by Gauss-Bonnet it admits a flat metric. The Klein bottle exists as an abstract manifold even though we cannot “see” it embedded without crossings in 3-dimensional space. This illustrates the power of the abstract framework: geometric objects that are awkward to embed can be studied with the same tools as spheres and tori, once we accept charts and atlases as the fundamental language.

Tangent Vectors as Derivations#

On a surface in $\mathbb{R}^3$ , a tangent vector at $$p$$ is a velocity vector $\gamma'(0)$ of some curve through $$p$$ — it lives in the ambient $\mathbb{R}^3$ and happens to be tangent to the surface. For abstract manifolds with no ambient space, we need a purely intrinsic definition.

Animation: transitioning between chart coordinates

The insight: a tangent vector at $$p$$ is completely characterized by its action as a directional derivative. Given any smooth function $f: M \to \mathbb{R}$ , a tangent vector $$v$$ at $$p$$ produces a number $v(f) \in \mathbb{R}$ — the rate of change of $$f$$ in the direction $$v$$ . This number is linear in $$f$$ and satisfies the Leibniz (product) rule: $$v(fg) = f(p)v(g) + g(p)v(f)$$ .

We define a tangent vector at $$p$$ to be a linear map $v: C^\infty(M) \to \mathbb{R}$ satisfying the Leibniz rule. Such a map is called a derivation at $$p$$ . The set of all derivations at $$p$$ forms a real vector space, the tangent space $$T_pM$$ .

In a chart $(U, \varphi)$ with coordinates $(x^1, \ldots, x^n)$ , the partial derivative operators $\partial/\partial x^i|_p$ defined by $(\partial/\partial x^i|_p)(f) = \partial(f \circ \varphi^{-1})/\partial x^i|_{\varphi(p)}$ are derivations at $$p$$ . They form a basis for $$T_pM$$ (one can prove this rigorously), so $\dim T_pM = n$ . Every tangent vector $$v$$ has a unique expansion $v = \sum_i v^i \,\partial/\partial x^i|_p$ with real coefficients $$v^i$$ .

Under a change of coordinates $x \mapsto \tilde{x}$ , the components transform by the Jacobian: $\tilde{v}^j = \sum_i (\partial\tilde{x}^j/\partial x^i) v^i$ . This is the classical “contravariant vector” transformation law. The derivation definition packages it intrinsically: you never write down the Jacobian explicitly; it emerges from the chain rule when you compute $$v(f)$$ in different charts.

The equivalence with the “curve velocity” picture: given a smooth curve $\gamma: (-\varepsilon, \varepsilon) \to M$ with $\gamma(0) = p$ , define $v_\gamma(f) = (f \circ \gamma)'(0)$ . This is a derivation at $$p$$ (linearity is clear; Leibniz follows from the product rule for real functions). Two curves give the same derivation iff they have the same coordinate velocity in any chart. So tangent vectors are equivalence classes of curves, or derivations, or $$n$$ -tuples-that-transform-by-the-Jacobian — three equivalent viewpoints, used interchangeably depending on context.

Why the derivation approach wins for abstract manifolds: it requires nothing beyond the smooth structure. No ambient space, no curves “in $\mathbb{R}^3$ ”, no embeddings. Just smooth functions and their directional derivatives. The entire tangent space is built from the algebra $C^\infty(M)$ alone.

A worked example to make this concrete. On $$S^2$$ with stereographic coordinates $$(u, v)$$ , consider the derivation $v = 3\,\partial/\partial u|_p + 2\,\partial/\partial v|_p$ at the point $p = \varphi_N^{-1}(1, 0)$ (some specific point on the sphere). For a function $$f(p_1, p_2, p_3) = p_1^2 + p_2^2$$ (the squared distance from the $$z$$ -axis), we have $f \circ \varphi_N^{-1}(u,v) = 4(u^2 + v^2)/(1 + u^2 + v^2)^2$ . Computing $v(f) = 3\,\partial_u(f \circ \varphi_N^{-1})|_{(1,0)} + 2\,\partial_v(f \circ \varphi_N^{-1})|_{(1,0)}$ gives a specific real number — the directional derivative of $$f$$ in the direction $$v$$ at $$p$$ . The answer is intrinsic to the sphere: it does not depend on how the sphere sits in $\mathbb{R}^3$ , only on the smooth structure.

The beauty of this formalism is that it works identically for a 100-dimensional manifold, where there is no hope of geometric visualization. You pick a chart, write down coordinate partial derivatives, express your tangent vector as a linear combination, and compute. The abstract definition guarantees that the result is meaningful (independent of chart) without requiring any ambient space to “live in.”

Smooth Maps, the Differential, and Submanifolds#

If $F: M \to N$ is a smooth map between manifolds, the differential (or pushforward) of $$F$$ at $$p$$ is the linear map $dF_p: T_pM \to T_{F(p)}N$ defined by $(dF_p(v))(g) = v(g \circ F)$ for all $g \in C^\infty(N)$ . In coordinates: $$dF_p$$ is represented by the Jacobian matrix $(\partial F^j/\partial x^i)$ , exactly as in multivariable calculus.

The differential captures how $$F$$ acts “infinitesimally”: tangent vectors at $$p$$ (infinitesimal displacements) map linearly to tangent vectors at $$F(p)$$ . It is the manifold analog of the total derivative. The chain rule holds: $d(G \circ F)_p = dG_{F(p)} \circ dF_p$ . The differential of the identity is the identity on tangent spaces.

Three important classes of smooth maps, defined by the rank of $$dF_p$$ :

Immersion: $$dF_p$$ injective at every $$p$$ . The map does not collapse any tangent directions. Example: a figure-eight curve in the plane is an immersion of $$S^1$$ (but not an embedding, since it self-intersects). A surface patch $\mathbf{x}: U \subset \mathbb{R}^2 \to \mathbb{R}^3$ is an immersion when $\mathbf{x}_u \times \mathbf{x}_v \neq 0$ .
Submersion: $$dF_p$$ surjective at every $$p$$ . The map “hits all directions” in the target. Example: the height function $$f(x,y,z) = z$$ on $\mathbb{R}^3$ is a submersion (its derivative $$(0,0,1)$$ is always surjective onto $\mathbb{R}$ ).
Diffeomorphism: $$F$$ is a bijection with both $$F$$ and $F^{-1}$ smooth. Then $$dF_p$$ is an isomorphism at every point. Diffeomorphic manifolds are “the same” smooth manifold.

The regular value theorem (a consequence of the implicit function theorem, one of the workhorses of differential topology): if $c \in N$ is a regular value of $F: M \to N$ (meaning $$dF_p$$ is surjective for every $p \in F^{-1}(c)$ ), then $F^{-1}(c)$ is a smooth submanifold of $$M$$ with dimension $\dim M - \dim N$ .

This is how most manifolds arise in practice. The sphere: $S^n = f^{-1}(1)$ where $f: \mathbb{R}^{n+1} \to \mathbb{R}$ , $$f(x) = |x|^2$$ . Since $df_x = 2x^T \neq 0$ when $x \neq 0$ , and all points on $$S^n$$ have $|x| = 1 \neq 0$ , the value 1 is regular, so $$S^n$$ is a smooth manifold of dimension $$n$$ . The orthogonal group: $\mathrm{O}(n) = F^{-1}(I)$ where $F(A) = A^TA: M_n(\mathbb{R}) \to \mathrm{Sym}_n(\mathbb{R})$ . One checks that $$I$$ is a regular value, giving $\dim \mathrm{O}(n) = n^2 - n(n+1)/2 = n(n-1)/2$ . The special linear group: $\mathrm{SL}(n, \mathbb{R}) = \det^{-1}(1)$ , which has dimension $$n^2 - 1$$ (the determinant is a submersion at every invertible matrix). All these arise “for free” from the regular value theorem — no explicit charts needed.

Sard’s theorem adds: the set of critical values (where $$dF$$ fails to be surjective) has Lebesgue measure zero in $$N$$ . So “almost every” value is regular, and level sets are “generically” smooth submanifolds. This is the hidden engine of differential topology: transverse behavior is generic; pathologies require special arrangement and are negligible in measure.

The Tangent Bundle, Vector Fields, and Flows#

Assemble all tangent spaces into one object: the tangent bundle $TM = \bigsqcup_{p \in M} T_pM$ . A point of $$TM$$ is a pair $$(p, v)$$ with $p \in M$ and $v \in T_pM$ . The tangent bundle is itself a smooth manifold of dimension $$2n$$ : in a chart $(U, \varphi = (x^1, \ldots, x^n))$ on $$M$$ , coordinates on $$TU$$ are $(x^1, \ldots, x^n, v^1, \ldots, v^n)$ where $v = \sum v^i\partial/\partial x^i$ .

The projection $\pi: TM \to M$ , $\pi(p, v) = p$ , makes $$TM$$ a vector bundle of rank $$n$$ — the fibers $\pi^{-1}(p) = T_pM$ are vector spaces varying smoothly with $$p$$ .

A vector field on $$M$$ is a smooth section of $\pi$ : a smooth map $X: M \to TM$ with $\pi \circ X = \mathrm{id}_M$ . It assigns a tangent vector $X(p) \in T_pM$ to each point $$p$$ , varying smoothly. In coordinates: $X = \sum_i X^i(x)\,\partial/\partial x^i$ where the $$X^i$$ are smooth functions.

Vector fields generate flows. The integral curve of $$X$$ through $$p$$ is the solution $\gamma(t)$ to the ODE $\gamma'(t) = X(\gamma(t))$ with $\gamma(0) = p$ . By existence and uniqueness of ODEs, such a curve exists (at least locally). The flow $\phi_t: M \to M$ sends each point to its time- $$t$$ position along the integral curve. For small $$t$$ , $\phi_t$ is a diffeomorphism, and $\phi_{t+s} = \phi_t \circ \phi_s$ (it is a one-parameter group of diffeomorphisms). Vector fields are the infinitesimal generators of symmetries.

The Lie bracket $$[X, Y]$$ of two vector fields measures the failure of their flows to commute: $$[X, Y](f) = X(Y(f)) - Y(X(f))$$ . In coordinates: $[X, Y]^k = \sum_i(X^i\partial_i Y^k - Y^i\partial_i X^k)$ . The set of all smooth vector fields $\mathfrak{X}(M)$ with the Lie bracket is an infinite-dimensional Lie algebra — the algebraic structure encoding “infinitesimal transformations” of the manifold.

For Lie groups, the left-invariant vector fields (invariant under left multiplication) form a finite-dimensional Lie subalgebra $\mathfrak{g}$ , the Lie algebra of $$G$$ . This linearizes the group structure: the nonlinear multiplication law of $$G$$ is captured, to first order, by the linear Lie bracket on $\mathfrak{g}$ . The Lie algebra of $\mathrm{SO}(3)$ is the space of $3 \times 3$ skew-symmetric matrices with the commutator bracket — this is the algebra of angular velocities in classical mechanics.

A concrete example to ground the abstraction: consider the vector field $X = -y\,\partial/\partial x + x\,\partial/\partial y$ on $\mathbb{R}^2$ . Its integral curves solve $\dot x = -y$ , $\dot y = x$ , which gives circles: $\gamma(t) = (r\cos(t + \theta_0), r\sin(t + \theta_0))$ . The flow is $\phi_t(x,y) = (x\cos t - y\sin t, x\sin t + y\cos t)$ — rotation by angle $$t$$ . So the vector field $$X$$ generates the one-parameter group of rotations of the plane. This is the prototypical example of “vector field as infinitesimal symmetry.”

On $$S^2$$ , consider the vector fields generating rotations about the three coordinate axes: $$X_1$$ , $$X_2$$ , $$X_3$$ . Their Lie brackets satisfy $$[X_1, X_2] = X_3$$ , $$[X_2, X_3] = X_1$$ , $$[X_3, X_1] = X_2$$ — the commutation relations of $\mathfrak{so}(3)$ . The flows of these vector fields are the rotations of the sphere about the respective axes. The non-commutativity of the Lie bracket ( $[X_1, X_2] \neq 0$ ) reflects the non-commutativity of rotations: rotating first about $$x$$ then about $$y$$ gives a different result than rotating first about $$y$$ then about $$x$$ . The Lie bracket captures this non-commutativity at the infinitesimal level.

Partitions of Unity and Global Constructions#

A piece of abstract technology that makes the manifold framework operational: partitions of unity.

Given an open cover $\{U_\alpha\}$ of $$M$$ , a partition of unity subordinate to this cover is a collection of smooth functions $\rho_\alpha: M \to [0, 1]$ such that: the support of $\rho_\alpha$ lies in $U_\alpha$ ; the collection is locally finite (each point has a neighborhood meeting only finitely many supports); and $\sum_\alpha \rho_\alpha(p) = 1$ for all $p \in M$ .

Theorem. On any smooth manifold (Hausdorff, second-countable), partitions of unity exist for any open cover.

Why this matters: partitions of unity are the glue that converts local constructions into global ones. The most important application:

Every smooth manifold admits a Riemannian metric. Proof: in each chart $U_\alpha$ , define the flat metric $g_\alpha = \delta_{ij}\,dx^i\,dx^j$ (the standard Euclidean inner product on tangent vectors in those coordinates). Use a partition of unity to combine: $g = \sum_\alpha \rho_\alpha \,g_\alpha$ . This is a smooth, symmetric, positive-definite bilinear form on each tangent space (positive-definiteness survives convex combination with non-negative weights summing to 1). So $$g$$ is a Riemannian metric on all of $$M$$ .

This means the entire apparatus of chapters 2-5 — distances, geodesics, Christoffel symbols, curvature, Gauss-Bonnet — is available on any smooth manifold, once we choose a metric. The manifold provides the smooth structure; the metric provides the geometry. The two are logically independent: the same manifold admits many different metrics, and the geometry changes with the choice.

Other applications of partitions of unity: constructing smooth bump functions (1 on a closed set, 0 outside a neighborhood), extending smooth functions from submanifolds to the whole manifold, defining integration on non-compact manifolds, and proving that every manifold admits a countable cover by precompact chart domains.

A related existence result: the Nash embedding theorem (1956). While every manifold admits some smooth embedding in a Euclidean space (Whitney), Nash proved something much harder: every Riemannian manifold (manifold with a chosen metric) admits an isometric embedding — one that preserves the metric — in a sufficiently high-dimensional Euclidean space. The required dimension is much larger than Whitney’s $$2n+1$$ (Nash needs roughly $$n^2$$ dimensions for smooth isometric embedding). Nash’s theorem is a technical tour de force, proved using a hard implicit function theorem, and it shows that abstract Riemannian manifolds are always realizable as “surfaces” in high-dimensional flat space. But the embedding dimension is so large that it is rarely useful in practice — the abstract framework remains more natural for doing geometry.

The cotangent bundle $T^*M = \bigsqcup_p T_p^*M$ (dual to the tangent bundle) will also play a major role. A point of $$T^*M$$ is a pair $(p, \omega)$ where $\omega: T_pM \to \mathbb{R}$ is a linear functional. Sections of $$T^*M$$ are one-forms — the dual objects to vector fields. In coordinates, a one-form looks like $\omega = \sum_i \omega_i(x)\,dx^i$ , where the $$dx^i$$ are dual to $\partial/\partial x^i$ . The cotangent bundle carries a natural symplectic structure (chapter 9+), and it is the natural phase space of Hamiltonian mechanics: positions live in $$M$$ , momenta live in $$T^*_pM$$ .

The Whitney Embedding Theorem and the Philosophy of Abstraction#

A reassuring foundational result: the abstract framework does not produce spaces that are “too exotic” to visualize.

Theorem (Whitney, 1936). Every smooth $$n$$ -manifold can be smoothly embedded in $\mathbb{R}^{2n+1}$ (and immersed in $\mathbb{R}^{2n}$ ).

So every abstract manifold can be realized as a subset of a Euclidean space. But the embedding is not part of the structure — it is a convenience, not a necessity. Different embeddings give different extrinsic geometry (different normal vectors, different shape operators) but the same intrinsic geometry. The Theorema Egregium already taught us this lesson for surfaces; the manifold framework makes it systematic.

For the classification of manifolds by dimension:

Dimension 1: Every connected 1-manifold is diffeomorphic to $\mathbb{R}$ (non-compact) or $$S^1$$ (compact). Simple.
Dimension 2: Closed orientable surfaces are classified by genus. Closed non-orientable surfaces by a different invariant. Complete, classical.
Dimension 3: Perelman’s proof (2003) of Thurston’s geometrization conjecture gives a complete classification via 8 model geometries. Every closed 3-manifold decomposes into pieces, each carrying one of eight types of geometric structure.
Dimension 4: Rich and mysterious. Exotic smooth structures on $\mathbb{R}^4$ exist (Donaldson, Freedman, 1982-83) — uncountably many! The smooth 4-dimensional Poincare conjecture remains open. Dimension 4 is the “hardest” dimension.
Dimensions $\geq 5$ : Paradoxically more tractable. Surgery theory and the h-cobordism theorem (Smale, 1961) give systematic classification tools. The high-dimensional Poincare conjecture is proved.

The dimension-4 mystery deserves a word. In every other dimension, $\mathbb{R}^n$ admits a unique smooth structure (up to diffeomorphism). But $\mathbb{R}^4$ admits uncountably many distinct smooth structures — “exotic $\mathbb{R}^4$ ’s” that are homeomorphic to standard $\mathbb{R}^4$ but not diffeomorphic to it. This means there exist coordinate changes on $\mathbb{R}^4$ that are continuous but cannot be made smooth — a phenomenon that does not occur in any other dimension. The proof uses gauge theory (Donaldson’s work on instantons) and Freedman’s topological classification of 4-manifolds. Dimension 4 is the unique dimension where the topology is rich enough to support exotic smooth structures and the dimension is low enough that surgery theory does not apply to simplify things.

The philosophical upshot: the smooth manifold concept is the correct framework for geometry beyond embedded surfaces. It handles arbitrary dimensions, arbitrary topologies, and arbitrary applications (physics, data science, optimization on constraint surfaces) uniformly. The upfront investment in axiomatics pays off permanently.

A closing thought on the relationship between the abstract and the concrete. Every computation on a manifold ultimately happens in a chart — in coordinates, the calculation looks like ordinary multivariable calculus. The manifold framework does not eliminate coordinate calculations; it provides a clean answer to the question “what do these calculations mean?” When you compute Christoffel symbols on a sphere using spherical coordinates, you are doing a chart-dependent calculation. The manifold tells you: the result has geometric meaning provided it transforms correctly under change of charts. The smooth structure is exactly what guarantees this transformation behavior. After enough practice, the formalism becomes invisible — you think of the geometry directly, and the charts are a computational convenience rather than a conceptual crutch.

A concrete extended example to cement this: consider the function $f: S^2 \to \mathbb{R}$ given by $$f(p) = p_3$$ (the height function — the $$z$$ -coordinate of the point). In the stereographic chart from the north pole, $f \circ \varphi_N^{-1}(u,v) = (u^2 + v^2 - 1)/(u^2 + v^2 + 1)$ . The partial derivatives are $\partial_u f = 4u/(u^2 + v^2 + 1)^2$ and $\partial_v f = 4v/(u^2 + v^2 + 1)^2$ . At the origin (corresponding to the south pole, where $$f = -1$$ ), both partials vanish: the south pole is a critical point of $$f$$ (specifically, a minimum). At $u^2 + v^2 \to \infty$ (approaching the north pole), $f \to 1$ : the north pole is a maximum. Between them, there are no other critical points — $$f$$ is a Morse function with exactly two critical points. This is the minimum possible for a function on $$S^2$$ , forced by the Euler characteristic: $\chi(S^2) = 2 = (\text{number of maxima}) - (\text{number of saddles}) + (\text{number of minima}) = 1 - 0 + 1$ .

This connects the smooth manifold framework back to the Gauss-Bonnet theorem of the previous chapter: the Euler characteristic, computed there as $\int K\,dA / (2\pi)$ , also constrains the critical-point structure of functions on the manifold (Morse theory). The same topological invariant governs both curvature and function behavior — the two are reflections of the same underlying topology.

Deeper Examples and Common Pitfalls#

The earlier sections defined manifolds via charts and atlases, tangent vectors as derivations, smooth maps and submanifolds, and the Whitney embedding theorem. This section computes specific examples in detail, points out where beginners stumble, and shows where the abstract definitions pay off.

A worked numerical example: charts on the 2-sphere#

\varphi_N(x, y, z) = (x, y) / (1 - z), \quad \varphi_S(x, y, z) = (x, y) / (1 + z).

\varphi_S \circ \varphi_N^{-1}(u, v) = \frac{(2u/(u^2+v^2+1), 2v/(u^2+v^2+1))}{1 + (u^2+v^2-1)/(u^2+v^2+1)} = \frac{(2u, 2v)}{(u^2+v^2+1) + (u^2+v^2-1)} = \frac{(u, v)}{u^2+v^2}.

This is the inversion in the unit circle. It is smooth (in fact, real-analytic) on its domain $\mathbb{R}^2 \setminus \{0\}$ , which is exactly where both charts overlap (everything except the two poles). So the two charts form a smooth atlas on $$S^2$$ . Verify a tangent computation: at the equator point $$(1, 0, 0)$$ , the chart $\varphi_N$ assigns coordinates $$(1, 0)$$ . A tangent vector at this point along the direction of the $$y$$ -axis (i.e., $\partial/\partial y|_{(1,0,0)}$ as an embedded vector in $\mathbb{R}^3$ ) corresponds in the chart to a vector $\partial / \partial v|_{(1, 0)}$ — you can check by differentiating the inverse stereographic formula.

A worked numerical example: tangent vectors as derivations on $$S^2$$ #

Take the smooth function $$f(x, y, z) = z$$ on $$S^2$$ . In the north-pole chart $\varphi_N$ , this becomes $f \circ \varphi_N^{-1}(u, v) = (u^2 + v^2 - 1)/(u^2 + v^2 + 1)$ . At the point $\varphi_N(0, 0, 1) = (0, 0)$ — wait, the north pole is excluded from $\varphi_N$ . Take instead the point $$(0, 0, 0)$$ on the equator, $\varphi_N(0, 0, 0) = (0, 0)$ . Wait, also wrong: $\varphi_N(0, 0, 0) = (0, 0)/(1 - 0) = (0, 0)$ . So the equator point $$(0, 0, 0)$$ would project to the origin? Only if it’s actually on the equator at $\theta = 0$ ; let me redo: $\varphi_N(1, 0, 0) = (1, 0)$ . So at $$(1, 0, 0)$$ , $\varphi_N$ has chart coordinates $$(1, 0)$$ .

\frac{2v(u^2 + v^2 + 1) - (u^2 + v^2 - 1) \cdot 2v}{(u^2 + v^2 + 1)^2} \bigg|_{(1, 0)} = \frac{0 - 0}{4} = 0.

So $$X(z) = 0$$ at this point: moving in the chart- $$v$$ direction at $$(1, 0, 0)$$ does not change the $$z$$ -coordinate to first order. Geometrically, the chart- $$v$$ direction at this point is tangent to the equator (constant $$z$$ ), confirming the calculation.

Counterexample: a non-Hausdorff “manifold”#

Take two copies of the real line, $\mathbb{R}_1$ and $\mathbb{R}_2$ , and glue them by identifying $x_1 \sim x_2$ for all $x \neq 0$ . The result is the line with two origins. Each origin has a neighborhood homeomorphic to $\mathbb{R}$ , so locally Euclidean. The space is second-countable. But it is not Hausdorff: the two origins cannot be separated by disjoint open sets (any open neighborhood of either origin contains an interval $(-\varepsilon, 0) \cup (0, \varepsilon)$ , and the two origins both sit “above” this). This space fails to be a manifold because the Hausdorff condition is part of the definition. The pathology: a sequence converging to 0 from below has both origins as limits. Uniqueness of limits fails, the integration of ODEs becomes ill-posed (a flow line through the identified part has two possible continuations), and the entire smooth-manifold machinery breaks. The Hausdorff axiom is exactly the technical hypothesis that rules out this multiplicity.

A second counterexample: when stereographic glue fails#

Try to build $$S^2$$ with a single chart from stereographic projection from the north pole alone. The chart $\varphi_N$ misses exactly one point — the north pole itself. As an open set, $S^2 \setminus \{N\}$ is homeomorphic to $\mathbb{R}^2$ , so this single chart works topologically but does not cover $$S^2$$ . Adding a second chart from the south pole solves it. The general principle: a manifold may need many charts; the question is only whether the transition maps between them are smooth on overlaps. A “manifold structure” is precisely an equivalence class of atlases under the relation “smoothly compatible.” Two atlases give the same manifold iff their union is still a smooth atlas. This equivalence is the level of abstraction one above the atlas.

A subtle counterexample showing this is a real distinction: $\mathbb{R}^4$ admits uncountably many inequivalent smooth structures (Donaldson, Freedman, 1982). The topological space is the same, but the smooth atlases are inequivalent: there exist homeomorphisms between $(\mathbb{R}^4, \mathcal{A}_1)$ and $(\mathbb{R}^4, \mathcal{A}_2)$ but no diffeomorphism. This is unique to dimension 4 — every other Euclidean space has a unique smooth structure. The phenomenon is invisible in basic differential geometry but reveals that “smoothness” is much more delicate than it looks.

A worked numerical example: derivative of a smooth map between manifolds#

f(\varphi_N^{-1}(u, v)) = \frac{(2u, 2v, -(u^2+v^2-1))}{u^2+v^2+1}.

\varphi_N \circ f \circ \varphi_N^{-1}(u, v) = \frac{(2u, 2v)/(u^2+v^2+1)}{2(u^2+v^2)/(u^2+v^2+1)} = \frac{(u, v)}{u^2+v^2}.

So in the chart, reflection-across-equator becomes inversion-in-unit-circle — exactly the same transition map we computed earlier. This is not a coincidence: reflection across the equator equals stereographic projection from the south pole composed with inverse stereographic projection from the north, which is exactly the transition function. The differential at $$(u, v) = (1, 0)$$ (which corresponds to the equator point $$(1, 0, 0)$$ ): differentiate $$g(u, v) = (u, v)/(u^2+v^2)$$ . At $$(1, 0)$$ : $g_u = ((u^2+v^2) - u \cdot 2u, -v \cdot 2u)/(u^2+v^2)^2 = ((1 - 2), 0)/1 = (-1, 0)$ . $g_v = (-u \cdot 2v, (u^2+v^2) - v \cdot 2v)/(u^2+v^2)^2 = (0, 1)/1 = (0, 1)$ . So the Jacobian at $$(1, 0)$$ is $\begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix}$ — a reflection in the chart, matching the geometric expectation.

Common pitfall for beginners#

Beginners often think of tangent vectors as “arrows attached to the manifold.” On an embedded surface in $\mathbb{R}^3$ , this is fine. But on an abstract manifold, there is no ambient space, so where does the arrow live? The resolution — tangent vectors are derivations of smooth functions — feels alien because it abandons the picture entirely.

Why do it? Because derivations are intrinsic. They depend only on the smooth structure of $$M$$ , not on any embedding. The arrow picture is a consequence (via Whitney embedding) of the derivation picture, not the other way around. Skipping straight to derivations buys you generality: the same definitions work on infinite-dimensional manifolds (Banach manifolds), on supermanifolds, on schemes — places where the arrow picture has no analog.

A second pitfall: confusing smoothness of a map $f: M \to N$ with smoothness in coordinates. The right definition is “for every chart $\varphi$ on $$M$$ and every chart $\psi$ on $$N$$ , $\psi \circ f \circ \varphi^{-1}$ is smooth where defined.” This is the only sensible coordinate-free definition. Beginners write down a single formula in coordinates and conclude smoothness, but the formula needs to make sense in every chart. The transition functions of the atlas are exactly the bookkeeping that converts between charts.

Where this matters in physics, computing, and engineering#

In general relativity, spacetime is a 4-manifold without a preferred embedding. The tangent space at each event is a 4-dimensional vector space carrying the metric. Without the abstract-manifold framework, it would be impossible to formulate “spacetime curvature” without specifying an embedding into a higher-dimensional ambient space — and physically, no such ambient space is observable. The abstract framework is not optional; it is the only way to do GR.

In robotics and control theory, the configuration space of a multi-joint robot is a product of circles and spheres — a torus or a more general manifold. There is no natural embedding into a Euclidean space; using one (e.g., embedding $$S^1$$ into $\mathbb{R}^2$ ) introduces redundant coordinates and singularities (gimbal lock in Euler angles). Modern motion-planning libraries (OMPL, MoveIt) work intrinsically on the manifold, treating each chart’s coordinates as local approximations and stitching them together via smooth atlases.

In deep learning, the parameter space of a neural network with weight-sharing constraints (like CNNs) is naturally a manifold. The gradient descent algorithm secretly assumes a Euclidean structure, but tools like Riemannian SGD (used in geometric deep learning) work intrinsically on the parameter manifold. The Christoffel symbols of the natural metric become correction terms in the update rule.

Revisiting “what’s next” with sharper questions#

Article 7 will introduce vector fields, integral curves, flows, and Lie brackets. To prepare:

(1) A vector field on $$M$$ assigns a tangent vector to each point. On $\mathbb{R}^n$ this is just a function $\mathbb{R}^n \to \mathbb{R}^n$ . On a manifold without an embedding, what is the right definition? (2) The flow of a vector field is the family of diffeomorphisms generated by integrating the field. On a curved manifold, two vector fields’ flows generally do not commute. The Lie bracket measures the failure to commute. Why should there be a finite-dimensional algebraic gadget that captures this? (3) The Frobenius theorem says that a distribution (an assignment of subspaces of tangent spaces) is integrable if and only if it is closed under brackets. This is one of the deepest theorems in basic differential geometry. Why should integrability of a geometric distribution reduce to an algebraic identity?

You now have the abstract framework. Article 7 puts dynamics on it. Read it asking “what is the geometric meaning of $$[X, Y] = 0$$ , and how does it generalize from flat space to curved manifolds?”

One last worked example: the projective plane $\mathbb{RP}^2$ as a quotient manifold#

The real projective plane $\mathbb{RP}^2$ is the quotient of $$S^2$$ by the antipodal map. It is a 2-manifold, but non-orientable. As a smooth manifold, it can be given charts by taking the upper hemisphere of $$S^2$$ and identifying antipodal points on the equator.

Construct an atlas: three charts indexed by $$i = 1, 2, 3$$ , where the chart $U_i = \{[x_1 : x_2 : x_3] : x_i \neq 0\}$ uses coordinates $$(x_j/x_i, x_k/x_i)$$ for $j, k \neq i$ . Transition map between $$U_1$$ and $$U_2$$ : a point with coordinates $$(a, b) = (x_2/x_1, x_3/x_1)$$ in $$U_1$$ has $$U_2$$ -coordinates $$(x_1/x_2, x_3/x_2) = (1/a, b/a)$$ . So $\varphi_2 \circ \varphi_1^{-1}(a, b) = (1/a, b/a)$ . Smooth where $a \neq 0$ — exactly the overlap.

This gives a complete smooth atlas. Compute the Jacobian of the transition: $\partial(1/a, b/a)/\partial(a, b) = \begin{pmatrix} -1/a^2 & 0 \\ -b/a^2 & 1/a \end{pmatrix}$ , determinant $$-1/a^3$$ . The sign of the determinant changes with the sign of $$a$$ , which reflects $\mathbb{RP}^2$ ’s non-orientability: there is no consistent orientation across all charts. By contrast, on $$S^2$$ (the orientable cover), all transitions can be chosen with positive Jacobian. The structure-group reduction obstruction is exactly the Stiefel-Whitney class $$w_1$$ , which is nonzero on $\mathbb{RP}^2$ and zero on $$S^2$$ .

What’s next#

The next chapter introduces vector fields and flows — the dynamical side of manifold theory. Vector fields generate one-parameter groups of diffeomorphisms, and their algebraic structure (the Lie bracket) encodes infinitesimal symmetries. After that: differential forms, integration and Stokes’ theorem, Riemannian metrics (recovering geodesics and curvature in the abstract setting), and connections on bundles.

Differential Geometry (6): Smooth Manifolds — Geometry Beyond Embedded Surfaces

Charts, Atlases, and the Definition of a Smooth Manifold#

Examples That Ground the Abstraction#

Tangent Vectors as Derivations#

Smooth Maps, the Differential, and Submanifolds#

The Tangent Bundle, Vector Fields, and Flows#

Partitions of Unity and Global Constructions#

The Whitney Embedding Theorem and the Philosophy of Abstraction#

Deeper Examples and Common Pitfalls#

A worked numerical example: charts on the 2-sphere#

A worked numerical example: tangent vectors as derivations on $$S^2$$ #

Counterexample: a non-Hausdorff “manifold”#

A second counterexample: when stereographic glue fails#

A worked numerical example: derivative of a smooth map between manifolds#

Common pitfall for beginners#

Where this matters in physics, computing, and engineering#

Revisiting “what’s next” with sharper questions#

One last worked example: the projective plane $\mathbb{RP}^2$ as a quotient manifold#

What’s next#

Differential Geometry 12 parts

Liked this piece?

Charts, Atlases, and the Definition of a Smooth Manifold#

Examples That Ground the Abstraction#

Tangent Vectors as Derivations#

Smooth Maps, the Differential, and Submanifolds#

The Tangent Bundle, Vector Fields, and Flows#

Partitions of Unity and Global Constructions#

The Whitney Embedding Theorem and the Philosophy of Abstraction#

Deeper Examples and Common Pitfalls#

A worked numerical example: charts on the 2-sphere#

A worked numerical example: tangent vectors as derivations on $S^2$ #

Counterexample: a non-Hausdorff “manifold”#

A second counterexample: when stereographic glue fails#

A worked numerical example: derivative of a smooth map between manifolds#

Common pitfall for beginners#

Where this matters in physics, computing, and engineering#

Revisiting “what’s next” with sharper questions#

One last worked example: the projective plane $\mathbb{RP}^2$ as a quotient manifold#

What’s next#

Differential Geometry 12 parts

Liked this piece?

Read next

Kernel Methods (8): Deep Kernel Learning vs Deep Learning — A Practitioner's Guide

Kernel Methods (7): Large-Scale Kernels — Nystrom Approximation and Random Fourier Features

Kernel Methods (6): Gaussian Processes — When Kernels Meet Bayesian Inference

A worked numerical example: tangent vectors as derivations on $$S^2$$ #