Essence of Linear Algebra (5): Linear Systems and Column Space

The Central Question#

Almost everything in applied mathematics eventually lands on the same question:

Given a matrix $$A$$ and a vector $\vec{b}$ , does the equation $A\vec{x} = \vec{b}$ have a solution? If so, how many?

The mechanical answer is “row-reduce and look.” The structural answer is far more interesting — and it is the goal of this chapter. Three geometric objects tell you everything:

Column space $$C(A)$$ — the set of vectors $$A$$ can reach. It decides whether a solution exists.
Null space $$N(A)$$ — the set of vectors $$A$$ crushes to zero. It decides how many solutions exist.
Rank $$r$$ — the dimension of the column space. It quantifies how much information $$A$$ preserves.

Once these three are clear, every linear-systems result — existence, uniqueness, least squares, the four fundamental subspaces — becomes the same story told from different angles.

What You Will Learn#

Two complementary perspectives on $A\vec{x}=\vec{b}$ : rows (intersecting hyperplanes) vs. columns (linear combinations)
Gaussian elimination as the operational tool, and as the LU decomposition in disguise
Column space, null space, and rank, with their geometric meaning
The rank-nullity theorem and the four fundamental subspaces
How to read off the structure of any solution set at a glance

Prerequisites#

Chapter 2 : span, linear independence, basis
Chapter 3 : matrices as linear transformations
Chapter 4 : determinants and invertibility

Two Ways to See $A\vec{x} = \vec{b}$ #

Row Perspective: Intersecting Hyperplanes#

Consider a system of two equations in two unknowns:

$$\begin{cases} x + 2y = 5 \\ 3x - y = 1 \end{cases}$$

Each equation describes a line in the plane. A solution is a point that lies on both lines simultaneously — their intersection $$(1, 2)$$ . In three variables, each equation describes a plane and the solution set is the intersection of those planes (a point, a line, a plane, or nothing).

This is the picture most students meet first. It is geometric and concrete, but it hides what really matters: the structure of the matrix itself.

Column Perspective: Combining Vectors#

Rewrite the same system as a single vector equation:

$$x \begin{pmatrix} 1 \\ 3 \end{pmatrix} + y \begin{pmatrix} 2 \\ -1 \end{pmatrix} = \begin{pmatrix} 5 \\ 1 \end{pmatrix}$$

Now the question becomes: can we mix the columns of $$A$$ to produce $\vec{b}$ ? Solving the system is choosing the right amounts of each column.

This single shift in viewpoint is the most important idea in the whole chapter. From it, the column space, rank, and existence of solutions fall out for free.
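As a quick numerical check of the two perspectives, here is a minimal sketch using NumPy: it solves the system from the row picture, then confirms that the same two numbers are the mixing weights for the columns. The variable names are illustrative.

```python
import numpy as np

# Coefficient matrix and right-hand side of the 2x2 system above.
A = np.array([[1.0, 2.0],
              [3.0, -1.0]])
b = np.array([5.0, 1.0])

# Row picture: the intersection point of the two lines.
x, y = np.linalg.solve(A, b)

# Column picture: the same numbers weight the columns of A to produce b.
combo = x * A[:, 0] + y * A[:, 1]

print(x, y)
print(combo)
```

Both prints describe the same solution; only the reading changes.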

Ax = b geometrically: solvability lives in the column space

The figure above shows both sides of the story. On the left, the columns $\vec{a}_1, \vec{a}_2$ span the whole plane, so any target $\vec{b}$ can be assembled from them: pick the right scalars and the parallelogram closes onto $\vec{b}$ . On the right, the two columns happen to be parallel, so the column space collapses to a single line. A target sitting off that line is unreachable; the best we can do is project it onto the line — the geometry behind least squares.

Painter analogy. You stand in front of an empty canvas with three tubes of paint (the columns of $$A$$ ). The column space is the set of every color you can produce by mixing. If two tubes are the same shade, you have not gained any new color; your reachable palette is smaller than it looks. That smaller palette is exactly the column space of a rank-deficient matrix.

Gaussian Elimination: The Operational Tool#

The Three Legal Moves#

Elimination simplifies a system without changing its solution set, using only three elementary row operations:

Swap two rows.
Multiply a row by a non-zero constant.
Add a multiple of one row to another.

Why are these legal? Because each one is invertible: any sequence of operations can be undone, so the set of solutions before and after is identical.

A Worked Example#

\begin{cases} x + 2y + z = 2 \\ 3x + 8y + z = 12 \\ 4y + z = 2 \end{cases}

Write the augmented matrix and eliminate downward, one pivot at a time.

Gaussian elimination: turning a system into a triangular ladder

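Each highlighted entry is a pivot: the first non-zero entry in its row. Subtracting 3 times row 1 from row 2, and then 2 times the new row 2 from row 3, leaves the triangular system $x + 2y + z = 2$ , $2y - 2z = 6$ , $5z = -10$ . Back-substitute from the bottom up:

Row 3: $5z = -10 \implies z = -2$ .
Row 2: $2y - 2z = 6 \implies y = 1$ .
Row 1: $x + 2y + z = 2 \implies x = 2$ .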

Three pivots in three columns means three independent constraints on three unknowns — a unique solution.
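The elimination and back-substitution steps can be sketched in a few lines of NumPy. This is an illustrative implementation, not a production solver: the function name `gaussian_solve` is made up here, it assumes a square system, and it adds partial pivoting (row swaps), so its intermediate rows differ from the hand computation even though the answer is the same.

```python
import numpy as np

def gaussian_solve(A, b):
    """Solve Ax = b by forward elimination with partial pivoting, then back-substitution."""
    M = np.column_stack([A, b]).astype(float)   # augmented matrix [A | b]
    n = len(b)
    for k in range(n):
        # Partial pivoting: bring the largest entry in column k up to row k.
        p = k + np.argmax(np.abs(M[k:, k]))
        if np.isclose(M[p, k], 0.0):
            raise ValueError("matrix is singular to working precision")
        M[[k, p]] = M[[p, k]]
        # Eliminate everything below the pivot.
        for i in range(k + 1, n):
            M[i] -= (M[i, k] / M[k, k]) * M[k]
    # Back-substitution, from the last row upward.
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        x[i] = (M[i, -1] - M[i, i + 1:n] @ x[i + 1:]) / M[i, i]
    return x

A = np.array([[1, 2, 1],
              [3, 8, 1],
              [0, 4, 1]], dtype=float)
b = np.array([2, 12, 2], dtype=float)
x = gaussian_solve(A, b)
print(x)
```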

Pivots and Free Variables#

After elimination, columns split into two kinds:

Pivot columns — columns that contain a pivot. The corresponding variables are determined by the others.
Free columns — columns without a pivot. The corresponding variables can be chosen freely.

This split decides everything:

Situation	Solution set
A row reads $0 = c \neq 0$	No solution
Consistent, and every column is a pivot column	Unique solution
Consistent, and some columns are free	Infinitely many solutions (one per choice of free variables)
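To see the pivot/free split concretely, the sketch below row-reduces a copy of the matrix and reports which columns received a pivot. The helper `pivot_columns` and its tolerance are assumptions made for this illustration; a zero test in floating point always needs some threshold.

```python
import numpy as np

def pivot_columns(A, tol=1e-10):
    """Return (pivot_cols, free_cols) found by row-reducing a copy of A."""
    M = np.array(A, dtype=float)
    m, n = M.shape
    pivots, row = [], 0
    for col in range(n):
        if row == m:
            break
        p = row + np.argmax(np.abs(M[row:, col]))
        if abs(M[p, col]) < tol:
            continue                      # no pivot available: a free column
        M[[row, p]] = M[[p, row]]         # move the pivot row into place
        M[row] /= M[row, col]             # scale the pivot to 1
        for i in range(m):
            if i != row:
                M[i] -= M[i, col] * M[row]
        pivots.append(col)
        row += 1
    free = [c for c in range(n) if c not in pivots]
    return pivots, free

wide = np.array([[1, 2, 3],
                 [2, 4, 6]])
square = np.array([[1, 2, 1],
                   [3, 8, 1],
                   [0, 4, 1]])
print(pivot_columns(wide))
print(pivot_columns(square))
```

The wide matrix has one pivot column and two free columns; the square matrix from the worked example has a pivot in every column.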

LU Decomposition: Elimination, Stored#

Record the multipliers used during elimination and the whole process becomes a factorisation:

$$A = L \cdot U$$

where $$U$$ is the upper-triangular matrix you ended up with, and $$L$$ is lower-triangular with the elimination multipliers stored in its entries. LU decomposition is just Gaussian elimination, packaged for re-use: once you have $$L$$ and $$U$$ , you can solve $A\vec{x}=\vec{b}$ for any new $\vec{b}$ in $$O(n^2)$$ instead of $$O(n^3)$$ .

LU decomposition as two simple shears in sequence

Geometrically the picture is delightful. $$A$$ may look complicated, but elimination splits it into two of the simplest transformations there are: an upper-triangular shear-and-scale ( $$U$$ ) followed by a lower-triangular shear ( $$L$$ ). Triangular matrices are easy because their action is sequential: for lower-triangular $$L$$ each output coordinate depends only on earlier inputs, and for upper-triangular $$U$$ only on later ones. That is exactly why forward and back substitution work.

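```python
import numpy as np

# The 3x3 system from the worked example above.
A = np.array([[1, 2, 1],
              [3, 8, 1],
              [0, 4, 1]], dtype=float)
b = np.array([2, 12, 2], dtype=float)

# np.linalg.solve factors A (LU with partial pivoting) and substitutes.
x = np.linalg.solve(A, b)
print(f"Solution: {x}")
print(f"Verify Ax = {A @ x}")
```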
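To make the "elimination, stored" idea tangible, here is a minimal sketch of LU without row swaps, followed by a solve that reuses the factors for two right-hand sides. The function names are hypothetical, and skipping pivoting is a simplifying assumption that only works when no zero pivot appears (library routines pivot).

```python
import numpy as np

def lu_no_pivot(A):
    """Doolittle LU without row swaps; assumes no zero pivot is encountered."""
    U = np.array(A, dtype=float)
    n = U.shape[0]
    L = np.eye(n)
    for k in range(n - 1):
        for i in range(k + 1, n):
            L[i, k] = U[i, k] / U[k, k]      # store the elimination multiplier
            U[i] -= L[i, k] * U[k]
    return L, U

def solve_with_lu(L, U, b):
    """Solve LUx = b: forward substitution for y, then back substitution for x."""
    n = len(b)
    y = np.zeros(n)
    for i in range(n):
        y[i] = b[i] - L[i, :i] @ y[:i]       # L has a unit diagonal
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        x[i] = (y[i] - U[i, i + 1:] @ x[i + 1:]) / U[i, i]
    return x

A = np.array([[1, 2, 1], [3, 8, 1], [0, 4, 1]], dtype=float)
L, U = lu_no_pivot(A)

# One factorisation, two right-hand sides.
x1 = solve_with_lu(L, U, np.array([2.0, 12.0, 2.0]))
x2 = solve_with_lu(L, U, np.array([1.0, 0.0, 0.0]))
print(L)
print(U)
print(x1, x2)
```

The entries below the diagonal of `L` are exactly the multipliers used in the hand elimination (3, 0, and 2).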

Column Space: Where the Matrix Can Reach#

Definition#

The column space of an $m \times n$ matrix $$A$$ is the set of every output it can produce:

$$C(A) = \{ A\vec{x} \mid \vec{x} \in \mathbb{R}^n \} = \text{span}\{ \text{columns of } A \}$$

Two equivalent ways to read this: it is everything you can output, and it is the span of the columns.

The Existence Theorem#

$A\vec{x} = \vec{b}$ has a solution if and only if $\vec{b} \in C(A)$ .

This is the cleanest statement in the chapter. “Does my equation have a solution?” becomes “is my target in the column space?” — a purely geometric question.
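One way to test membership in the column space numerically is to find the best combination of columns and check whether it reproduces the target. The helper `in_column_space` and its tolerance are assumptions for this sketch, not a library function.

```python
import numpy as np

def in_column_space(A, b, tol=1e-10):
    """True if b is (numerically) a linear combination of the columns of A."""
    x, *_ = np.linalg.lstsq(A, b, rcond=None)   # best least-squares coefficients
    return bool(np.linalg.norm(A @ x - b) <= tol * max(1.0, np.linalg.norm(b)))

# Parallel columns: C(A) is the line through (1, 2).
A = np.array([[1.0, 2.0],
              [2.0, 4.0]])

on_line = in_column_space(A, np.array([3.0, 6.0]))
off_line = in_column_space(A, np.array([3.0, 5.0]))
print(on_line, off_line)
```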

What Column Spaces Look Like#

For a $3 \times 3$ matrix the column space lives inside $\mathbb{R}^3$ , and there are only three possibilities:

Rank	Column space	Meaning
1	A line through the origin	All columns are scalar multiples of one direction
2	A plane through the origin	Two independent directions; the third column is redundant
3	All of $\mathbb{R}^3$	Three independent directions; $$A$$ is invertible

The pattern generalises: for an $m \times n$ matrix, the column space is some $$r$$ -dimensional subspace of $\mathbb{R}^m$ , where $$r$$ is the rank.

Mixer analogy. Imagine an audio mixer with three faders (the columns) and one master output. The set of all mixes you can produce is the column space. If two channels carry the same instrument, sliding their faders changes nothing genuinely new — that redundancy is what “rank deficiency” sounds like.

Null Space: What Gets Crushed#

Essence of Linear Algebra (5): Linear Systems and Column Space — Chapter summary

Definition#

The null space of $$A$$ is the set of inputs that are sent to zero:

$$N(A) = \{ \vec{x} \mid A\vec{x} = \vec{0} \}$$

The null space always contains the zero vector (since $A\vec{0} = \vec{0}$ for any matrix). The interesting question is whether it contains anything else.

Null space N(A) = directions A annihilates

The figure shows the geometric punchline. Left: the matrix $A=\begin{pmatrix}1&2\\2&4\end{pmatrix}$ has linearly dependent rows. Its null space is the entire line $\text{span}\{(-2,1)\}$ — every vector along that direction is mapped to the origin. The image (column space) is a different line, the direction $$(1,2)$$ . Right: the projection $\mathbb{R}^3 \to \mathbb{R}^2$ that drops the $$z$$ -coordinate has the entire $$z$$ -axis as its null space; everything vertical is annihilated.
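Both null spaces described above can be recovered numerically from the SVD: the right singular vectors that belong to zero singular values span $N(A)$. The helper name and the tolerance below are choices made for this sketch.

```python
import numpy as np

def null_space_basis(A, tol=1e-10):
    """Orthonormal basis for N(A), taken from the right singular vectors of A."""
    A = np.atleast_2d(np.asarray(A, dtype=float))
    _, s, Vt = np.linalg.svd(A)            # full SVD: Vt is n x n
    rank = int(np.sum(s > tol * s.max())) if s.size and s.max() > 0 else 0
    return Vt[rank:].T                      # columns span N(A)

# Left panel: dependent rows, null space along (-2, 1).
A = np.array([[1.0, 2.0],
              [2.0, 4.0]])
N = null_space_basis(A)

# Right panel: the projection that drops z, null space along the z-axis.
P = np.array([[1.0, 0.0, 0.0],
              [0.0, 1.0, 0.0]])
Nz = null_space_basis(P)

print(N.ravel())
print(Nz.ravel())
```

The basis vectors come out normalised and their sign is arbitrary, so expect a unit vector parallel to $(-2, 1)$ rather than $(-2, 1)$ itself.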

Why the Null Space Controls Uniqueness#

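Suppose $\vec{x}_p$ is one particular solution, so $A\vec{x}_p = \vec{b}$ , and let $\vec{n}$ be any vector in $$N(A)$$ . Then $\vec{x}_p + \vec{n}$ is also a solution:

$$A(\vec{x}_p + \vec{n}) = A\vec{x}_p + A\vec{n} = \vec{b} + \vec{0} = \vec{b}$$

Conversely, any two solutions differ by a vector in the null space, so the complete solution set is

$$\{ \vec{x}_p + \vec{n} \mid \vec{n} \in N(A) \}$$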

The geometric picture is simple: take the null space (a subspace through the origin) and shift it by one particular solution. The result is an affine subspace parallel to the null space — exactly the solution set.

If $N(A) = \{\vec{0}\}$ : the solution is unique (when it exists).
If $$N(A)$$ contains non-zero vectors: there are infinitely many solutions, parametrised by the null space.

Steamroller analogy. A steamroller compresses a 3D object into a 2D pancake. All vertical motion is lost — the vertical direction is in the null space. Two objects whose only difference is vertical produce the same flattened image: the null space is exactly the ambiguity in inverting the flattening.
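The "particular solution plus null space" structure is easy to verify numerically. In this sketch the matrix, the right-hand side, and the two null-space directions are hand-picked illustrations; any vectors with $A\vec{n} = \vec{0}$ would do.

```python
import numpy as np

A = np.array([[1.0, 2.0, 3.0],
              [2.0, 4.0, 6.0]])
b = np.array([1.0, 2.0])

# One particular solution (lstsq returns the minimum-norm one).
x_p, *_ = np.linalg.lstsq(A, b, rcond=None)

# Two null-space directions, easy to check by hand: A @ n = 0 for each.
n1 = np.array([-2.0, 1.0, 0.0])
n2 = np.array([-3.0, 0.0, 1.0])

# Any x_p + s*n1 + t*n2 solves the system as well.
rng = np.random.default_rng(0)
checks = []
for s, t in rng.normal(size=(3, 2)):
    x = x_p + s * n1 + t * n2
    checks.append(np.allclose(A @ x, b))
print(checks)
```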

Rank: Effective Dimension#

The rank of $$A$$ is defined as

$$\text{rank}(A) = \dim C(A) = \text{number of pivots after elimination}$$

It is also the maximum number of linearly independent columns, and (a small miracle) the maximum number of linearly independent rows. Row rank equals column rank for any matrix — it is one of those theorems that looks almost trivial once proved, but says something deep about the symmetry between rows and columns.
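A small numerical illustration of row rank equalling column rank, assuming a random test matrix built to have rank 2. NumPy computes both ranks from singular values, so this is a consistency check rather than a proof.

```python
import numpy as np

rng = np.random.default_rng(0)
# A 4x6 matrix built as (4x2) @ (2x6), so its rank is at most 2.
A = rng.normal(size=(4, 2)) @ rng.normal(size=(2, 6))

col_rank = np.linalg.matrix_rank(A)      # independent columns of A
row_rank = np.linalg.matrix_rank(A.T)    # independent rows of A
print(col_rank, row_rank)
```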

What Rank Tells You#

Rank is the count of effective dimensions — how many independent directions the transformation actually preserves.

$3\times 3$ matrix with rank	Geometric effect
3 (full rank)	Maps $\mathbb{R}^3$ onto $\mathbb{R}^3$ ; invertible
2	Squashes 3D space onto a plane
1	Squashes 3D space onto a line
0	The zero matrix; everything goes to the origin

Information analogy. Rank is the number of independent information channels. A color photo carries rank-3 information per pixel (R, G, B). Convert it to greyscale and the rank drops to 1; you have lost two whole channels. In machine learning, low-rank approximation is the same idea applied to data matrices: keep only the dominant channels and discard the rest.

```python
import numpy as np

A = np.array([[1, 2, 3],
              [4, 5, 6],
              [7, 8, 9]])

print(f"Rank: {np.linalg.matrix_rank(A)}")  # 2  (row 3 = 2*row 2 - row 1)
```

The Rank-Nullity Theorem#

For any matrix $$A$$ with $$n$$ columns:

$$\boxed{\;\text{rank}(A) + \dim N(A) = n\;}$$

In words: the dimensions you keep plus the dimensions you crush equal the number of input dimensions you started with. Nothing is created and nothing is lost.

Rank-Nullity Theorem: every input dimension is either preserved or crushed

The bar chart on the left is the theorem in pictures: for every matrix, the blue (rank) and amber (nullity) bars sum to $$n$$ . The pie on the right shows the same thing as a partition of the input space $\mathbb{R}^n$ into a “preserved” part (the row space) and a “crushed” part (the null space).

Worked Example#

Suppose $$A$$ is a $3 \times 5$ matrix of rank $2$. Then

$$\dim N(A) = n - r = 5 - 2 = 3$$

Three free variables, a 3-dimensional null space, and a 2-dimensional column space living inside $\mathbb{R}^3$ — the full structure decoded from a single number.
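The same bookkeeping can be checked on a concrete matrix. The sketch below builds a random $3 \times 5$ matrix of rank 2 (an assumed example, not one from the text) and counts rank and nullity separately before adding them.

```python
import numpy as np

rng = np.random.default_rng(1)
# A 3x5 matrix of rank 2, built as (3x2) @ (2x5).
A = rng.normal(size=(3, 2)) @ rng.normal(size=(2, 5))

m, n = A.shape
r = np.linalg.matrix_rank(A)

# Nullity counted independently from the singular values.
s = np.linalg.svd(A, compute_uv=False)
# The SVD returns only min(m, n) singular values; the remaining n - min(m, n)
# input directions have singular value zero by definition.
s_full = np.concatenate([s, np.zeros(n - len(s))])
nullity = int(np.sum(s_full <= 1e-10 * s.max()))

print(r, nullity, r + nullity == n)
```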

Four Cases for $A\vec{x}=\vec{b}$ #

For an $m \times n$ matrix of rank $$r$$ , only four scenarios are possible.

Three faces of Ax = b: unique, infinitely many, none

Case 1: $$r = m = n$$ — Square and Full Rank#

$$A$$ is invertible. For every $\vec{b}$ there is exactly one solution $\vec{x} = A^{-1}\vec{b}$ . The column space is all of $\mathbb{R}^m$ and the null space is just $\{\vec{0}\}$ .

Case 2: $$r = n < m$$ — Tall and Full Column Rank (Overdetermined)#

More equations than unknowns. The column space is a proper subspace of $\mathbb{R}^m$ , so most $\vec{b}$ are unreachable. When a solution does exist it is unique, but in practice we use least squares to find the closest reachable $\vec{b}$ — that is the orange dot in the rightmost panel above.

Case 3: $$r = m < n$$ — Wide and Full Row Rank (Underdetermined)#

More unknowns than equations. The column space fills $\mathbb{R}^m$ , so every $\vec{b}$ has a solution — but the null space has dimension $$n-m>0$$ , so there are infinitely many. The middle panel shows the typical picture: the solution set is a line (or plane, or higher) of equally valid answers.

Case 4: $$r < m$$ and $$r < n$$ — Rank Deficient#

The most delicate case. Some $\vec{b}$ have no solution; others have infinitely many. Both pathologies appear at once.

The Four Fundamental Subspaces#

For an $m \times n$ matrix of rank $$r$$ , four subspaces tell the whole story:

Subspace	Symbol	Lives in	Dimension
Column space	$$C(A)$$	$\mathbb{R}^m$	$$r$$
Null space	$$N(A)$$	$\mathbb{R}^n$	$$n - r$$
Row space	$$C(A^T)$$	$\mathbb{R}^n$	$$r$$
Left null space	$$N(A^T)$$	$\mathbb{R}^m$	$$m - r$$

These four come in two orthogonal pairs:

In $\mathbb{R}^n$ : the row space and the null space are orthogonal complements. Every input vector decomposes uniquely into a “useful” part (row space) and a “wasted” part (null space).
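In $\mathbb{R}^m$ : the column space and the left null space are orthogonal complements. Every output vector decomposes uniquely into a reachable part (column space) and an unreachable part (left null space), and $A\vec{x} = \vec{b}$ is solvable exactly when the unreachable part of $\vec{b}$ is zero.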

The matrix $$A$$ acts as a clean bijection from the row space to the column space (both $$r$$ -dimensional), and crushes the null space to zero. Strang calls this the “big picture of linear algebra,” and once you internalise it you stop thinking of matrices as numerical tables and start seeing them as geometric machinery.
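All four subspaces can be read off a single SVD, which makes the dimension table and the two orthogonality claims easy to check. The function `four_subspaces` and its tolerance are assumptions for this sketch.

```python
import numpy as np

def four_subspaces(A, tol=1e-10):
    """Orthonormal bases (as columns) for C(A), N(A^T), C(A^T), N(A) via the SVD."""
    U, s, Vt = np.linalg.svd(A)                 # full SVD: U is m x m, Vt is n x n
    r = int(np.sum(s > tol * s.max()))
    return {
        "column space": U[:, :r],
        "left null space": U[:, r:],
        "row space": Vt[:r].T,
        "null space": Vt[r:].T,
    }

A = np.array([[1.0, 2.0, 3.0],
              [2.0, 4.0, 6.0]])          # m = 2, n = 3, rank 1
S = four_subspaces(A)
dims = {name: basis.shape[1] for name, basis in S.items()}
print(dims)

# The two orthogonal pairs.
print(np.allclose(S["row space"].T @ S["null space"], 0))
print(np.allclose(S["column space"].T @ S["left null space"], 0))
```

With $m = 2$, $n = 3$, $r = 1$ the dimensions come out as $r = 1$, $m - r = 1$, $r = 1$, and $n - r = 2$, matching the table.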

Applications#

Least Squares: When There Is No Exact Solution#

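When $\vec{b}$ is not in the column space, minimise $\|A\vec{x} - \vec{b}\|^2$ instead. The minimiser $\hat{x}$ satisfies the normal equations:

$$A^T A \hat{x} = A^T \vec{b}$$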

Geometrically, $A\hat{x}$ is the orthogonal projection of $\vec{b}$ onto the column space — the closest reachable point.

```python
import numpy as np

# Fit y = slope * x + intercept to the points (1, 2), (2, 3), (3, 5), (4, 4).
A = np.array([[1, 1], [2, 1], [3, 1], [4, 1]])
b = np.array([2, 3, 5, 4])

x, *_ = np.linalg.lstsq(A, b, rcond=None)
print(f"Best fit: y = {x[0]:.2f}x + {x[1]:.2f}")
```
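To connect the fit back to the projection picture, the sketch below solves the normal equations directly for the same data and checks that the residual is orthogonal to every column of $A$. Solving the normal equations by hand is fine for a small, well-conditioned example like this one; it is not what `lstsq` does internally.

```python
import numpy as np

A = np.array([[1, 1], [2, 1], [3, 1], [4, 1]], dtype=float)
b = np.array([2, 3, 5, 4], dtype=float)

# Solve A^T A x = A^T b directly.
x_hat = np.linalg.solve(A.T @ A, A.T @ b)

# The residual b - A x_hat should be orthogonal to the column space.
residual = b - A @ x_hat
print(x_hat)             # slope and intercept
print(A.T @ residual)    # numerically zero
```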

Computer Graphics: Projection#

The orthographic projection from 3D to 2D that keeps $$x$$ and $$y$$ and drops $$z$$ is

$$P = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}$$

Its null space is the entire $$z$$ -axis: depth is destroyed, which is why recovering 3D from a single 2D image is genuinely ambiguous (and why you need stereo, motion, or learned priors).
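A two-point illustration of the depth ambiguity, using made-up coordinates: two points that differ only in $z$ land on the same image point, and their difference lies in the null space of $P$.

```python
import numpy as np

P = np.array([[1.0, 0.0, 0.0],
              [0.0, 1.0, 0.0]])       # drops the z-coordinate

near = np.array([2.0, 3.0, 1.0])
far = np.array([2.0, 3.0, 50.0])     # same x and y, very different depth

print(P @ near, P @ far)             # identical 2D images
print(P @ (far - near))              # the difference is sent to zero
```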

Circuit Analysis#

Kirchhoff’s current law, written in matrix form, says $A\vec{i} = \vec{0}$ where $$A$$ is the network’s incidence matrix. The null space of $$A$$ is the space of valid loop currents, and its dimension counts the number of independent loops in the circuit — a topological fact extracted from pure linear algebra.
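Here is a small sketch of the loop count for a hypothetical circuit: four nodes joined in a square plus one diagonal branch. The sign convention (minus one where a branch leaves a node, plus one where it enters) and the branch directions are assumptions of this example.

```python
import numpy as np

# Hypothetical circuit: 4 nodes, 5 directed branches (a square plus one diagonal).
edges = [(0, 1), (1, 2), (2, 3), (3, 0), (0, 2)]   # (tail, head)
n_nodes = 4

# Node-by-branch incidence matrix: -1 where the branch leaves, +1 where it enters.
A = np.zeros((n_nodes, len(edges)))
for j, (tail, head) in enumerate(edges):
    A[tail, j] = -1.0
    A[head, j] = 1.0

rank = np.linalg.matrix_rank(A)
n_loops = len(edges) - rank          # dim N(A) by rank-nullity

# A unit current around the outer square satisfies KCL at every node.
loop_current = np.array([1.0, 1.0, 1.0, 1.0, 0.0])
print(rank, n_loops, A @ loop_current)
```

For a connected network the rank is one less than the number of nodes, so the loop count is branches minus nodes plus one: here $5 - 4 + 1 = 2$.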

Deep Intuition: Three Questions Before Computing#

When you see a linear system, do not start eliminating immediately. First ask:

What is the column space? It tells you which $\vec{b}$ are solvable.
What is the null space? It tells you whether the answer is unique, and if not, what shape the solution set has.
What is the rank? It quantifies how much information $$A$$ preserves.

These three questions are answered by elimination, but elimination is only the bookkeeping. The geometry is what matters.

```python
import numpy as np

def analyze_system(A, b):
    """Print the solution structure of Ax = b."""
    m, n = A.shape
    r = np.linalg.matrix_rank(A)
    # Rank of the augmented matrix [A | b]: it exceeds r exactly when b is outside C(A).
    r_aug = np.linalg.matrix_rank(np.column_stack([A, b]))

    print(f"Matrix: {m}x{n}, rank={r}, nullity={n-r}")

    if r_aug > r:
        print("  -> No solution (b is outside the column space)")
    elif r == n:
        print("  -> Unique solution")
        print(f"     x = {np.linalg.lstsq(A, b, rcond=None)[0]}")
    else:
        print(f"  -> Infinitely many solutions ({n-r} free variables)")
        # lstsq returns the minimum-norm member of the solution set.
        print(f"     one particular x = {np.linalg.lstsq(A, b, rcond=None)[0]}")

# Square and full rank: a unique solution.
analyze_system(np.array([[1, 2], [3, -1]], dtype=float),
               np.array([5, 1], dtype=float))

# Wide, rank 1, with b in the column space: infinitely many solutions.
analyze_system(np.array([[1, 2, 3], [2, 4, 6]], dtype=float),
               np.array([1, 2], dtype=float))
```

Summary#

Concept	What it tells you
Column space $$C(A)$$	Which $\vec{b}$ are solvable
Null space $$N(A)$$	Whether the solution is unique; the shape of the solution set
Rank	How many independent directions $$A$$ preserves
Rank-nullity	$\text{rank} + \text{nullity} = n$ — a conservation law for dimensions
Four subspaces	The complete structural picture of any matrix

The essential thinking of linear algebra is to understand equations through spaces and dimensions, not through mechanical computation. Elimination remains the workhorse algorithm, but its real job is to expose the geometry that was already there.

Sparse Matrix Shortcut: When $A\vec{x}=\vec{b}$ Costs $$O(n)$$ Instead of $$O(n^3)$$ #

The headline cost of solving $$Ax=b$$ for a dense $n\times n$ matrix is $\Theta(n^3)$ via LU. For $$n=10^6$$ this is $\sim 10^{18}$ FLOPs — impossible. Yet finite-element simulations, recommendation systems, and graph problems routinely solve systems of that size, in seconds. The trick is sparsity.

A matrix is sparse when the fraction of non-zero entries is small — think $\le 1\%$ . The discrete Laplacian on a 2-D grid, for example, has 5 non-zeros per row regardless of how large the grid is. If $$A$$ has $\mathrm{nnz}(A)$ non-zeros, three things change at once:

Storage drops from $$n^2$$ doubles to $\mathrm{nnz}(A)$ doubles plus $\mathrm{nnz}(A) + n + 1$ integer indices. CSR (Compressed Sparse Row) is the canonical layout.
Matrix-vector products $$Ax$$ cost $\Theta(\mathrm{nnz}(A))$ instead of $\Theta(n^2)$ . For the 2-D Laplacian on a million-cell grid, that is about $5\times 10^6$ multiply-adds per product, typically a few milliseconds or less on a laptop.
Iterative solvers like Conjugate Gradient (for SPD $$A$$ ) or GMRES (for general $$A$$ ) build the answer using only matrix-vector products. Total cost: $\Theta(k\cdot \mathrm{nnz}(A))$ where $$k$$ is the iteration count, typically $O(\sqrt{\kappa(A)})$ for CG and often $$10^2$$ to $$10^3$$ once a good preconditioner is in place.
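To show why a CSR product costs $\Theta(\mathrm{nnz}(A))$, here is a hand-written matrix-vector product over the three CSR arrays. It is a teaching sketch with a made-up function name; in practice `scipy.sparse` does this in compiled code.

```python
import numpy as np

def csr_matvec(data, indices, indptr, x):
    """y = A @ x for A stored in CSR form; touches each stored non-zero once."""
    n_rows = len(indptr) - 1
    y = np.zeros(n_rows)
    for i in range(n_rows):
        start, end = indptr[i], indptr[i + 1]       # slice of row i
        y[i] = data[start:end] @ x[indices[start:end]]
    return y

# CSR arrays for the 4x4 tridiagonal matrix with 2 on the diagonal and -1 beside it.
data    = np.array([2., -1., -1., 2., -1., -1., 2., -1., -1., 2.])   # non-zero values
indices = np.array([0, 1, 0, 1, 2, 1, 2, 3, 2, 3])                   # their column indices
indptr  = np.array([0, 2, 5, 8, 10])                                 # where each row starts

x = np.array([1.0, 2.0, 3.0, 4.0])
y = csr_matvec(data, indices, indptr, x)
print(y)
```

The three arrays hold 10 values, 10 column indices, and 5 row pointers, compared with 16 entries for the dense matrix; the gap widens quickly as $n$ grows.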

A worked example. Solve a 1-D Poisson problem $$-u'' = f$$ on $$n=10^5$$ grid points:

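```python
import numpy as np
import scipy.sparse as sp
import scipy.sparse.linalg as spla

n = 100000
diag = 2.0 * np.ones(n)
off  = -1.0 * np.ones(n - 1)
A = sp.diags([off, diag, off], [-1, 0, 1], format="csr")  # tridiagonal, SPD
b = np.ones(n)

# rtol requires SciPy >= 1.12 (older releases call this argument tol).
# info == 0 signals convergence. Unpreconditioned CG needs on the order of
# n iterations for this matrix, so expect tens of thousands of iterations.
x, info = spla.cg(A, b, rtol=1e-8)
```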

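The same problem with a dense np.linalg.solve would need about $10^{15}$ FLOPs and 80 GB of RAM. The sparse version stores roughly $3\times 10^5$ non-zeros, and each CG iteration costs $\Theta(n)$ . One caveat: $\kappa(A)$ grows like $$n^2$$ for this matrix, so unpreconditioned CG needs on the order of $$n$$ iterations; a preconditioner or a banded direct solve cuts that down sharply. Either way, the work stays orders of magnitude below the dense solve.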

When does sparsity not help? When the sparsity pattern produces fill-in during direct factorisation. Even if $$A$$ is sparse, $$L$$ and $$U$$ in the LU decomposition can be nearly dense: for an arrowhead matrix whose dense row and column come first, fill-in can turn $$O(n)$$ non-zeros into $$O(n^2)$$ . This is why scipy's splu accepts a column-ordering option (permc_spec, with choices such as COLAMD and MMD_AT_PLUS_A) that reduces fill-in, and why iterative methods are often preferred over direct solves for very large systems.
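Fill-in is easy to observe on a small dense stand-in for an arrowhead matrix. The sketch below uses a hand-rolled elimination without row swaps (an assumption that is safe here because the test matrix is diagonally dominant) and counts the non-zeros of $L$ and $U$ for two orderings of the same matrix.

```python
import numpy as np

def lu_nonzeros(A, tol=1e-12):
    """Count non-zeros in L and U after elimination without row swaps."""
    U = np.array(A, dtype=float)
    n = U.shape[0]
    L = np.eye(n)
    for k in range(n - 1):
        for i in range(k + 1, n):
            L[i, k] = U[i, k] / U[k, k]
            U[i] -= L[i, k] * U[k]
    return int(np.sum(np.abs(L) > tol) + np.sum(np.abs(U) > tol))

n = 50
# Arrowhead with the dense row and column FIRST: elimination fills everything in.
bad = 4.0 * np.eye(n)
bad[0, :] = 1.0
bad[:, 0] = 1.0
bad[0, 0] = float(n)

# The same matrix with rows and columns reversed: the dense row and
# column come LAST, and elimination creates no new non-zeros.
good = bad[::-1, ::-1]

print(np.count_nonzero(bad), lu_nonzeros(bad), lu_nonzeros(good))
```

Both orderings describe the same linear system; only the numbering of unknowns and equations differs, which is precisely what a fill-reducing reordering exploits.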

What scipy.linalg.solve Actually Does Under the Hood#

When you call scipy.linalg.solve(A, b) for a dense $n\times n$ matrix, it does not compute $A^{-1}$ and then multiply. It dispatches to LAPACK’s dgesv, which:

Factors $$PA = LU$$ via partial-pivoting Gaussian elimination. This is dgetrf — $\tfrac{2}{3}n^3$ FLOPs. Partial pivoting (swapping rows so that the pivot has the largest absolute value in its column) is what keeps the algorithm numerically stable.
Solves $$Ly = Pb$$ by forward substitution — $$n^2$$ FLOPs.
Solves $$Ux = y$$ by back substitution — $$n^2$$ FLOPs.

Total: $\tfrac{2}{3}n^3 + 2n^2$ . Computing $A^{-1}$ explicitly would cost $$2n^3$$ FLOPs (an extra factor of 3) and, crucially, be less stable numerically. Both approaches have a forward error of roughly $\kappa(A)\cdot \varepsilon$ , but solve(A, b) is backward stable in practice (its residual is tiny relative to $\|A\|\,\|\vec{x}\|$ ), whereas the residual of inv(A) @ b can be larger by a factor of up to $\kappa(A)$ . There is essentially no situation where forming $A^{-1}$ is the right move. Solve once per right-hand side; reuse the LU factorisation if you have many.
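Reusing the factorisation looks like this with SciPy's `lu_factor` and `lu_solve`. The matrix and right-hand sides are random placeholders chosen for the sketch.

```python
import numpy as np
from scipy.linalg import lu_factor, lu_solve

rng = np.random.default_rng(0)
n = 200
A = rng.normal(size=(n, n))
B = rng.normal(size=(n, 5))              # five right-hand sides

lu, piv = lu_factor(A)                   # the O(n^3) step, done once

# Each additional right-hand side costs only two O(n^2) triangular solves.
X = np.column_stack([lu_solve((lu, piv), B[:, j]) for j in range(B.shape[1])])

print(np.allclose(A @ X, B))
```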

If you know more about $$A$$ , you can pick a faster routine:

$$A$$ symmetric positive definite — cho_solve (Cholesky), $\tfrac{1}{3}n^3$ FLOPs, twice as fast as LU.
$$A$$ symmetric indefinite — ldl factorisation.
$$A$$ banded with bandwidth $$b$$ — solve_banded, $\Theta(n b^2)$ FLOPs.
$$A$$ triangular — solve_triangular, $$n^2$$ FLOPs (no factorisation needed).
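Three of the specialised routines listed above, applied to one small test matrix. The tridiagonal matrix is an assumed example chosen because it is symmetric positive definite and banded at once; the triangular solve uses only its upper triangle, so it answers a different system.

```python
import numpy as np
from scipy.linalg import cho_factor, cho_solve, solve_banded, solve_triangular

n = 6
# Symmetric positive definite and tridiagonal: 2 on the diagonal, -1 beside it.
A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
b = np.ones(n)

# Cholesky route (uses symmetry and positive definiteness).
x_chol = cho_solve(cho_factor(A), b)

# Banded route: rows of `ab` hold the super-, main, and sub-diagonal.
ab = np.zeros((3, n))
ab[0, 1:] = -1.0      # superdiagonal (first entry unused)
ab[1, :] = 2.0        # main diagonal
ab[2, :-1] = -1.0     # subdiagonal (last entry unused)
x_band = solve_banded((1, 1), ab, b)

# Triangular route: no factorisation needed, just back substitution.
T = np.triu(A)
x_tri = solve_triangular(T, b)           # treats T as upper-triangular by default

print(np.allclose(x_chol, x_band), np.allclose(T @ x_tri, b))
```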

The principle: every piece of structure you can name about $$A$$ has a faster solver attached. Plain solve(A, b) is the slowest correct answer; specialised routines are 2x to 1000x faster. The chapter you just read covers one kind of structure, rank and the four fundamental subspaces. Symmetry, definiteness, and bandwidth are the other properties worth checking before you solve.

What’s Next#

Chapter 6 : Eigenvalues and Eigenvectors. Most vectors change direction under a transformation. A few special ones do not — they only get scaled. These eigenvectors are the natural axes of $$A$$ , the directions in which the matrix becomes a simple stretch. Find them and you understand the long-term behaviour of any linear system.
