Left Null Space — The Error Your Model Cannot Learn
At some point a model stops improving, but not in a dramatic way. The loss doesn’t blow up. It doesn’t fluctuate. It simply settles.
You tweak the learning rate. You train longer. You restart with better initialization.
Nothing changes.
The model isn’t unstable — it has just reached the edge of what it can represent.
What remains isn’t a training issue anymore. It’s a structural limitation.
Linear algebra gives that leftover error a name: the left null space.
What’s Actually Happening
We usually picture training as a simple loop:
change the weights → predictions change → error reduces
But here the pattern shifts:
change the weights → predictions move slightly → the error remains
The optimizer is still running, updates are still happening, yet progress feels cosmetic.
At this point you’re not really improving the model anymore — you’re moving inside the space it already understands.
The remaining difference isn’t due to bad training. It exists because the model has no way to represent it.
A tiny math example (no fear)
When Ax Can’t Reach b
Let’s imagine we collect two features for every data point:
a value x and a second value y
At first it looks like we have two independent measurements. But on closer inspection, we notice something interesting:
the second feature is always twice the first.
So although the data appears two-dimensional, it really isn’t. All points sit along a single direction — like dots drawn along a straight path.
Writing the data as a matrix
We can place a few samples together into a matrix. Taking, for instance, the samples x = 1, 2, 3 (so the second feature is 2, 4, 6):

A = [ 1  2
      2  4
      3  6 ]

This matrix represents the inputs given to the model.
The model learns weights:

w = [ w1
      w2 ]

And predictions come from multiplying them:

ŷ = A w

What the model is actually capable of
Carrying out the multiplication:

A w = [ 1·w1 + 2·w2
        2·w1 + 4·w2
        3·w1 + 6·w2 ]

If you look carefully, every row shares the same pattern. We can rewrite it as:

A w = (w1 + 2·w2) · [ 1
                      2
                      3 ]

This reveals an important limitation:
no matter how the weights change, the model can only move along one direction.
Training can slide predictions forward or backward on that line, but it cannot leave the line.
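To see this concretely, here is a minimal numpy sketch. The sample values (x = 1, 2, 3, with the second feature always twice the first) are illustrative:

```python
import numpy as np

# Feature matrix: the second column is always twice the first,
# so the data secretly spans a single direction.
A = np.array([[1., 2.],
              [2., 4.],
              [3., 6.]])

# Whatever weights we pick, the prediction is a multiple of [1, 2, 3].
for w in [np.array([1., 0.]), np.array([0., 1.]), np.array([3., -1.])]:
    pred = A @ w
    scale = w[0] + 2 * w[1]
    print(w, "->", pred, "=", scale, "* [1, 2, 3]")
```

Every weight vector collapses to a single scalar `w1 + 2·w2` times the same direction — training only ever moves along that line.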
When reality asks for more
Now suppose the true output is, for instance:

b = [ 1
      2
      4 ]

We would like the equation

A w = b

to have a solution.
But that would require b to lie on the same line as the model’s predictions. It doesn’t.
So no choice of weights can reproduce it exactly.
What training really does
Instead, the model settles for the nearest possible output — we’ll call it b̂, the best approximation it can produce.
The remaining difference

r = b − b̂

never disappears — not because training failed, but because the model has no way to express it.
That leftover gap is the part of reality outside the model’s language. In linear algebra, it lives in the left null space.
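A short numpy sketch makes the geometry tangible. The target b = [1, 2, 4] is an illustrative choice that does not sit on the line the model can reach:

```python
import numpy as np

A = np.array([[1., 2.],
              [2., 4.],
              [3., 6.]])
b = np.array([1., 2., 4.])   # a target that does not lie on span([1, 2, 3])

# Least squares finds the closest reachable output b_hat = A @ w.
w, *_ = np.linalg.lstsq(A, b, rcond=None)
b_hat = A @ w
r = b - b_hat

# The leftover error is orthogonal to every column of A:
# A.T @ r == 0, which is exactly the definition of the left null space.
print("residual:", r)
print("A.T @ r:", A.T @ r)   # numerically zero
```

The residual is not zero, yet `A.T @ r` is: no direction available to the model overlaps with the leftover error, so no gradient step can shrink it.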
Real life version
Imagine predicting salary using two features:

years of experience and months of experience

But:

months = 12 × years

Your model really only has one degree of freedom.
Now HR introduces a performance bonus.
Your training keeps running — but the error never disappears.
The model isn’t lazy.
It literally has no way to represent “bonus”.
That missing expressiveness is the left null space.
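Here is the salary story as a runnable sketch. The numbers (base salary, slope, bonus spread) are made up purely for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 50
years = rng.uniform(1, 10, size=n)
months = 12 * years                     # perfectly redundant feature
X = np.column_stack([years, months])    # the model's entire world

bonus = rng.normal(0, 5, size=n)        # signal the features cannot carry
salary = 40 + 3 * years + bonus         # made-up ground truth

# "Train" to convergence: exact least squares is the best any optimizer can do.
w, *_ = np.linalg.lstsq(X, salary, rcond=None)
residual = salary - X @ w

print("residual std:", residual.std())                       # stays well above zero
print("max |X.T @ residual|:", np.abs(X.T @ residual).max())  # ~0: nothing left to reduce
```

The second print shows the model has fully converged (the residual is orthogonal to everything it can see), while the first shows the error it can never remove.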
A calmer analogy
Fitting a straight ruler onto a curved road.
You slide it. Rotate it. Press harder.
Eventually nothing improves.
You didn’t stop optimizing — you reached the limit of what a straight line can do.
The remaining gap is the left null space.
Why this matters in ML
This explains the moments when the loss plateaus, when longer training changes nothing, and when restarts land in the same place.
Training adjusts parameters, but architecture decides possibility. The left null space is where reality lives outside your model’s language.
The quiet takeaway
Null space means:
the model can move without changing behavior
Left null space means:
reality can change without the model being able to follow
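The contrast can be checked directly in numpy, using a small matrix whose second column is twice the first (the same redundancy as the example):

```python
import numpy as np

A = np.array([[1., 2.],
              [2., 4.],
              [3., 6.]])

# Null space: moving the weights along [2, -1] changes nothing the model outputs.
w = np.array([5., 1.])
w_moved = w + 10 * np.array([2., -1.])
print(A @ w, A @ w_moved)        # identical predictions

# Left null space: a direction in output space no choice of weights can produce.
v = np.array([2., -1., 0.])
print(A.T @ v)                   # [0, 0]: changes to the target along v are invisible
```

One direction lets the weights wander without consequence; the other lets reality drift somewhere the model can never follow.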
