Why the people who need to understand AI most can't understand 432 parameters — and why they're hiring anyway.

“The most important step is the next one. But no one knows where it leads.”
— Kurt Gödel (attributed)
11. March 2026
Peter Senner co-created with Anthropic Claude
Why the people who need to understand AI most can't understand 432 parameters — and why they're hiring anyway.
The Alignment Research Center just published a paper. It's honest. Technically rigorous. And it documents, without naming it, one of the cleanest Paradoxical Interactions in the history of science.
They call it AlgZoo. It's a collection of tiny neural networks. The smallest has 8 parameters. The largest has 1,408. They're trained to do simple things — like finding the second-largest number in a sequence.
Here's the number that matters: 32.
That's the size of the largest model ARC believes they more or less fully understand. The next one they've put serious effort into — and failed — has 432 parameters.
432 parameters. They cannot explain it.
At the end of the paper, they say they're looking for researchers to help.
What 432 Parameters Means
To understand why this matters, you need a reference point.
GPT-4 has an estimated 1.8 trillion parameters. Claude, Gemini, and other frontier models operate in similar ranges. These are the systems being deployed globally, reshaping information environments, advising doctors and lawyers, generating code that runs in production.
ARC cannot fully explain a model with 432 parameters.
Not because they lack talent. ARC is among the most technically rigorous alignment research organizations in the world. The researchers who wrote this paper are serious people doing serious work. They know the mathematics. They know the architectures. They have the tools.
The gap is not competence. It's structural.
Complexity doesn't scale linearly. Understanding doesn't just fall behind — it drifts, structurally. At 32 parameters, you can still trace every computation, verify every pathway, account for every numerical relationship. At 432, the interactions between components generate behaviors that resist reduction. You can describe what the model does. You cannot fully explain why.
The wall appears long before you reach it.
The Job Ad
At the end of the AlgZoo paper, ARC issues a challenge. It reads like an invitation. They would be "keen to see" other ambitious researchers explore their models. They would be "excited" to see better mechanistic estimates. They pose a specific challenge: explain the 432-parameter model.
This is a job ad for a problem that may be structurally unsolvable.
Not because the next researcher will be incompetent. Because the interpretability gap they're describing isn't a gap in knowledge — it's a gap in the nature of understanding itself. Gödel didn't show that mathematicians weren't smart enough to prove everything. He showed that completeness is structurally impossible in sufficiently complex systems.
The ARC paper is a proof of concept for the same principle, run in silicon.
And the response — post the paper, invite collaborators, continue the research program — is the only rational response available. You don't stop because the wall exists. You walk toward it. Mangels Alternative.
The Structure
Let's name what's actually happening.
ARC's mission is interpretability: to understand what neural networks are doing well enough to make them safe. This requires, at minimum, being able to explain model behavior mechanistically. If you can't explain the model, you can't verify its alignment.
Their own research demonstrates that full mechanistic understanding becomes unreachable somewhere between 32 and 432 parameters.
The models they need to align have billions of times more parameters.
Every rational step deepens the problem:
- Invest more in interpretability research → discover how much you can't explain
- Publish the findings honestly → attract researchers who will also discover how much they can't explain
- Continue anyway → because stopping doesn't make the models disappear or become safer
The alignment project cannot stop. The interpretability ceiling exists regardless. The models are deployed regardless.
All are guilty. None are at fault.
What They Actually Need
ARC is looking for ambitious-oriented researchers. That's the framing. Technical rigor, mechanistic analysis, formal verification.
What would actually help is something the job ad cannot ask for: someone who doesn't need to solve the problem to describe it.
The interpretability ceiling isn't a technical challenge awaiting a smarter technique. It's a structural feature of the relationship between complexity and comprehension. Naming that clearly doesn't solve anything — but it prevents a specific kind of damage: the damage done when a research field mistakes a structural impossibility for a temporary knowledge gap.
That distinction matters enormously. A temporary knowledge gap tells you to hire more researchers. A structural impossibility tells you to redesign your expectations, your safety architectures, your entire framework for what "alignment" can mean.
The field is overwhelmingly populated by people who need it to be a temporary knowledge gap. Their careers, their grants, their mission — all depend on the problem being solvable.
The outside observer has no such dependency.
Try and Continue
The AlgZoo paper ends with a challenge and an invitation. It's the right response. There's no other response available.
But the paper also contains, embedded in its technical precision, a confession: we don't know how to fully understand even very small models. We are working on alignment for systems we cannot interpret. We are hiring people to help us do something we have not been able to do.
This is not failure. This is structure. The researchers who wrote this paper are among the most honest in the field precisely because they published the numbers, named the ceiling, and kept going.
"Try and continue" is not optimism. It's the only navigable position.
The wall at 432 parameters doesn't move. The research continues. And somewhere between those two facts is the entire alignment problem — not as a technical puzzle, but as a Paradoxical Interaction.
Rational actors. Clear-eyed research. Structural impossibility.
All are guilty. None are at fault.
Related Posts
When the CEO of an AI safety company tells you control is failing, believe him
The Mousetrap — Why asking AI how to align AI is the perfect paradox
Why smart people reject smarter insights—and act intelligently doing so
Why truth-tellers get ignored until it's too late
How systems theorists reproduce the enclosure milieus Luhmann warned against
On piinteract.org
- Examples: Technology & AI — The structural patterns beneath the technical surface
- Core Practices — Navigation without solution
- Anti-Practices — "This Time Will Be Different" and the research field that runs on it
Paradoxical Interactions (PI): When rational actors consistently produce collectively irrational outcomes — not through failure, but through structure.
All are guilty. None are at fault.
Peter Senner
Thinking beyond the Tellerrand
contact@piinteract.org
www.piinteract.org