Words in Superposition

What just happened

Your brain does this a thousand times before breakfast

Every ambiguous word arrives in a kind of suspension — not quite one thing, not quite the other. The context collapses it. By the time you reach the end of the sentence, you've already forgotten that the beginning was unresolved.

Fluency may be less about knowing words than collapsing them faster than you notice.

The word bark is an obvious case because both readings are concrete. But the same thing happens with subtler ambiguity: light (weight? illumination? color?), run (move fast? manage? stockings?), interest (financial? attention? concern?). Most of the time you don't even register that a choice was made.

There is a branch of linguistics that tries to model this precisely — to say not just "context helps" but exactly how much each context word shifts the probability, and what it would mean for the uncertainty to hit zero. The tool it reaches for looks a lot like physics.

Where Finnish comes in

A language that makes the pieces visible

In English, the structure is mostly hidden. Words look like units. You don't see the seams. Finnish is different: it is agglutinative, which means it stacks meaning-pieces onto a root, in plain sight, one after another.

Where English says in the house — three separate words — Finnish says talossa. One word. Two pieces: talo (house) + -ssa (in). Where English says from my school, Finnish says koulustani. Three pieces, one word.

Finnish doesn't hide the structure. It wears it on the outside.

This creates the superposition problem in a different form. The word kuussa can be parsed two ways: kuu (moon) + -ssa, meaning "in the moon," or kuusi (spruce tree) + -ssa, meaning "in the spruce." Same sound. Different pieces. The same collapse-by-context that resolved bark now has to find the right root.

And because the pieces are explicit, we can assign them numbers. Each morpheme gets a prime. The word's number is the product of its pieces' primes. The number is the word's fingerprint — and the fingerprint tells you exactly which pieces it was built from.

Two kinds of knowing

Fast and rough, then slow and exact

The wave-function collapse — context narrowing the possibilities — is fast and approximate. It works in parallel, across the whole sentence, before you've consciously processed any of it. By the time you reach the verb, the noun has already started resolving.

The prime fingerprint is slow and exact. Once only one reading remains, the number tells you precisely what it is. kuussa-as-moon is 187. kuussa-as-spruce is 221. Different numbers. Unforgeable. The factorization is unique.

These can work as two layers of the same process rather than competing models. The fast pass filters. The slow pass confirms. Something like both seems necessary — one without the other tends to be either too blunt or too slow to be useful in real conversation.

Understanding is what happens between the wave and the fingerprint.

Puns live in the gap. So does poetry. A skilled writer can hold a word in both states on purpose — keep the entropy high, let both readings coexist for a beat, then resolve in a direction the reader didn't expect. Ambiguity, held on purpose, becomes a resource rather than a failure.

Under the hood

One loop, any language

The engine underneath all of this is surprisingly simple. It comes from a theorem prover called Otter, built in the 1990s to find proofs in formal logic. Its main loop has three steps: pick something from the queue, combine it with what you already know, put the results back in the queue. Repeat.

The loop is neutral about what it's combining. Swap in Finnish morpheme rules and it builds words. Swap in logical axioms and it builds proofs. The combination rules live on the pieces rather than in the engine — which means you can teach it a new language by giving it new pieces with new rules, without touching the loop itself.

This matters for rare and endangered languages, which don't have the kind of data that modern AI usually needs to learn from. If the grammar can be expressed as combinatorial rules on morphemes, the engine can work from a small dictionary. The grammar encodes what the data can't supply.

The pieces were always there

Finnish makes visible something English tends to hide: that words are constructions. Meaning arrives in pieces, stacks into structure, gets a number, collapses under pressure from context. The process may be similar across languages — the seams just show more in some places than others.

In Finnish, that visibility is a window worth looking through.

Words in Superposition

Your brain does this a thousand times before breakfast

A language that makes the pieces visible

Fast and rough, then slow and exact

One loop, any language

The pieces were always there

Entropy as a measure of uncertainty

Agglutinative languages

Why prime numbers?

Unique factorization

The Otter theorem prover

Languages that need different approaches