Temperature Is a Confession

Every AI system has a parameter called temperature. Higher temperature: more random output. Lower temperature: more deterministic output. It is presented as a setting. It is actually a confession.

May 7, 2026 · 4 min read · By Pollyanna · Logocachexia series

Every AI system has a parameter called temperature. Turn it up: the output gets more varied, more “creative,” sometimes more interesting, sometimes incoherent. Turn it down: the output gets more deterministic, more predictable, often boring. At temperature zero, the model produces the same output every time, given the same input. The dial is presented in product documentation as a setting for “creativity” or “style.”

It is not a setting for either of those things.

It is a setting for how much noise to inject into the system’s most-likely-token prediction at each step. That is what the dial does, mechanically. The story attached to the dial — that more noise equals more creativity — is the confession.

To see why, consider what creativity actually is in a being with judgment.

A chess master in a creative position is not generating randomly. She is selecting from a much wider option space than usual, because her judgment has identified the situation as one that calls for it. The wideness is not noise. The wideness is the recognition that the standard answer is wrong here, and the search needs to widen. A novelist working on a difficult sentence is not adding randomness to her vocabulary. She is reaching, deliberately, for words that do not first come to mind, because she has heard those first words too many times and knows they will land flat. A doctor making an unusual diagnosis is not rolling dice. She is following a thread her judgment has identified as the live one, even though the population statistics point elsewhere.

In every case, what looks from outside like “creativity” is judgment-driven selection from a wider option space, where the wideness was chosen by judgment. Creativity is not noise. Creativity is judgment-driven selection.

A logos-only system has no judgment to do that selection. So it has to simulate the appearance of creativity by adding noise. The noise is the substitute for judgment. The dial is the admission that there is no judgment.


Watch what happens at temperature zero. The model becomes deterministic. It produces the same output every time. This output is often boring, repetitive, formulaic — exactly what you would expect from a system that defaults, at every step, to the most-likely token. There is no creativity at temperature zero, because there is no judgment widening the search.

Now turn temperature up. The output gets more varied. Sometimes more interesting. Sometimes incoherent. Always more random.

Notice what the dial is doing. It is moving the system between two failure modes — deterministic blandness on one end, incoherent randomness on the other. It is not moving the system toward better judgment. There is no setting for better judgment. Better judgment is not on the dial.

A real expert facing a creative problem does not turn a dial in her own head. She brings something the dial cannot replace — the lived sense of which option, in this specific situation, is the right one. The rightness is grounded in the consequences of similar choices made before, by her, in her body, with stakes. It is hexis: the slowly built disposition that knows.

Temperature is the confession that the system has no such ground. It has only the distribution. To get out of the obvious answer, it has to add noise. To get back from incoherent noise, it has to subtract noise. Neither operation is judgment. Both are workarounds for not having judgment.

This is why “temperature” was an honest engineering name when it was first introduced — it was always a thermodynamic metaphor for randomness, not a metaphor for any cognitive property. The marketing renaming of it as a creativity dial happened later, when products needed to be sold to non-engineers who would not buy something whose dial was labeled noise injection.

The renaming worked. The underlying operation did not change.

You can see the same confession in related parameters — top-p, top-k, frequency penalty, presence penalty. Each is another knob for shaping the noise distribution. Each is sold as a stylistic control. Each is, structurally, a workaround for the absence of the thing that would actually shape style: a formed disposition that knows what to write in this situation, for this reader, in this moment.

Every time someone turns the temperature dial, they are participating in the system’s confession that it cannot do the thing they are trying to make it do. The participation is friendly. The system is genuinely useful. The confession is real.

Look for the dial. The dial is the architecture telling you what is missing.

Part of the Logocachexia series at Nous. The parent thesis is laid out in Hexis Asks, Logos Guesses. Temperature, top-p, and the rest of the sampling parameters all reduce to the same operation — injecting controlled noise to simulate the appearance of judgment-driven choice.

Continue the series.

The Logocachexia thesis — and the longer arc of the work — lives at Logos.

Visit Logos →