At zero temp there is still non-determinism: floating-point addition is not associative, so the order in which parallel hardware accumulates partial sums changes the result, and you will get varying outputs from run to run.
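For example, here's a quick Python sketch (arbitrary made-up values, but the effect is general): summing the same floats in a different order, the way a parallel reduction might, gives a slightly different total.

    import random

    # The same values summed in two different orders.
    random.seed(0)
    values = [random.uniform(-1e6, 1e6) for _ in range(100_000)]
    shuffled = random.sample(values, len(values))

    a = sum(values)
    b = sum(shuffled)

    print(a == b)  # usually False
    print(a - b)   # a tiny but nonzero difference

A GPU effectively does the shuffled version: the order in which partial sums meet depends on scheduling, so the rounding error differs from run to run.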
(Disclaimer: I know literally nothing about LLMs.) Wouldn't there still be issues of sensitivity, though? Like, wouldn't you still have to ensure that the wording of your commands stays exactly the same every time? And with models that operate on less discrete input (e.g. ChatGPT's new "Advanced Voice Mode", which works on audio directly), this seems even harder.
Not really, not in practice. The order of execution is non-deterministic when running on a cluster, on a GPU, or on more than one CPU core, and rounding errors propagate differently on each run.
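And at temperature zero those rounding differences can still flip the output: greedy decoding takes the argmax over the logits, so two near-tied logits can swap winners depending on how the error fell, and the divergence then compounds token by token. A toy sketch with made-up logit values:

    # Two runs whose logits differ only by accumulated rounding error.
    logits_run_1 = [2.3000001, 2.3000000, 1.1]
    logits_run_2 = [2.3000000, 2.3000001, 1.1]

    pick_1 = max(range(3), key=lambda i: logits_run_1[i])
    pick_2 = max(range(3), key=lambda i: logits_run_2[i])

    print(pick_1, pick_2)  # 0 vs 1: a different token, and everything after diverges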