
/Some/ people bullshit themselves by stating the plausible; others check their hypotheses.

The difference is total, in humans and automated processes alike.


Everyone, every last one of us, does this every single day, all day long. Only occasionally do we deviate to check ourselves, and even then it's often to save face.

Daniel Kahneman was awarded a Nobel Prize for related research.

If you think it doesn't apply to you, you're definitely wrong.


> occasionally

Properly educated people do it regularly, not occasionally. You are describing a particular set of people; it does not cover everyone.

Some people will output a pre-given answer; others check.


How are you going to check your hypotheses for why you preferred that jacket to that other jacket?

Do not lose the original point: some systems aim to sound plausible, while others aim to tell the truth. Some systems, when asked "where have you been?", will reply "at the baker's" because it makes a nice narrative in their novel-writing rewriting of reality; others will check their memory and say "at the butcher's", where they have actually been.

When people invent explicit reasons for why they turned left or right, those reasons remain hypotheses. The clumsy promote those hypotheses to beliefs. The apt keep the spontaneous ideas as hypotheses until they are able to assess them.
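
A minimal sketch of that contrast, as a toy in Python (all names are hypothetical, not any real system): the "plausible" responder answers from a narrative prior, while the "grounded" one checks a memory store before speaking.

    # Toy contrast: answer from a narrative prior vs. check memory first.
    MEMORY = {"last_location": "the butcher's"}               # what actually happened
    NARRATIVE_PRIOR = {"where have you been": "the baker's"}  # what sounds nice

    def plausible_answer(question: str) -> str:
        # No lookup against reality: return whatever fits the story.
        return NARRATIVE_PRIOR.get(question, "somewhere pleasant")

    def grounded_answer(question: str) -> str:
        # Consult recorded state; decline to confabulate when memory is missing.
        if question == "where have you been":
            return MEMORY.get("last_location", "I don't know")
        return "I don't know"

    print(plausible_answer("where have you been"))  # the baker's
    print(grounded_answer("where have you been"))   # the butcher's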


Is that example representative of the LLM tasks for which we seek explainability?

Are we holding LLMs to a higher standard than people?

Ideally, yes: LLMs are tools that we expect to work; people are inherently fallible and (even unintentionally) deceptive. LLMs being human-like in this specific way is not desirable.

Then I think you'll be very disappointed. LLMs aren't in the same category as calculators, for example.

I have no illusions about LLMs; I have been working with them since the original BERT, always with these same issues and more. I'm just stating what would be needed in the future to make them reliably useful outside of creative writing & (human-guided & checked) search.

If an LLM produces incorrect or orthogonal rhetoric without a way to reliably fix or debug it, it's just not as useful as it theoretically could be, given the data contained in its parameters.



